Metadata Snapshot - Microservice hosted in Collibra Hosted Infrastructure (CHI) (November 2023 to October 2024)
Metadata Snapshot is a multi-tenant microservice that provides the ability to store and retrieve snapshots (hashes) of a metadata source. It consists of an API layer and a database:
* Metadata Snapshot API: REST API that exposes methods to help synchronize metadata with the source system.
* Metadata Snapshot Store: PostgreSQL database provisioned by Crossplane, hosted on AWS/GCP, used to store snapshots containing hashes of metadata sources.
Responsibilities
- Authored the OpenAPI specification for the Metadata Snapshot API, configured and generated source code from API specs using Gradle, and implemented InputStreamResource and StreamingResponse to stream up to 1 billion entity hashes, enabling efficient and immediate data transfer between services.
- Independently developed core functionalities of the service using Java 21, Spring Boot, Spring Data JDBC, JobRunr Job Scheduler, and PostgreSQL, incorporating both text and binary encoding to optimize data storage and retrieval. The binary format improved transfer efficiency, reducing payload size by 55% compared to text encoding.
- Implemented scheduled and configurable asynchronous jobs for cleanup and verification using JobRunr.
- Managed database schema evolution using Spring Data JDBC and Liquibase, ensuring reliable and consistent schema migrations across environments.
- Configured Helm Charts to deploy the service in CHI, integrated with external microservices for session authentication, set up Crossplane to provision PostgreSQL databases, and integrated with ArgoCD for scalable and manageable cloud deployments. Also utilized Docker Desktop and Kubernetes for local development and testing.
- Designed and configured Jenkins pipelines for CI workflows, incorporating JUnit 5 for unit/integration testing and Gatling for performance testing.
- Conducted large-scale volume testing on AWS and GCP, validating service performance and resilience under extreme data loads, and fine-tuning configurations to optimize throughput and latency.
- Enhanced observability using OpenTelemetry and Micrometer, improving service monitoring and diagnostics to enable proactive issue resolution.
- Contributed to service integration by implementing the consumption of the Metadata Snapshot service for an understated partner team, supporting their successful and timely delivery of a key product component.
- Managed sprint planning and demoed progress of the service during sprint reviews; also presented a technical overview of the Metadata Snapshot service during a company-wide engineering knowledge-sharing session.
Asset Import & Export, Core Platform (February 2021 to October 2023)
The Import API allows to efficiently create or edit data in bulk in Collibra Data Intelligence Cloud.
- Participated in Design discussions, Backlog refinements, Estimations and Sprint Reviews.
- Developed feature requests for Import module in Java 17, REST API, Spring Boot, Hibernate, Postgres, Jenkins, Docker, Kubernetes, Datadog tech stack.
- Developed a tool to automate testing of Knowledge Graph API (GraphQL) queries by generating equivalent Output Module (Collibra's proprietary query engine) queries and comparing the result of execution of both queries.
- Study and Analysis of existing system (Dataloader) and summarizing its features, benefits, limitations, and proposal of features to implement in the Import API.
- Coordinated with SRE team and implemented structured logs using OpenTelemetry and created Grafana Dashboards to track Import API metrics
- Developed unit tests, integration tests using Junit 5, do end-to-end testing of API using Postman, Altair GraphQL, UI testing; Also supported QA in their testing.
- Triaged L3 bugs and provided resolution.
- Actively helped triage issues, propose solutions, and respond to questions posted in public slack channel regarding Import API
- Updated, Reviewed and Provided feedback for documentation published to customers.
- Participated in customer meetings to answer any questions Customers might have in moving to Import API V2.
- Liaison with a cross functional team (Integrations) to answer questions they had regarding using Import API and to update them of any new changes to the Import API.
- Peer Review of code implemented by team members.
- Mentoring and helping team members.
- Been DCG Guardian to monitor the Jenkins pipelines of Collibra’s infrastructure, actively address failures and coordinate with various teams to keep them healthy.
- Volunteered to be a bootcamp coordinator to help onboard new Collibra engineers.
- Conducted technical interviews and assessments for potential candidates applying for backend software engineer roles in Collibra and evaluated their technical skill and cultural fit with the team.
Scrum Master (January 2022 to May 2023)
- Conducted Daily Scrum and Sprint Retrospectives.
- Formulated and finalized ‘Definition of Ready’ and ‘Definition of Done’ for the team.
- Took the role of Demo Master to present our team’s sprint report, team’s accomplishments in the past sprint and the goal for the upcoming sprint.