Summary
Overview
Work History
Education
Skills
Timeline
Generic

Basavaraja Manasa

Denton,TX

Summary

Experienced professional with over 3+ years of experience in designing, developing, testing, implementing, and maintaining Data Warehousing Systems and Business Intelligence applications across diverse platforms and industries. Proficient in working with GCP services such as BigQuery, Cloud Pub/Sub, Dataflow, Cloud Composer, Cloud Storage, Cloud Functions, Cloud Bigtable, Cloud Dataproc, and IAM roles and policies. Skilled in working with AWS services including S3, Redshift, EMR, Glue, Data Pipeline, Step Functions, CloudWatch, SNS, and CloudFormation. Expertise in building ETL and ELT data pipelines using Databricks and AWS Glue. Hands-on experience in real-time data streaming solutions using Apache Spark. Strong scripting skills in Python, Linux, and UNIX Shell. Proficient in data migration, transformation, and integration. Knowledgeable in data modeling concepts such as Star-Schema Modeling, Snowflake Schema Modeling, and Fact and Dimension tables. Adheres to best practices for Data Warehousing, Data Lake, and Lake House methodologies. Familiar with Metadata and repositories within a disciplined lifecycle methodology. Adaptable team player with the ability to tackle Big Data challenges in both on-premises and cloud environments. Experienced in working with Agile and Scrum methodologies, proficient in Agile methodologies, and skilled in using Jira for managing sprints and issue tracking.

Overview

5
5
years of professional experience

Work History

Data Engineer

Cardinal Health
01.2023 - Current


  • Contributed to successful product launches by collaborating closely with project managers, developers, testers, and other stakeholders throughout the development process.
  • Successfully led a high-performing team of four data engineers in profiling, analyzing, transforming, and deploying 12 complex data tables within an aggressive six-week timeline, ensuring exceptional quality and efficiency.
  • Achieved a 52.20% reduction in monthly costs by switching a Transaction Table Stored Procedure to Scheduled Query for Real-time updates, cutting expenses from $350 to $125.
  • Engineered and implemented a design that reduced latency for reading L2 views from Tableau, decreasing from 180 seconds to just a few seconds.
  • Leveraged Secoda's advanced data lineage tracking capabilities to visualize data flows and transformations, enhancing regulatory compliance and reinforcing robust data governance practices across the organization.
  • Utilized Infrastructure as Code (IaC) tools like Terraform and set up GIT Integration of Google Cloud Platform pipelines, implementing workloads via CI/CD to UAT and Production environments, enhancing deployment efficiency.
  • Expertise in Data Modelling, Designing, and Data Analysis with Conceptual, Logical, and Physical Modelling for Online Transaction Processing (OLTP), Online Analytical Processing (OLAP), and Data Warehousing.
  • Utilized Airflow for scheduling and managing complex data workflows within GCP, improving data processing efficiency.
  • Collaborated with cross-functional teams to analyze and integrate data from multiple sources, achieving a unified view of customer data and a 30% reduction in data integration time. Implemented a disaster recovery plan for critical GCP services to ensure business continuity.
  • Delivered 12 story points within a condensed 2-week sprint while contributing as part of the production support team, demonstrating urgency in project delivery.


Data Analyst/Engineer

Caltech Innovations Pvt. Ltd
04.2019 - 12.2020
  • Monitored and evaluated engineering performance to recommend improvements.
  • Conducted research to identify and evaluate new technologies and concepts.
  • Orchestrated and automated the development of Python scripts to streamline data migration from on-premises to AWS S3, optimizing transfer processes for enhanced efficiency.
  • Used Airflow to manage and execute complex data pipelines on AWS EMR, overseeing data read and write operations on S3.
  • Implemented the basics of ETL pipeline development in Databricks, including data extraction, transformation, and loading to enhance data processing skills and improve workflow efficiency.
  • Supported senior engineers in optimizing data models, resulting in a 10% increase in database performance and faster query response times.
  • Collaborated with the team on integrating RESTful APIs into data pipelines, gaining hands-on experience in python frameworks like FLASK, data flow management and team coordination.
  • Designed and Developed ETL Processes in AWS Glue to migrate data from external sources like S3, ORC/Parquet/Text Files into AWS Redshift.
  • Developed and implemented ETL processes to load and transform data into Amazon Redshift, leveraging AWS Glue and Python scripting to automate and optimize data integration.
  • Integrated multiple sources of disparate data into cohesive datasets using ETL processes, improving overall analytic capabilities.
  • Developed required policies and procedures that reflected actual goals, tasks and workflows, while meeting all regulatory compliance requirements.


Education

Master of Science - Computer Science And Engineering

University of North Texas
Denton, TX
12.2022

Bachelor of Science - Electronics And Communications Engineering

Visvesvaraya Technological University
Bangalore, India
06.2019

Skills

  • TECHNICAL SKILLS
  • Languages: Python, C, SQL
  • Version Control: Git, GitHub
  • Databases: SQL Server, PostgresSQL, NoSQL, MongoDB
  • Cloud: GCP(Cloud Storage, BigQuery, Pub/sub, Cloud Function, DataStream), AWS (S3, EC2, Redshift, Lambda, Glue, Snowflake, Kinesis)
  • Python Modules: NumPy, Pandas, TensorFlow, scikit-learn
  • IDE Tools: VS Code, IntelliJ, Jupyter

Timeline

Data Engineer

Cardinal Health
01.2023 - Current

Data Analyst/Engineer

Caltech Innovations Pvt. Ltd
04.2019 - 12.2020

Master of Science - Computer Science And Engineering

University of North Texas

Bachelor of Science - Electronics And Communications Engineering

Visvesvaraya Technological University
Basavaraja Manasa