Accomplished solution architect skilled in AWS cloud services, data warehousing, and analytics with a demonstrated history in healthcare, banking, and payments. Adept in full-stack development, data governance, and project execution with proficiency in Python, Apache Spark, and ETL processes. Strategically collaborates with stakeholders to innovate and enhance data solution frameworks.
Overview
8
8
years of professional experience
Work History
Architect
Altimetrik Corp
05.2023 - Current
ELT (Extract, Load and Transform) using big data stack. Every one from top down is hands-on.
Work with internal business groups on implementation opportunities, challenges, and gathers requirements of various applications.
Analyze requirements and provide recommendations to address and resolve business issues for a specific business group.
Provide application software development services or technical support for backed features and components.
Scaling the backend architecture and codebase to support growth and improvise application efficiencies.
Convert requirements into robust and performance agnostic design and implement by developing optimized and resource efficient ELT pipelines with code readability and reviewing skills.
Expertise in Cloudera Data Platform 7.x, Apache Spark 3.x, Apache Airflow, Hive, Impala, SQOOP, Informatica, Python.
Implementing CI/CD with latest DevOps technologies and best practices.
Worked on data loading on Snowflake using Apache Spark.
Experience in trouble shooting and performance optimizations in Airflow, Cloudera Hadoop Distribution components.
Architect
Altimetrik India Pvt Ltd
06.2017 - 04.2023
Architect and implement analytics platforms, ensuring alignment with business needs and technical specifications.
Design and develop logical and physical infrastructures, incorporating AWS services and data modeling techniques.
Coordinate with stakeholders for architectural decisions, data governance, and service selection.
Provide technical support and troubleshooting for application issues, with a focus on optimizing user experience.
Spearhead migration strategies, including data onboarding and integration across different platforms.
Execute graph-based data analytics, ingestion, and extraction to enable advanced decision-making capabilities.
Develop and maintain CI/CD pipelines, automating data transfer and deployment processes.
Construct analytical tools and dashboards to facilitate data analysis and visualization for various business use cases.
Administer cloud-based environments, including EC2 instance management, data migrations, and server monitoring.
Associate
Cognizant Technology Solutions, CTS
09.2016 - 06.2017
Developed different layer for the hive table data and storing the data as Parquet file.
Implemented business logic through Apache spark sql on trillions of records in hive table.
Involved in converting Hive queries into Spark transformations using Spark RDDs, Scala.
Optimizing of existing algorithms in Hadoop using Spark Context, Spark-SQL, DataFrames.
Developed a Python Program to convert CSV file to XML file.
Hands on experience in implementing Partitions, bucketing on Hive tables and designed both Managed
and External tables in Hive to optimize performance.
Worked on Data Validation between GMR data with external data such as Duns and Bradstreet, Acquirer
merchant master file and Group on data.
Implementing Hive queries to pull data from transaction tables based on Ad hoc request.
Performed similarity analysis on matching similar merchant address using the combination of levenshtein,
jaro winkler and N gram Similarity technique.
Senior Software Engineer
Alchemy Techsol India Pvt Ltd(Contract: Cognizant Technology Solutions)
10.2015 - 09.2016
Developed different layer for the hive table data and storing the data as Parquet file.
Implemented business logic through Apache spark sql on trillions of records in hive table.
Involved in converting Hive queries into Spark transformations using Spark RDDs, Scala.
Optimizing of existing algorithms in Hadoop using Spark Context, Spark-SQL, .
Developed a Python Program to convert CSV file to XML file.
Hands on experience in implementing Partitions, bucketing on Hive tables and designed both Managed and External tables in Hive to optimize performance.
Worked on Data Validation between GMR data with external data such as Duns and Bradstreet, Acquirer merchant master file and Group on data.
Implementing Hive queries to pull data from transaction tables based on Ad hoc request.
Performed similarity analysis on matching similar merchant address using the combination of , jaro winkler and N gram Similarity technique.
Education
Bachelor of Engineering - Electronics And Communication Engineering