Nearly 7 years of professional experience in IT data analytics projects, specializing in designing and developing on-premises ETL solutions and modernizing them into AWS cloud-based architectures. Proficient in leveraging AWS native services such as AWS Glue for ETL workflows, Amazon Redshift for data warehousing, and Amazon S3 for scalable storage. Well-versed in orchestration tools like AWS Step Functions and AWS Data Pipeline for seamless workflow automation. Extensive experience in building and optimizing data pipelines using Pentaho to facilitate ETL processes, migrating data from MariaDB and flat files to AWS Cloud Services. Skilled in implementing serverless ETL solutions using AWS Lambda and processing large-scale datasets with AWS EMR (Elastic MapReduce). Hands-on experience with AWS Database Migration Service (DMS) and Schema Conversion Tool (SCT) for seamless data migration to AWS environments. Knowledgeable in AWS Batch for efficient batch processing of large datasets. Designed and built real-time data streaming solutions utilizing AWS Kinesis and developed data warehousing solutions on AWS using Amazon Redshift. Optimized Amazon S3 storage strategies for cost-effectiveness and improved data retrieval performance. Strong understanding of data modeling principles, including Fact and Dimension tables, as well as Snowflake and Star Schema modeling techniques. Extensive experience in data warehousing and data mart development using distributed SQL technologies such as Hive SQL, MySQL, and MS SQL. In-depth experience working with AWS services like AWS Lambda, AWS Glue, and AWS SDKs to develop and optimize data solutions. Expertise in writing SQL-based stored procedures, functions, and complex queries for database design and performance optimization. Comprehensive experience in the entire software development lifecycle (SDLC), from requirements analysis and design to development, testing, and deployment. Documented best practices, optimization strategies, and performance tuning methodologies for Snowflake-based data pipelines. Enhanced collaboration across teams, improving knowledge sharing and driving operational efficiencies in data engineering workflows. Improved ETL performance by 50% through optimized data transformations and parallel processing techniques using PySpark. Leveraged PySpark for advanced analytics applications, including machine learning model training and predictive analytics within Snowflake, enabling actionable insights. Strong understanding of Search Engine Marketing (SEM), Search Engine Optimization (SEO), product ads, and keyword analysis. Experienced in data preparation, modeling, and visualization using Power BI, along with expertise in developing interactive dashboards and reports using Tableau. Excellent communication and interpersonal skills with a strong ability to quickly adapt and learn new technologies.