
Over 10 years of IT experience working with both on-premises and cloud environments, specializing in Big Data technologies and cloud platforms. Skilled in designing and developing scalable data pipelines for migrating and processing data using Databricks and other big data tools. Extensive experience with data extraction, cleaning, processing, and pipeline creation using cloud services including: AWS: S3, Glue, EMR, Lambda, CloudFormation, EC2, Secrets Manager, Athena Azure: Azure Data Factory (ADF), Blob Storage, Azure Databricks Successfully created and migrated AWS applications across multiple AWS accounts. Developed EMR and Glue scripts for batch processing in AWS, and Python utilities for data pipeline development and automation. Engaged in product design and implementation, focusing on data cleaning, standardization, duplicate identification, and merging scenarios. Collaborated closely with clients and developers to prepare test plans and scripts ensuring high-quality software delivery. Demonstrated excellent interpersonal, analytical, and relationship-building skills with a strong process-oriented approach to meet cost, profit, service, and organizational goals. Proven leadership abilities with strong communication skills and experience motivating teams and collaborating with upper management. Hands-on expertise in production environment management, including proactive monitoring, debugging, issue mitigation, and fixes. Strong background in Hadoop ecosystem technologies, primarily with Cloudera Distribution (CDH), including development, testing, and deployment in distributed environments. Delivered comprehensive unit testing plans and documentation to ensure robust software quality.