
Experienced data engineering professional with a proven track record of developing and managing efficient data systems. Demonstrated ability to deliver impactful solutions through collaborative and results-driven approaches. Recognized for expertise in data warehousing and ETL processes, as well as the flexibility to adapt to evolving project requirements. Dedicated to delivering impactful results, I bring extensive experience in designing and implementing scalable data architectures, optimizing data pipelines, and leveraging big data technologies. Collaborative team leadership, adaptability, and a results-driven approach have been key to my success. Proficient in SQL, Python, Spark, and cloud platforms, I possess a keen ability to align technical solutions with business objectives.
Big Data Technologies: Hadoop, Spark, PySpark, Hive, Kafka, Flume, Sqoop, Oozie, Zookeeper, MapReduce, Cloudera Manager
Cloud Platforms: AWS, Azure, GCP
AWS: EMR, EC2, S3, Redshift, Athena, Lambda, Step Functions, DynamoDB, CloudWatch, CloudTrail, SNS, SQS, Kinesis
Azure: HDInsight, Databricks, Data Lake, Cosmos DB, Data Factory, Azure Functions, Synapse Analytics, Event Hub, Azure Monitor
GCP: BigQuery, Dataflow, Dataproc, Pub/Sub, Compute Engine, Cloud Storage, Kubernetes Engine (GKE), Cloud Functions
Databases: SQL, Oracle, MySQL, PostgreSQL, Teradata, DB2
NoSQL Databases: HBase, Cassandra, Redis, MongoDB, DynamoDB, Cosmos DB
ETL Tools: Informatica, DataStage, SSIS, Pentaho
Programming & Scripting: Python, Scala, PySpark, R, SQL, PowerShell, Shell Scripting
IDE: PyCharm, Visual Studio Code, IntelliJ, SSMS, Data Studio
Monitoring & Reporting: Tableau, Power BI, CloudWatch, Azure Monitor
Version Control & CI/CD: Git, GitHub, GitLab, Jenkins, Bamboo, Maven, SVN
Operating Systems: Linux, Unix, Windows, macOS