Innovative and detail-oriented data engineer with over 4+ years of experience designing and optimizing scalable data pipelines across AWS, Azure, and GCP. Proficient in Python, SQL, Scala, Spark, and Kafka, with expertise in big data processing, ETL, and real-time streaming. Skilled in cloud data warehousing (Snowflake, Redshift, BigQuery) and workflow automation (Apache Airflow, CI/CD). Adept at data modeling, security, and performance optimization, delivering high-impact and efficient data solutions that drive business intelligence and innovation. Passionate about solving complex data challenges and enabling data-driven decision-making.
Professional Experience
Environment: Azure Data Lake, Synapse Analytics, Databricks, SQL, MongoDB, Cassandra, PostgreSQL, MySQL, PySpark, Hive, MapReduce, Apache Kafka, Flume, Zookeeper, Impala, Sqoop, Apache Airflow, Git, Maven, Azure DevOps, Hadoop.
Professional Experience
Environment: AWS S3, Redshift, EMR, Python, PySpark, Hive, SQL, Snowflake, MySQL, PostgreSQL, MongoDB, Cassandra, HBase, Kafka, Flume, Zookeeper, Google BigQuery, Hadoop, MapReduce, Impala, AWS CloudFormation, EC2, Git, Maven, AWS CodePipeline, Apache Airflow.
Professional Experience
Environment: Google Cloud Dataflow, ETL, Google Cloud Storage (GCS), BigQuery, Google Cloud Pub/Sub, Apache Kafka, Apache Spark, Hadoop, Presto, Terraform, CI/CD, Jenkins, SQL, Python, Java, Apache Beam.
Professional Experience
Environment: Apache Kafka, Apache Spark, ETL, AWS S3, SQL, NoSQL databases, Amazon Redshift, AWS Kinesis, Apache Airflow, AWS Glue, Terraform, AWS CloudFormation, Tableau, Power BI, Looker, TensorFlow.
Hadoop
undefined