Data engineer specializing in optimizing cloud data infrastructure and designing scalable data pipelines. Proficient in AWS, Google BigQuery, Python, SQL, Apache Airflow, Spark, and Kafka. Experienced in building robust ETL processes and enforcing data governance across complex systems. Collaborates with cross-functional teams to deliver data solutions that improve operational reporting and business insight.
Programming Skills:
Python, SQL, Bash, PySpark, Scala (basic), JavaScript (basic)
Databases:
PostgreSQL, MySQL, MongoDB, Amazon Redshift, Snowflake, Google BigQuery
Web Technologies & Libraries:
Flask (API development), REST APIs, Pandas, NumPy, SQLAlchemy, Jupyter Notebooks, HTML/CSS (basic)
Cloud Platforms & Services:
Amazon Web Services (AWS): S3, RDS, Redshift, Lambda, Glue
Google Cloud Platform (GCP): BigQuery
Others: Azure (basic), Snowflake
Company: DataNest Technologies Pvt. Ltd. – Hyderabad, India
Tech Stack: AWS S3, AWS Glue, Redshift, Python, Airflow, PostgreSQL
Company: WVU Medicine – Morgantown, WV
Tech Stack: Python, AWS Lambda, Redshift, Snowflake, Airflow, Great Expectations
Tech Stack: Kafka, Spark Streaming, MongoDB, Grafana
Tech Stack: AWS RDS, Redshift, Python, Pandas, Power BI