
Senior Data Engineer with over a decade of experience building reliable, secure, and scalable data environments on AWS, Azure, and GCP. Skilled in ETL/ELT development, data modeling, streaming, governance, and automation using modern cloud tools such as Snowflake, Databricks, and dbt. Known for practical problem solving, clear documentation, and collaboration across data science, analytics, and DevOps teams. Familiar with compliance frameworks including HIPAA, SOC 2, and FedRAMP. Focused on quality, maintainability, and data-driven decision making.
Programming & Scripting: Python (NumPy, Pandas, PySpark), SQL (T-SQL, PL/pgSQL, Spark SQL), Scala, Java, R, Go, C, Bash, PowerShell, TypeScript, JavaScript, YAML, JSON
Cloud Platforms & Services: AWS (S3, Redshift, Glue, Lambda, Kinesis, SageMaker, Aurora, CloudFormation); Azure (Synapse, Data Factory, Databricks, Cosmos DB, Event Hubs, AKS, Power BI Service); GCP (BigQuery, Dataflow, Cloud Composer, Vertex AI); Snowflake
ETL / ELT & Data Orchestration: dbt, Apache Airflow, Dagster, NiFi, Talend, Informatica, SSIS, ADF, Matillion, StreamSets, Alteryx, Fivetran, Airbyte, Prefect
Data Engineering & Streaming: Apache Spark (Python & Scala), Kafka, Flink, Beam, Hadoop, Hive, Delta Lake, Presto, Trino, Databricks SQL
Modeling & Warehousing: Dimensional Modeling (Kimball, Inmon), Data Vault 2.0, Star/Snowflake Schemas, Semantic Layers, Data Marts, Lakehouse Architecture
Databases (SQL / NoSQL / Graph): PostgreSQL, MySQL, SQL Server, Oracle, MongoDB, Cassandra, Cosmos DB, DynamoDB, Redis, Elasticsearch, Neo4j, ArangoDB
Governance & Security: Collibra, Alation, Apache Atlas, Monte Carlo, Data Cataloging, Lineage, MDM, GDPR, HIPAA, SOC 2, RBAC, IAM, Encryption at Rest/In Transit
Business Intelligence & Visualization: Power BI (Advanced DAX), Tableau, Looker, Sigma, Qlik, QuickSight, Mode, Metabase, Grafana, Plotly, Matplotlib, Seaborn, Jupyter Notebooks
Machine Learning, AI & MLOps: SageMaker, MLflow, TensorFlow, Scikit-learn, Feature Stores, Model Deployment
DevOps, CI/CD & DataOps: Git, Jenkins, Terraform, Docker, Kubernetes, Azure DevOps
APIs, Integration & Event Architecture: REST APIs, GraphQL, Microservices, Event-Driven Architecture, Kafka Messaging
Knowledge Graphs & Semantic Web: Ontology Design, RDF, SPARQL, Neo4j
Domain Expertise: Healthcare Data, Financial Analytics, E-Commerce Analytics, Government Data Systems
Leadership & Professional Skills: Technical Mentorship, Agile/Scrum Delivery, Stakeholder Communication, Cross-Team Collaboration
Databricks Certified Data Engineer Professional
Microsoft Certified: Azure Data Engineer Associate (DP-203)
AWS Certified Data Engineer – Associate