
Strategic Data Architect with 9+ years of experience designing and scaling cloud-native platforms, big data ecosystems, and real-time streaming systems. Expert in multi-cloud (AWS, Azure, GCP), Kafka/Spark/Flink, and modern data stacks (Snowflake, Databricks, dbt, Delta Lake). Skilled in healthcare data standards (FHIR, HL7, HIPAA, GDPR), enabling secure, compliant solutions that power AI/ML and enterprise analytics. Adept at leading teams, modernizing legacy systems, and aligning data strategies with business goals, delivering cost savings, performance gains, and mission-critical insights.
Cloud Platforms & Data Services: AWS (EC2, S3, Lambda, Redshift, Kinesis, Glue), Azure (Synapse, Data Factory, Databricks, AKS), GCP (BigQuery, Dataflow, Pub/Sub, Vertex AI), Snowflake, Palantir Foundry, Multi-Cloud Deployments, Cost Optimization
Data Engineering & Orchestration: ETL/ELT (PySpark, dbt, Airflow, Talend, Dagster, NiFi), Change Data Capture (CDC), Event-Driven Design, Data Mesh & Data Fabric, Real-Time Pipelines
Data Warehousing & Storage: Snowflake, BigQuery, Redshift, Azure Synapse, Delta Lake, Apache Iceberg, Hudi, PostgreSQL, ClickHouse, MongoDB, Cassandra
Programming & Scripting: Python, SQL (T-SQL/PL-SQL), Scala, Java, Bash, YAML/JSON, REST, gRPC
Streaming & Real-Time Processing: Apache Kafka (Streams, KSQL), Apache Flink, Apache Beam, AWS Kinesis, GCP Pub/Sub, Redpanda, Materialize
Machine Learning & AI Data Engineering: MLOps, MLflow, Feature Stores (Databricks), Real-Time Scoring Pipelines, Retrieval-Augmented Generation (RAG), Vector Databases (Pinecone, FAISS), AWS SageMaker, TensorFlow, PyTorch, Vertex AI
Data Modeling & Architecture: Dimensional Modeling (Kimball), Star/Snowflake Schema, Data Vault, Common Data Model (CDM), Schema Evolution, Partitioning, High-Volume Data Processing, Fault Tolerant Design
Governance, Security & Compliance: HIPAA, GDPR, SOC 2, CCPA, PII/PHI Handling, OpenMetadata, Collibra, Alation, Unity Catalog, Metadata Management, Data Lineage, Great Expectations, dbt Tests, RBAC, RLS, Encryption, HashiCorp Vault, IAM
Observability & Monitoring: Prometheus, Grafana, Datadog, Monte Carlo, ELK Stack, Anomaly Detection, Operational Metrics, SIEM Tools
BI & Analytics: Tableau, Power BI, Looker, Apache Superset, Metabase, Semantic Modeling, Self-Service BI Enablement
DevOps & Infrastructure Automation: CI/CD (GitHub Actions, Jenkins, Azure DevOps), Terraform, Docker, Kubernetes (AKS, EKS, GKE), Helm, ArgoCD, GitOps, Cloud Run, Microservices
Leadership & Collaboration: Agile/Scrum, Cross-Functional Leadership, Stakeholder Engagement, Strategic Planning, Architecture Reviews, Mentorship, Data Contracts, JIRA, Confluence
Healthcare Data Expertise: HL7, FHIR, EHR/EMR Data, RxNorm, NCPDP, Specialty Pharmacy Data, ICD-10, CPT Codes, 340B Program, SaaS Integrations
Google Cloud – Professional Data Engineer