Programming & Scripting: Python (Pandas, NumPy, Scikit-learn), R, SQL Data Engineering & Pipelines: ETL Processes, Data Warehousing, Data Integration, Data Modeling, Workflow Automation, API Data Extraction, Airflow (familiar)
Databases & Storage: MySQL, PostgreSQL, SQL Server, NoSQL (MongoDB – exposure), Data Lakes & Warehouses (Snowflake, Redshift, BigQuery – exposure)
Big Data & Processing (exposure): Apache Spark, Hadoop, Kafka
Cloud Platforms: AWS (S3, Glue, Redshift), Azure (Data Factory, Synapse), Google Cloud (BigQuery, Dataflow)
Data Analysis & Statistics: Exploratory Data Analysis (EDA), Regression Analysis, Hypothesis Testing, A/B Testing, Trend & Gap Analysis
Visualization & Reporting: Power BI, Tableau, Excel (PivotTables, VLOOKUP, Macros)
Data Governance & Quality: Data Validation, Data Cleaning, Error Handling, Compliance Standards, Metadata Documentation
Methodologies & Tools: Agile, Git, Jira, Microsoft Visio