Analytical and solution-oriented Data Analyst/Data Engineer with over 5 years of experience designing, developing, and optimizing scalable data pipelines and analytical solutions across cloud and distributed environments, primarily on Microsoft Azure and Databricks. Proven ability to transform structured and unstructured data into actionable business insights through ETL/ELT processes, advanced data modeling, and real-time processing frameworks using PySpark, Scala, and SQL. Strong expertise in applying statistical analysis, data mining, and predictive modeling to uncover trends and support data-driven decision-making. Adept at leveraging tools like Azure Synapse, Delta Lake, Power BI, and Azure Data Factory to deliver measurable business value. Skilled in collaborating across multidisciplinary teams, with excellent communication and problem-solving abilities. Committed to continuous learning, automation, and innovation to improve performance and enable data maturity across organizations.
Data Visualization & Reporting: Power BI (DAX, Power Query/M, Semantic Model Design, Dashboard Development, Power BI Admin), SSRS, Azure Analysis Services, Tableau
Querying & Scripting: T-SQL, PL/SQL, PostgreSQL, MySQL, Oracle SQL, Power Query (M), DAX, Python, SQL for Data Warehousing
Programming Languages: Python, Java, Scala
Big Data & Distributed Processing: Apache Spark, Hadoop, Kafka, Azure Databricks
Data Integration & ETL: Azure Data Factory (ADF), SSIS, Azure Databricks, SQL Server Agent, Power BI Dataflows
Databases & Warehousing: SQL Server, Azure SQL Database, PostgreSQL, MySQL, Oracle, Snowflake, Azure Synapse Analytics
Cloud & Data Platforms: Microsoft Azure (ADLS Gen2, Azure SQL, Synapse Analytics, Azure Logic Apps, Azure Monitor, Azure Key Vault, Azure Databricks), AWS (basic exposure)
Data Modeling & Architecture: Dimensional Modeling, Star and Snowflake Schema Design, Semantic Layer Design, Stored Procedures, Views, Triggers, Data Quality Assurance, Data Mining & Segmentation Techniques
Monitoring & Support: Data Load Monitoring (including after-hours and on-call), Data Pipeline Troubleshooting, Performance Tuning, Hybrid Cloud Environment Adaptability
Productivity & Collaboration: Microsoft Excel, PowerPoint, Outlook, Teams, Zoom, Agile Work Environments
Microsoft Certification: DP-203 Azure Data Engineer Associate, DP-900 Azure Data Fundamentals.
Oracle Certification: Oracle Cloud Infrastructure 2024 Generative AI Certified Professional.
Snowflake Certification: Hands-On Essentials – Data Warehouse & Data Engineering Badges.
Databricks Certification: Academy Accreditation – Generative AI Fundamentals, Azure Databricks Platform Architect Badge, Databricks Lakehouse Fundamentals Badge.