Summary
Overview
Work History
Education
Skills
Personal Projects
Timeline
Generic

Anish Dhawan

Redmond,WA

Summary

  • Highly motivated engineer and experienced in full-stack software development. Specializes in designing, implementing, and optimizing data-driven solutions and software applications to enhance operational efficiency and data accuracy in diverse industry settings.
  • Expertise in Databricks, Snowflake, DBT, Spark, and web development, with a proven track record of increasing processing efficiency and enhancing data quality and system usability.

Overview

5
5
years of professional experience

Work History

Software Engineer

T-Mobile
Bellevue, WA
08.2024 - Current
  • Transformed data workflows from on-premises systems to Azure Kubernetes Service (AKS) and Snowflake, ensuring scalability and reduced maintenance overhead
  • Designed and implemented data pipelines to integrate legacy systems with modern cloud platforms, utilizing Splunk, REST APIs, OSS nodes, and Snowflake
  • Migrated critical data processing operations from file-based dependencies to streamlined, cloud-native solutions using Python and Pandas
  • Established robust pipelines for loading data into Snowflake Bronze, Silver, and Gold tables, enabling business teams to access actionable insights
  • Optimized legacy processes for detecting and resolving bouncing issues through automated workflows with Splunk and edge nodes
  • Reduced full data ingestion times by over 25%, ensuring faster reporting and decision-making
  • Built reusable modules, including a Snowflake-Python Connector for seamless data integration and scalable Iceberg tables
  • Developed comprehensive documentation and provided support to cross-functional teams for transitioning to cloud systems
  • Delivered solutions that aligned with T-Mobile's strategic goal of leveraging cloud technologies for enhanced reliability and performance

Data Engineer

Slalom Build
Seattle, WA
01.2022 - 06.2024
  • Designed and implemented a chunking strategy to optimize data processing and retrieval for training a large language model (LLM)
  • Selected and configured a vector database to efficiently store and manage vectorized wine data, enhancing the model's performance
  • Utilized LlamaIndex to improve data indexing and querying capabilities, facilitating accurate and quick information retrieval for the virtual sommelier
  • Successfully created and implemented a generative AI solution, resulting in an interactive and knowledgeable virtual sommelier for wine recommendations
  • Spearheaded the migration of Workday's critical workflows from Alteryx to Redshift SQL, significantly boosting process speed and reducing overall workflow duration
  • Identified and resolved critical bugs and inefficiencies in the existing workflow, enhancing system reliability and performance
  • Led the unit testing efforts to ensure the migrated workflows' integrity and accuracy, establishing a solid foundation for operational excellence
  • Authored detailed documentation of the migration process, encapsulating each step and decision for Workday, which served as a blueprint for future migrations and facilitated knowledge sharing across teams
  • Led the design and deployment of a Snowflake-based data platform
  • Achieved a 30% boost in data processing efficiency
  • Orchestrated data ingestion from over five distinct sources using Matillion
  • Enhanced data quality and accuracy significantly
  • Utilized DBT for advanced data transformations, improving data retrieval speeds by 25%
  • Ensured HIPAA compliance, reinforcing data security and privacy standards
  • Constructed a data platform that enhanced data enrichment and harmonization efficiency
  • Implemented Apache Spark and Hive pipelines for large dataset management
  • Accelerated data processing speeds by over 35%
  • Engineered a data warehousing solution that consolidated multiple data sources into a streamlined pipeline, improving data retrieval times by 50% and supporting a 30% increase in analytical productivity using Spark, and Databricks

Data Engineer Intern

Dog Paws
Seattle, WA
06.2020 - 04.2021
  • Recommend innovative methods for the acquisition and analysis of data to augment the quality and efficiency of the platform's data systems
  • Continuously assess and refine data processing workflows to meet evolving community engagement objectives
  • Recommended and implemented cutting-edge methods for data acquisition and analysis, significantly improving the platform's data system quality and efficiency
  • Utilized SQL for data querying, Python for data manipulation, and Azure Data Lake for data storage, optimizing data processing workflows
  • Worked alongside internal teams and management to define business needs, integrating JIRA for project management and Slack for team communication to facilitate collaboration

Front End Developer Intern

Seattle Children's Hospital
Seattle, WA
06.2020 - 09.2020
  • Developed a 3D web-based surgical guidance system used monthly in surgeries
  • Enhanced surgical outcomes and procedure efficiency
  • Designed and implemented an improved web UI, enhancing user engagement and navigation efficiency by 40%
  • Integrated real-time communication using Sockets.io, reducing surgeon response times in critical scenarios
  • Employed Three.js for 3D visualization, aiding surgical precision

Education

Bachelor's of Science - Informatics

University of Washington
Seattle, WA
06.2021

Skills

  • Programming Languages: Java, Python, C#, SQL, No-SQL, R, JavaScript, React-JS, Git
  • Data Engineering & Analytics: Snowflake, DBT, Apache Spark, Hive, S3, AWS Redshift, Databricks, Azure Synapse, Azure Data Explorer, PostgreSQL, Azure Data Factory, Lambda, Matillion, Big Data, NLP, LlamaIndex
  • Web Development: Web UI design, real-time communication with Socketsio, 3D visualization with Threejs
  • Tools & Platforms: Microsoft Power Platform, Tableau, Machine Learning, GitHub Copilot, Azure DevOps

Personal Projects

AI-Powered Search & Analysis System,

  • Developed an AI-powered search and document processing system using LlamaIndex, Azure Cognitive Services, OpenAI embeddings, and Azure Machine Learning to extract insights, classify documents, and enable semantic search.
  • Built an NLP pipeline using Azure Cognitive Services for Named Entity Recognition (NER) and key phrase extraction, improving text analysis accuracy.
  • Trained and deployed a document classification model in Azure ML, categorizing research papers and technical documents based on topics and keywords. Implemented a high-accuracy search engine using Azure OpenAI embeddings and FAISS, reducing search latency and improving document retrieval efficiency.
  • Automated data ingestion and processing workflows in Azure Databricks optimizing large-scale data handling and reducing manual intervention.

Airbnb Listing Price Prediction, 

  • Developed a predictive model for Airbnb listing prices in Seattle using Python libraries (Scikit-learn, Statsmodels), performing comprehensive feature engineering and regression analysis (simple, full, intermediate).
  • Created Ordinary Linear Regression (OLS) models, estimating unknown parameters to improve model accuracy and reliability.

Timeline

Software Engineer

T-Mobile
08.2024 - Current

Data Engineer

Slalom Build
01.2022 - 06.2024

Data Engineer Intern

Dog Paws
06.2020 - 04.2021

Front End Developer Intern

Seattle Children's Hospital
06.2020 - 09.2020

Bachelor's of Science - Informatics

University of Washington
Anish Dhawan