Highly motivated engineer and experienced in full-stack software development. Specializes in designing, implementing, and optimizing data-driven solutions and software applications to enhance operational efficiency and data accuracy in diverse industry settings.
Expertise in Databricks, Snowflake, DBT, Spark, and web development, with a proven track record of increasing processing efficiency and enhancing data quality and system usability.
Overview
5
5
years of professional experience
Work History
Software Engineer
T-Mobile
Bellevue, WA
08.2024 - Current
Transformed data workflows from on-premises systems to Azure Kubernetes Service (AKS) and Snowflake, ensuring scalability and reduced maintenance overhead
Designed and implemented data pipelines to integrate legacy systems with modern cloud platforms, utilizing Splunk, REST APIs, OSS nodes, and Snowflake
Migrated critical data processing operations from file-based dependencies to streamlined, cloud-native solutions using Python and Pandas
Established robust pipelines for loading data into Snowflake Bronze, Silver, and Gold tables, enabling business teams to access actionable insights
Optimized legacy processes for detecting and resolving bouncing issues through automated workflows with Splunk and edge nodes
Reduced full data ingestion times by over 25%, ensuring faster reporting and decision-making
Built reusable modules, including a Snowflake-Python Connector for seamless data integration and scalable Iceberg tables
Developed comprehensive documentation and provided support to cross-functional teams for transitioning to cloud systems
Delivered solutions that aligned with T-Mobile's strategic goal of leveraging cloud technologies for enhanced reliability and performance
Data Engineer
Slalom Build
Seattle, WA
01.2022 - 06.2024
Designed and implemented a chunking strategy to optimize data processing and retrieval for training a large language model (LLM)
Selected and configured a vector database to efficiently store and manage vectorized wine data, enhancing the model's performance
Utilized LlamaIndex to improve data indexing and querying capabilities, facilitating accurate and quick information retrieval for the virtual sommelier
Successfully created and implemented a generative AI solution, resulting in an interactive and knowledgeable virtual sommelier for wine recommendations
Spearheaded the migration of Workday's critical workflows from Alteryx to Redshift SQL, significantly boosting process speed and reducing overall workflow duration
Identified and resolved critical bugs and inefficiencies in the existing workflow, enhancing system reliability and performance
Led the unit testing efforts to ensure the migrated workflows' integrity and accuracy, establishing a solid foundation for operational excellence
Authored detailed documentation of the migration process, encapsulating each step and decision for Workday, which served as a blueprint for future migrations and facilitated knowledge sharing across teams
Led the design and deployment of a Snowflake-based data platform
Achieved a 30% boost in data processing efficiency
Orchestrated data ingestion from over five distinct sources using Matillion
Enhanced data quality and accuracy significantly
Utilized DBT for advanced data transformations, improving data retrieval speeds by 25%
Ensured HIPAA compliance, reinforcing data security and privacy standards
Constructed a data platform that enhanced data enrichment and harmonization efficiency
Implemented Apache Spark and Hive pipelines for large dataset management
Accelerated data processing speeds by over 35%
Engineered a data warehousing solution that consolidated multiple data sources into a streamlined pipeline, improving data retrieval times by 50% and supporting a 30% increase in analytical productivity using Spark, and Databricks
Data Engineer Intern
Dog Paws
Seattle, WA
06.2020 - 04.2021
Recommend innovative methods for the acquisition and analysis of data to augment the quality and efficiency of the platform's data systems
Continuously assess and refine data processing workflows to meet evolving community engagement objectives
Recommended and implemented cutting-edge methods for data acquisition and analysis, significantly improving the platform's data system quality and efficiency
Utilized SQL for data querying, Python for data manipulation, and Azure Data Lake for data storage, optimizing data processing workflows
Worked alongside internal teams and management to define business needs, integrating JIRA for project management and Slack for team communication to facilitate collaboration
Front End Developer Intern
Seattle Children's Hospital
Seattle, WA
06.2020 - 09.2020
Developed a 3D web-based surgical guidance system used monthly in surgeries
Enhanced surgical outcomes and procedure efficiency
Designed and implemented an improved web UI, enhancing user engagement and navigation efficiency by 40%
Integrated real-time communication using Sockets.io, reducing surgeon response times in critical scenarios
Employed Three.js for 3D visualization, aiding surgical precision
Data Engineering & Analytics: Snowflake, DBT, Apache Spark, Hive, S3, AWS Redshift, Databricks, Azure Synapse, Azure Data Explorer, PostgreSQL, Azure Data Factory, Lambda, Matillion, Big Data, NLP, LlamaIndex
Web Development: Web UI design, real-time communication with Socketsio, 3D visualization with Threejs
Tools & Platforms: Microsoft Power Platform, Tableau, Machine Learning, GitHub Copilot, Azure DevOps
Personal Projects
AI-Powered Search & Analysis System,
Developed an AI-powered search and document processing system using LlamaIndex, Azure Cognitive Services, OpenAI embeddings, and Azure Machine Learning to extract insights, classify documents, and enable semantic search.
Built an NLP pipeline using Azure Cognitive Services for Named Entity Recognition (NER) and key phrase extraction, improving text analysis accuracy.
Trained and deployed a document classification model in Azure ML, categorizing research papers and technical documents based on topics and keywords. Implemented a high-accuracy search engine using Azure OpenAI embeddings and FAISS, reducing search latency and improving document retrieval efficiency.
Automated data ingestion and processing workflows in Azure Databricks optimizing large-scale data handling and reducing manual intervention.
Airbnb Listing Price Prediction,
Developed a predictive model for Airbnb listing prices in Seattle using Python libraries (Scikit-learn, Statsmodels), performing comprehensive feature engineering and regression analysis (simple, full, intermediate).
Created Ordinary Linear Regression (OLS) models, estimating unknown parameters to improve model accuracy and reliability.