Junior data engineer/scientist looking for a role in data engineering/data science/ML OPS.
Overview
2
2
years of professional experience
1
1
Certification
Work History
Data Scientist
Kharon
05.2022 - 03.2023
Built a feature on a Zyte SmartProxy Selenium based web scraping tool in Airflow to increase data extraction by 30% using SQL to update a relational database and CQL to prevent duplication.
Wrote a ETL Python script to acquire data from an SFTP server.
Created daily updating dashboards in Retool using Javascript that made API requests to a lambda function made with an OpenFaas template created in our Kubernetes cluster.
Created a proof of concept question answering system using OpenAI's completions, fine-tuning and embeddings API that also provided citations and answered with 80% accuracy.
Debugged and fixed issues in data pipelines and incorporated feature requests from the data operations team for data transformations.
Built data ingestion systems to extract data, index them for OpenSearch/ElasticSearch, load them in AWS S3 and make them queryable in AWS Athena, Kibana and Databricks
Data Analytics Teaching Assistant
Correlation One
03.2022 - Current
Lead weekly Python classes for beginners for the whole cohort.
Assisted with Excel, SQL and Tableau instruction for 40 fellows.
Held office hours and participated in curriculum design weekly.
Risk Analyst (Intern)
United Auto Credit
06.2021 - 12.2021
Built a system that extracts dealer websites using Google Maps API. 31% of data extraction automated which was used for the decision engine.
Analyzing KPIs and used Excel and SQL to query and wrangle data.
Created visualizations in Domo.
Education
Master of Science - Applied Statistics
California State University, Long Beach
Long Beach, CA
12.2021
Bachelor of Science - Computational And Systems Biology, Minor in Math