Summary
Overview
Work History
Education
Skills
Timeline
Generic

Jacob Collins

Reston,VA

Summary

Software Developer with professional experience as a Data Scientist for 4.5 years. Setup data ingestion pipelines for use in text analysis, LLMs, and general tabulated data storage. Refactored old code bases with more modern libraries and techniques to achieve better data quality and ingest speed. Maintained, and added onto as needed, five projects simultaneously to insure the consistent flow of new, clean data. Ability to learn new languages and systems for projects as required.

Overview

4
4
years of professional experience

Work History

Data Scientist

Science Applications International Corporation
03.2019 - 08.2023
  • Extensive usage of Python's Pandas library to organize messy data into tabulated information for SQL tables and to help visualize data for methodologists and statisticians
  • Created custom XML parsers to clean and format Microsoft Word and PDF text documents using Python and REGEX
  • Developed data ingest pipelines from start to finish; API calls, to data formatting and cleaning, to data storage and documentation
  • Setup SQL database schemas and tables with necessary key constraints for effective data storage
  • Extracted and formatted data from databases to help drive improvement of product development and business strategies and processes
  • Refactored old ingest pipelines and code bases with newer libraries, multi-processing, and enhanced error handling to significantly improve data quality and ingest speed by over 100%

Education

Bachelor of Science - Computer Science

Christopher Newport University
Newport News, VA
05.2018

Skills

  • Python
  • SQL
  • Java
  • C
  • REGEX
  • Jupyter Notebook
  • GitHub
  • Text Mining

Timeline

Data Scientist

Science Applications International Corporation
03.2019 - 08.2023

Bachelor of Science - Computer Science

Christopher Newport University
Jacob Collins