NASA HQ ODEO – 08/2022 – PRESENT
- Work on a team of eight data scientists and data engineers to create an Enterprise Data Platform (EDP)
- Create CI/CD pipeline and aggregate data from API calls
- Develop data processing and analysis algorithms using python
- Utilize advanced querying, visualization and analytics tools to analyze and process complex data sets
- Develop and refine new analytic capabilities
- Develop and maintain web portals for data sharing and data visualization
- Provide in-depth reporting and recommendations based on analysis
AEROSPACE FINANCIAL DASHBOARD – 03/2022 – 08/2022
- Extract data and store it in a local machine.
- Use Python to transform the CSV files into Parquet files so the data is compressed, which will result in less data being processed and will be more cost effective.
- Use AWS CLI and AWS CDK via Visual Studios to create AWS S3 buckets and upload the data (both CSV and Parquet) to their respective buckets.
- Use AWS Toolkit from Visual Studios to confirm buckets were created and the files were uploaded.
- Use AWS Glue client to create a database, create a crawler, and crawl through the Parquet files to create tables.
- Use AWS Athena client to join tables and perform queries.
- Use AWS QuickSight and Tableau to visualize the results.