Pandas
Highly independent professional with strong interpersonal and analytical skills. Able to quickly learn new techs and adapt to the client's demand as needed. SFSU BS Computer Science graduate (Spring 2019). Experienced with Python, Pandas, Java, Android Studio, as well as practiced and gained knowledge on Hive, Hadoop, PySpark and other related techs.
- Designed and implemented an ETL pipeline for the
processing of medical, RX claims and other patient
information.
- Orchestrated a replication process via AWS Glue,
and Step functions facilitating seamless integration
between Postgres, MySQL, and Redshift tables.
- Created a data coherence monitoring tool in
Quicksight for alignment across Redshift, Postgres,
and MySQL tables.
- Assumed responsibility for the Redshift database,
serving as the main repository for processed data.
- Configured and managed AWS EC2 instances to
optimize infrastructure performance.
- Executed Terraform scripts to whitelist specific IP
addresses.
Python
Pandas
Tableau
AWS - (S3, EC2, RDS, Glue, Step)
Spring Boot
Docker
Sqoop, Map Reduce, Hive, Hadoop, Spark
Apple Cloud Infrastructure - (Simcloud)