Data Scientist with a proven track record at SDSU Research Foundation in genomic ETL pipeline development and interactive dashboard creation. Skilled in Python and Tableau, with a focus on enabling data-driven decision-making and collaborating across teams. Achieved a 40% reduction in processing time.
Programming Languages: Python, SQL, R, Java, C, Rust
Databases and Frameworks: MySQL, PostgreSQL, MongoDB, Flask, Snowflake, NoSQL, ChromaDB, Pinecone, SQLite, Kubernetes, FastAPI, LangGraph, LlamaIndex, CrewAI, SAS, Kafka, Airflow, MLflow
ML/DL: KNN, RNN, CNN, Transformers, BERT, GPT-4, LLaMA
Libraries: TensorFlow, PyTorch, Keras, Matplotlib, Seaborn, PySpark, spaCy, NumPy, Pandas, SciPy, OpenAI, NLTK, MLlib, OpenCV, BeautifulSoup, scikit-learn, Streamlit, Git, Bitbucket, Transformers, NLP
Technologies and Tools: Selenium, AWS (EC2, S3, RDS, Lambda, SageMaker), Azure (Data Factory, Data Lake, Databricks, Blob Storage), GCP (GCS, Dataflow, BigQuery, Vertex AI), GitHub, Apache Spark, Excel, Agile, Tableau, Sigma, Looker, Power BI, Microsoft Fabric