
Python, SQL, PySpark, Bash
Machine LearningRegression, Classification, XGBoost, LightGBM, Random Forest, Clustering, PCA, Time Series Modeling
Deep LearningTensorFlow, PyTorch, CNNs, RNNs, LSTM, Autoencoders, Transformer Architectures
Large Language Models (LLMs) & Generative AIBERT, RoBERTa, GPT-35/4, Mistral, LLaMA2, Hugging Face, LoRA, PEFT, Prompt Engineering, RAG Pipelines
Natural Language ProcessingTokenization, NER, Summarization, Topic Modeling, Sentiment Analysis, Conversational AI
Vector Databases & LLM ToolingFAISS, Pinecone, PGVector, LangChain, LlamaIndex, OpenAI API
Cloud PlatformsAWS (S3, EC2, SageMaker),
Azure (Databricks, ML Studio, Data Factory),
GCP (Vertex AI, BigQuery)
MLflow, Airflow, Docker, Kubernetes, GitHub Actions, GitLab CI/CD
Big Data & Data EngineeringSpark, Databricks, Hadoop, Hive, Kafka, Snowflake, ETL/ELT Pipelines
DatabasesPostgreSQL, MySQL, SQL Server, MongoDB, Vector DBs (Pinecone, PGVector)
Visualization & Application DevelopmentTableau, Power BI, Matplotlib, Seaborn, Streamlit, Flask