Data Engineer with over 5 years of experience developing and implementing large language model, NLP, and machine learning solutions. Proficient in Python, PySpark, and SQL, with extensive experience building scalable infrastructure on AWS and Azure. Proven expertise in data warehousing, ETL processes, and big data technologies such as Apache Hadoop and Spark. Skilled in using Airflow for scheduling and workflow management, and experienced in analytics and monitoring to support data-driven decision-making and business transformation.
• Programming Languages : Python, Java, R, SQL, PL/SQL, NoSQL
• Databases : MySQL, MongoDB, Oracle, PostgreSQL, Snowflake, Oracle Exadata
• Cloud Technology : Amazon Web Services (AWS), Azure AI
• Big Data : Apache Hadoop, HDFS, MapReduce, Hive, HBase, Spark (PySpark)
• ML Frameworks : Flask, NumPy, Pandas, Scikit-learn, TensorFlow, Keras, Matplotlib, PyTorch, Seaborn
• Web Development : HTML5, CSS3
• Scheduling & CI/CD Tools : Airflow, GitLab, Jenkins, Kubernetes, Jira, Ansible
• Data Warehouse : Prism, data mapping, Informatica ETL
• Large Language Models : HuggingFace, OpenAI, Llama
• Other Tools : Power BI, Tableau, Excel, AutoSys
AWS Certified Solutions Architect - Associate, Azure AI Engineer Associate, HackerRank Python Certification, IBM Python Certification