Dynamic Data Scientist and AI/ML Engineer with expertise in machine learning, statistical analysis, and data modeling. Proficient in Python, R, SQL, and cloud technologies like AWS, and Azure. Proven track record of designing and deploying advanced AI models that improve operational efficiency by 25%, and reduce processing time by 30%. Skilled in leading teams to integrate AI solutions into scalable architectures, driving business growth, and optimizing decision-making through actionable insights.
Programming languages: Python, SQL, Scala, Presto SQL, C,
MATLAB, NoSQL, Java
Statistical Methods: Hypothetical Testing, ANOVA, Time Series, and Statistics
Big Data and Cloud: Apache Kafka, Alteryx, Databricks, Hadoop, Spark, Flink, Hive, MapReduce, Apache Spark, Pig, BigQuery, AWS S3, Azure, Azure Data Explorer, and WordPress
Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn,
Seaborn, TensorFlow, PySpark, SQLAlchemy, OpenPyXL
Data Processing Tools: Apache Airflow, Apache NiFi, Talend, Bash, Informatica
Data Warehousing: Snowflake, Google BigQuery, Apache Hive, Azure Cloud Services, Azure Databricks, AWS (EC2, RDS, S3,
Lambda, Redshift, Athena, Glue)
Containerization: Docker, Kubernetes, Jenkins, Git, SVN
Data Visualization: Tableau, Power BI, SSRS, SSIS, MATplotib
Databases: MySQL, MongoDB, PostgreSQL, Firebase, and Neo4J
Data Modeling: ER Diagrams, Dimensional Modeling, Star Schema, Data Vault
Machine Learning: Regression analysis, Bayesian Method, Decision Tree, Random Forests, Support Vector Machine, Neural Network, K-Means, KNN, SVM, Naive Bayes, NLP, CNN, Deep Learning, Predictive Models, MLOps, and Linux Development
Methodologies: SDC, Agile/Scrum, API Integration, Waterfall, Ignition Troubleshooting
Other Skills: Django, MS Office, Atlassian Jira, Confluence,
PeopleSoft, CI/CD, ALM, Postman, Data Cleaning, Data Wrangling, Critical Thinking, CI/CD Pipelines, Communication Skills, Presentation Skills, Problem-Solving, Salesforce,
A/B Testing Analysis, ETL, Automated test scripts, UI/UX, React