Cloudera Hadoop


Dynamic Data Engineering and Analytics Solution Architect with over 18 years of experience in the IT industry, specializing in delivering innovative solutions across on-premises and cloud platforms. Proven ability to manage multiple data engineering projects simultaneously while consistently achieving high-quality results. Expertise includes a wide range of technologies such as Hadoop, ETL tools, and cloud services like Azure and AWS, along with strong skills in programming languages including Python, Java, and Scala. Recognized for leadership in agile project management and a collaborative approach to problem-solving, driving success in complex environments within the life sciences and healthcare sectors.
Held dual positions of solution architect and technical program manager for analytics applications. Managed timely delivery of high-quality applications across various projects. Simultaneously directed technical solutioning and people management initiatives. Engaged with Johnson & Johnson in the life sciences industry to fulfill requirements. Expertise in developing applications using Cloudera Hadoop tools for robust data solutions. Utilized Azure and Databricks for cloud-based application development to improve performance. Incorporated generative AI elements to enhance user experience within applications.
· Architected and managed multiple projects simultaneously in the life sciences domain.
· Built supply chain and clinical trial data engineering and analytics applications for the medical device and pharma sectors, respectively.
· Solutioned, designed, and built big data analytics applications on Cloudera Hadoop and Azure Cloud.
· Expertise in building big data Hadoop analytics applications using HDFS, Sqoop, Hive, Impala, MapReduce, Spark, HBase, Kudu, Pig, Oozie, and NiFi.
· Experience in building Azure applications using Data Factory, Data Lake Store, Blob Storage, Databricks, HDInsight, and Synapse.
· Built streaming applications using Kafka and Spark Streaming.
· Built data warehouse applications using Teradata.
· Used ETL tools such as Informatica and Talend.
· Built pilot and POC applications with generative AI using OpenAI LLMs and secured AI gateway integration.
· Orchestrated and scheduled applications using Control-M, Tidal, Oozie, NiFi, and ADF.
· Built reports using Tableau.
· Programmed using Python, Java, Scala, and Shell scripts.
· Implemented CI/CD using JIRA, Bitbucket, Jenkins, and JFrog Artifactory.
· Performed POCs on Snowflake, Denodo, Cloudera Search, Solr, Tesseract, NLP, and CI/CD.
· Experience in building front ends using Python Flask and Streamlit.
· Trained in GCP and AWS
· Expertise in building applications using Agile methodology.
· Prepared and presented effort estimates, design proposals, and value additions.
· Coordinated directly with business and technical stakeholders onsite.
· Ensured end-to-end delivery, including delivery quality.
· Successfully managed multiple teams and delivered multiple programs simultaneously.
Cloudera Hadoop
HDFS
Hive
Impala
Spark
Sqoop
Kudu
Oozie
Python
Scala
Java
MS SQL
Oracle SQL
Tidal
Control-M
NiFi
Azure ADLS
Azure Data Factory
Databricks
Azure Logic Apps
Azure Synapse
Snowflake
Teradata
Informatica
Tableau
OpenAI LLM
C# .NET with WCF
C++ with MFC