Over 7+ years of work experience in IT, which includes experience in Data Engineering and Implementation of Hadoop, Spark and cloud Data warehousing solutions. Experience in developing SPARK applications using Spark tools like RDD transformations, Spark core, Spark streaming and Spark SQL. Experienced in writing Spark Applications in Scala and Python (PySpark). Experience in creating Spark Contexts, Spark SQL Contexts, and Spark Streaming Context to process huge sets of data. Experience in writing distributed Scala code for efficient big data processing. Experience in Managing scalable Hadoop clusters including Cluster designing, provisioning, custom configurations, monitoring and maintaining using Hadoop distributions: Cloudera CDH, Horton Works HDP. Hands on experience on architecting the ETL transformation layers and writing spark jobs to do the processing. Experience structural modifications using Map-Reduce, Hive and analyze data using visualization/reporting tools (Tableau). Experience writing scripts using Python and familiarity with the following tools: AWS Cloud Lambda, AWS S3, AWS EC2, AWS Redshift, AWS Postgres. Hands on experience deploying KAFKA connect in standalone and distributed mode creating docker containers using DOCKER. Experience in Data Analysis, Data Validation, Data Cleansing, Data Verification and identifying data mismatch. Hands on experience on Star Schema Modeling, Snow-Flake Modeling, FACT and Dimensions Tables, Physical and Logical Data Modeling using Erwin. Experience in Amazon Web Services (AWS) concepts like EC2, S3, EMR, ElasticCache, DynamoDB, Redshift, Aurora. Experience in data analysis using Hive, Pig Latin, and Impala. Experience working on Dockers Hub, creating Dockers images and handling multiple images primarily for middleware installations and domain configuration. Strong experience in CI (Continuous Integration)/ CD (Continuous Delivery) software development pipeline stages like Commit, Build, Automated Tests, and Deploy using Jenkins. Hands on experience in SQL and NOSQL database such as Snowflake, HBase, Cassandra and MongoDB. Experience working in both Waterfall and Agile methodologies. A self-motivated exuberant learner and adequate with challenging projects and work in ambiguity to solve complex problems independently or in the collaborative team. Strong skills in analytical, presentation, communication, problem solving with the ability to work independently as well as in a team and had the ability to follow the best practices and principles defined for the team.