Over 8 years of professional IT experience and expert in Requirements Gathering, designing, development, implementation and testing of Multi-tired, Distributed Applications and Web Based Applications using Big data and JAVA/J2EE technologies. Strong experience in various phases of Software Development Life Cycle (SDLC) as requirement gathering, modeling, analysis, architecture design, development, testing and implementation. Strong experience on designing Big data pipelines such as Data Ingestion, Data Processing (Transformations, enrichment and aggregations) and Reporting. Strong experience in developing jobs using Apache Spark. Extensive knowledge in programming with DataFrames and Resilient Distributed Datasets (RDDs). Strong experience in submitting Spark applications in different clusters such as Spark Standalone and Hadoop Yarn. Strong knowledge in various Python libraries like Numpy and Pandas. Strong knowledge in various Machine Learning algorithms like KNN, Naive Bayes, Logistic Regression and Linear Regression, SVM, Decision trees, Random Forest and Gradient Boosted Decision Trees. Experienced applying machine learning and deep learning techniques to build models and analyze large scale data. Profound experience in implementing real time data streaming solutions using Spark Streaming, Kafka. Good knowledge on various Amazon Web Services (AWS) such as S3, EC2,Redshift, ECS, EMR, VPC, RDS,SQS, ELB . Experience in tuning and improving the performance of spark jobs by exploring various options. Strong Experience in migrating data using Sqoop from HDFS to Relational Database Systems and vice-versa. Strong experience in developing the workflows using Apache Oozie framework to automate tasks. Experience in working with Map Reduce programs using Apache Hadoop to analyze large data sets efficiently. Strong experience in working with Core Hadoop components like HDFS, Yarn and Map Reduce. Strong experience in Cloudera Hadoop distribution with Cloudera manager. Experience in launching spark applications by using the Kerberos authentication. Strong knowledge in developing Spark applications using Scala. Good understanding and knowledge of NoSQL databases like HBase and Cassandra. Good experience in performing and supporting Unit testing, Integration testing, QAT and UAT and production support for issues raised by application users. Experience in using Design Patterns: Singleton Pattern, DAO and MVC Pattern. Experienced in generating logging by Log4j to identify the errors in production test environment. Efficient in developing java applications in various Integrated Development Environment (IDE) tools like Eclipse, My Eclipse and RAD. Experience in deploying applications on IBM WebSphere and Apache Tomcat. Having good experience on using version control tools like GIT, SVN and Clearcase. Hands on experience in setting up repositories in SBT, Maven and Ant. Outstanding skills in design aspects and technical documentation along with strong interpersonal, analytical, and Organizational skills. Experience in developing application on different platforms like Windows, UNIX and LINUX.