
Over 6 years of IT experience in the analysis, design, development, and implementation of large-scale applications using Big Data and Java/J2EE technologies such as Apache Spark, Hadoop, Hive, Sqoop, Oozie, HBase, ZooKeeper, Python, and Scala. Strong experience writing Spark Core, Spark SQL, Spark Streaming, and Java MapReduce applications, as well as Spark applications in Java. Highly skilled in integrating Kafka with Spark Streaming to build long-running real-time applications. Solid understanding of RDD operations in Apache Spark, i.e., transformations and actions, persistence (caching), accumulators, broadcast variables, and broadcast optimization. In-depth knowledge of handling large amounts of data using the Spark DataFrame/Dataset APIs and case classes. Experienced in running queries with Impala and using BI tools to run ad-hoc queries directly on Hadoop. In-depth knowledge of Big Data architecture and the various components of Hadoop 1.x and 2.x, such as HDFS, JobTracker, TaskTracker, DataNode, and NameNode, and of YARN concepts such as ResourceManager and NodeManager. Hands-on experience with AWS cloud services (EC2, S3, RDS, Glue, Redshift, Data Pipeline, EMR, WorkSpaces, Lambda). Experience writing HiveQL scripts, with a good understanding of MapReduce design patterns and data analysis using Hive. Strong knowledge of the Apache Spark Streaming API on Big Data distributions in an active cluster environment. Very capable at using AWS utilities such as EMR, S3, and CloudWatch to run and monitor Hadoop/Spark jobs on AWS. Proficient in importing and exporting data between relational database systems and HDFS using Sqoop. Good understanding of NoSQL databases in enterprise use cases, including the column-family stores HBase and Cassandra and the document store MongoDB. Very capable in processing large sets of structured, semi-structured, and unstructured data and in supporting system application architecture in Hadoop, Spark, and SQL databases such as Teradata, MySQL, and DB2.
Experienced in version control and source code management tools such as Git, SVN, and Bitbucket. Experience in Java application development and client/server applications using MVC, J2EE, JDBC, JSP, XML technologies (XML, XSL, XSD), web services, relational databases, and NoSQL databases. Hands-on experience in application development using Java, RDBMS, Linux shell scripting, and Perl. Hands-on experience working with IDEs and development tools such as Eclipse, IntelliJ, NetBeans, Visual Studio, Git, and Maven, and experienced in writing cohesive end-to-end applications on Apache Zeppelin. Experience working in Waterfall and Agile (Scrum) methodologies.
Citizenship: US Citizen