
Around 6 years of experience in Software Development with strong focus on Big Data, Hadoop and Spark. Strong expertise in Big Data ecosystem like Spark, Hive, Sqoop, HDFS, Map Reduce, Kafka, Yarn. Developed production ready Spark applications using Data frames, Datasets, Spark SQL and Spark Streaming. Solid experience in using various file formats like CSV, XML, Parquet, ORC, JSON. Strong knowledge of NoSQL databases and worked with HBase, Cassandra and Mongo DB. Experience in using cloud services like Amazon EMR, S3, EC2, Red shift, Athena and Azure Databricks, Azure Data Factory. Worked on Spark Streaming and Structured Spark streaming including Kafka for real time data processing. Good knowledge in Oracle PL/SQL and shell scripting. Worked extensively in Agile methodology to complete projects continuously and collaboratively. Having strong analytical and problem-solving skills and can resolve complex technical issues. Seasoned Senior Data Engineer with background in developing, testing, and maintaining data architectures. Possess strong skills in database management systems, Big Data processing frameworks, data modeling and warehousing. Have successfully led teams in creating innovative data solutions to improve system efficiency and business decision-making processes. Demonstrated impact through enhanced data availability and accuracy in previous roles.