Summary

Overview

Work History

Education

Skills

Timeline

Bejoy Thankachan

Bethel,CT

Summary

Over 15+ years of overall experience as an IT developer including 5+ years as a Big Data/Hadoop Developer. Good knowledge of Hadoop Distributed File System and Ecosystem components like SPARK, MapReduce, HIVE, PIG, HBase, Sqoop, Oozie, Storm, Zookeeper and Flume. Detailed understanding of Hadoop internal architecture and functionality of various components such as Job Tracker, Task Tracker, Name Node & Data Node, Application Master, Resource Manager, Node Manager & MapReduce programming paradigm. Experience in Apache Spark, Spark Streaming, Spark SQL and No SQL databases like Cassandra and HBase. Used CQL to retrieve the data from Cassandra DB. Experience in Hive query language for data analytics and loading data into Hive partitions and bucketing. Experience in Cloudera and Horton Works distribution also Cloudera manager to manage and monitor Hadoop cluster. Used Spark streaming to divide streaming data into batches as an input to Spark engine for batch processing. Implemented Spark Scripts using Scala, Spark SQL to access Hive tables into Spark for faster processing of data. Experienced in performance tuning of Spark Applications for setting right Batch Interval time correct level of Parallelism and memory tuning. Developed a Pig Latin scripts for transformations and using Hive Query Language for data analytics. Experienced in importing and exporting data from different databases like MySQL, Oracle, Teradata into HDFS and vice-versa using Sqoop. Developing various cross platform products while working with different Hadoop file formats like Sequence File, RC File, ORC, AVRO & Parquet. Have experience in Shell Scripting like Scala/Python scripting languages and used it extensively with Spark for data processing. Hands on experience with batch processing of data sources using Apache Spark. Implemented Spark RDD transformations actions to implement business analysis. Used Flume to collect aggregate and store the web log data onto HDFS. Used Zookeeper for various types of centralized configurations. Experienced in loading the huge data from local file system and HDFS to Hive and writing complex queries to load data into internal tables. Experience in processing of load and transform the large data sets of structured, unstructured and semi structured data. Utilizing Spark streaming to receive real time data from the Kafka and store the stream data to HDFS using Python/PySpark also Scala and databases such as HBase Imported and extracted the needed data using Sqoop from the server into HDFS and Bulk Loaded the cleaned data into HBase using MapReduce. Promote full cycle approach including request analysis, creating/pulling dataset, report creation and implementation and providing final analysis to the requestor. Very Good understanding of SQL, ETL and Data Warehousing Technologies. Designing and creating ETL jobs through Talend to load huge volumes of data in Hadoop Ecosystem and relational databases. Developed a numerous application using Java, J2EE, JSP, SPRING, Hibernate, XML, HTML, PL/SQL, JavaScript and jQuery. Experience database development skills using SQL/PLSQL for various relational Databases like Oracle, Sybase, Postgress SQL, SQL server and NOSQL databases like MongoDB. Developed a website using RESTful APIs to fetch data from the web server. Java developer with extensive experience on various Java Libraries, API's, front end, back end and frameworks. Worked on Log4J package for logging purposes and CVS, Sub Version for the version control. Strong ability to understand new concepts and applications. Excellent Verbal and Written Communication Skills have proven to be highly effective in interfacing across business and technical groups. Results-driven Lead Technologist with extensive experience in AWS, Big Data Analytics, and ETL Development. Proven ability to lead complex projects and implement innovative big data solutions.

Overview

years of professional experience

Work History

LEAD TECHNOLOGIST

Booz Allen Hamilton

Herndon, VA

08.2019 - Current

AWS BIG DATA MIGRATION LEAD

Freddie Mac

McLean, VA

04.2019 - 08.2019

SR. BIG DATA DEVELOPER

Boeing

Seattle, WA

09.2018 - 04.2019

SR. BIG DATA DEVELOPER

Transamerica

06.2016 - 08.2018

SR. BIG DATA DEVELOPER

Voya Financial

Manhattan, NY

02.2013 - 05.2016

SR. BIG DATA/JAVA DEVELOPER

Northrop Grumman

San Diego, CA

02.2012 - 12.2013

SR. JAVA DEVELOPER

Vanguard Inc

Malvern, PA

09.2010 - 02.2012

SR. JAVA DEVELOPER

Dept. of Transportation

Manhattan, NY

08.2007 - 09.2010

JAVA DEVELOPER

Pfizer Pharmaceuticals

Parsippany, NJ

12.2006 - 06.2007

Education

Master’s - Computer Science

Mahathma Gandhi University

01-2005

Bachelor of Science - Physics

SB College Changanacherry

01-2001

Skills

MapReduce
SPARK
HBase
Hive
Cassandra
Sqoop
Impala
Elasticsearch
Databricks
AWS
EMR
Cloud Computing
Zookeeper
MapR
Data modeling
ETL development
Big data analytics
Java
Scala
Python
JDBC
Hibernate
RESTful Services
Servlets
JSP
Spring Framework
Struts
Web Services
AJAX
TypeScript

Modern JavaScript
UI Frameworks
AngularJS
Responsive Design
Linux
Ubuntu
Apache Tomcat
Containerization
Serverless Architecture
Integrated Development Environment
Test Automation
Continuous Integration
Dependency Management
Agile Methodologies
SDLC
Agile Modeling
Architectural Patterns
SOA
Data Analysis
MySQL
Oracle
MongoDB
HBASE
Problem solving
SOA
Data modeling
ETL development
Big data analytics
Problem solving

Timeline

LEAD TECHNOLOGIST

Booz Allen Hamilton

08.2019 - Current

AWS BIG DATA MIGRATION LEAD

Freddie Mac

04.2019 - 08.2019

SR. BIG DATA DEVELOPER

Boeing

09.2018 - 04.2019

SR. BIG DATA DEVELOPER

Transamerica

06.2016 - 08.2018

SR. BIG DATA DEVELOPER

Voya Financial

02.2013 - 05.2016

SR. BIG DATA/JAVA DEVELOPER

Northrop Grumman

02.2012 - 12.2013

SR. JAVA DEVELOPER

Vanguard Inc

09.2010 - 02.2012

SR. JAVA DEVELOPER

Dept. of Transportation

08.2007 - 09.2010

JAVA DEVELOPER

Pfizer Pharmaceuticals

12.2006 - 06.2007

Master’s - Computer Science

Mahathma Gandhi University

Bachelor of Science - Physics

SB College Changanacherry

Similar Profiles

Naresh UreNaresh Ure
Senior AI/ML Engineer | Lead Data Scientist at TOYOTASenior AI/ML Engineer | Lead Data Scientist at TOYOTA
Sudershan VaidyaSudershan Vaidya
Senior Technical Lead at HCL Technology LtdSenior Technical Lead at HCL Technology Ltd
BHANU PRAKASH REDDY RELLABHANU PRAKASH REDDY RELLA
Lead Data Engineer at Walmart Associates IncLead Data Engineer at Walmart Associates Inc
Aparna AGAparna AG
Tech Lead, Sr. Developer, Co-ordinator at AnthemTech Lead, Sr. Developer, Co-ordinator at Anthem

CREATE PROFILE

Summary

Overview

Work History

LEAD TECHNOLOGIST

AWS BIG DATA MIGRATION LEAD

SR. BIG DATA DEVELOPER

SR. BIG DATA DEVELOPER

SR. BIG DATA DEVELOPER

SR. BIG DATA/JAVA DEVELOPER

SR. JAVA DEVELOPER

SR. JAVA DEVELOPER

JAVA DEVELOPER

Education

Master’s - Computer Science

Bachelor of Science - Physics

Skills

Timeline

LEAD TECHNOLOGIST

AWS BIG DATA MIGRATION LEAD

SR. BIG DATA DEVELOPER

SR. BIG DATA DEVELOPER

SR. BIG DATA DEVELOPER

SR. BIG DATA/JAVA DEVELOPER

SR. JAVA DEVELOPER

SR. JAVA DEVELOPER

JAVA DEVELOPER

Master’s - Computer Science

Bachelor of Science - Physics

Similar Profiles

Naresh UreNaresh Ure

Sudershan VaidyaSudershan Vaidya

BHANU PRAKASH REDDY RELLABHANU PRAKASH REDDY RELLA

Aparna AGAparna AG