Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Generic

Rakesh Surendra

Kansas City

Summary

As a passionate Data Architect, AI Enthusiast, and IEEE Member, 14 years of expertise in software development have been dedicated to specializing in Data Engineering, Big Data, and Cloud Development. Holding a Master’s degree in Computer Science from the University of Texas at Dallas, skills have been honed in designing and implementing robust, real-time big data systems using cutting-edge technologies. The background includes creating and maintaining data pipelines, big data processing, data warehousing, and analytics. Known for a quality-driven approach and hardworking nature, combined with excellent communication and project management skills.

Overview

16
16
years of professional experience
1
1
Certification

Work History

Data Engineer

Vectorworks
11.2020 - Current
  • Responsible for expanding and optimizing data from Vectorworks products. Building data pipeline architecture, as well as optimizing data flow and collection for cross functional teams. Build infrastructure required for optimal extraction, transformation, and loading data from wide variety of data sources using Python, Spark, Airflow, RedShift, Elastic Search and AWS EMR.
  • Built analytics tools that utilize data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics.
  • Cost Reduction & Efficiency Improvement: Successfully reduced the cost of EMR jobs by 30% and improved efficiency by 40% through spark-tuning PySpark scripts, demonstrating a strong ability to enhance operational performance. Analyzed complex data and identified anomalies, trends and risks to provide useful insights to improve internal controls.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability. Contributed to internal activities for overall process improvements, efficiencies and innovation.
  • Machine Learning & Prompt Engineering: Leveraged machine learning models and prompt engineering techniques to develop intelligent systems that enhance data processing and predictive analytics. Currently working on customer churn analysis to improve customer retention and satisfaction.

Big Data Engineer

InMobi
11.2018 - 11.2020
  • Monitored incoming data analytics requests and distributed results to support internal products and strategies.
  • Employed data cleansing methods, significantly Enhanced data quality. Compiled, cleaned and manipulated data for proper handling.
  • Design, development of Apache Spark jobs which runs on AWS EMR, S3 to ETL and aggregate anonymized advertising data and schedule automated Python-based daily workflows using Apache Airflow DAG’s. Worked on cross-functional team to move PySpark ETL workflows from on-premise Hadoop data management platform to AWS.
  • Data pipelining and ETL (data acquisition, cleaning, transformation) & Data visualization using Apache Airflow and InfluxDB along with Grafana.

Application Developer III

Sprint Corporation
06.2015 - 11.2018
  • Position involved Real-time ingestion of data into a relational No-SQL database for reporting, dash boarding and ad-hoc analysis. Additional development experience with Java, J2EE, Apache Storm/Trident, Apache Kafka and Cassandra in building Real-time Big Data components.
  • Position involved designing, development, maintenance of real-time analytics component to capture, transform, analyze and store terabytes of structured and unstructured data.
  • Collaborated with stakeholders regarding project capabilities and limitations to deliver optimal functionality.
  • Collaborated with multidisciplinary teams to design and implement new technology features.

Software Engineer

Oracle Cerner Corporation
06.2013 - 04.2015
  • Worked with software development and testing team members to design and develop robust solutions to meet client requirements for functionality, scalability and performance.
  • Worked on Java, Hadoop, HBase, Storm and Map Reduce building real time Big Data systems for processing Electronic Medical Records.
  • Maintained a secure and massive cloud architecture which is capable of storing and retrieving critical health care data in milliseconds.
  • Tested methodology with writing and execution of test plans, debugging and testing scripts and tools.
  • Introduced agile methodologies and development best practices to division to enhance product development.

Software Developer Intern

Information Processing Corporation
09.2012 - 02.2013
  • Responsible for Development & Testing for Omni Sports Management (OSM) mobile application - used JSON Ajax calls to send and receive data from OSM Mobile Service database.
  • Developed OSM mobile application on PhoneGap platform. Ported desktop application into mobile application using PhoneGap (iOS/ Android / Blackberry/J2ME/Windows).

Software Engineer

Remoba Technologies
06.2008 - 07.2011
  • Worked with software development and testing team members to design and develop robust mobile applications to meet client requirements for functionality, scalability and performance.
  • Reviewed project specifications and designed technology solutions that met or exceeded performance expectations.
  • Worked on RemoSync, YourNumbers Backup & Sidekick Sync mobile applications on Android, Blackberry, J2ME, Danger mobile platforms and delivered them successfully.

Education

Master of Science - Computer Science

The University of Texas At Dallas
Richardson, TX
05.2013

Bachelor of Science - Electrical, Electronics And Communications Engineering

Anna University
04.2006

Skills

    Apache Spark, Python, Apache Airflow, Elastic MapReduce, EC2, S3, Lambda, SQS, SNS, Elastic Search, MySQL, Java, Hadoop, AWS Redshift, Apache Kafka, Apache Storm/Trident, Cassandra, Android, Zookeeper, MapReduce, JBehave, Mockito, Jenkins, JUnit, Splunk, Jenkins, JIRA, Bugzilla, Crucible, PostgreSQL, Source Safe, GitHub, Assembla, Data Security, Performance Tuning, ETL development, Machine Learning, Data Warehousing, NoSQL Databases, Data Pipeline Design, Big Data Processing, Data Quality Assurance, Real-time Analytics, Business Intelligence, Teamwork and Collaboration

Accomplishments

    US Patent - Predictive intelligent processor balancing in streaming mobile communication device data processing.

    (https://patents.justia.com/patent/10313219)

    US Patent Office : Issued Jun 2019 ·

    ID Patent number: 10313219

Certification

  • AWS Certified Data Analytics - Specialty
  • Python for Data Science Professional Certification
  • ChatGPT - Prompt Engineering Certification
  • Amazon AWS Technical Essentials
  • SCMAD 1.0
  • SCJP 5.0

Timeline

Data Engineer

Vectorworks
11.2020 - Current

Big Data Engineer

InMobi
11.2018 - 11.2020

Application Developer III

Sprint Corporation
06.2015 - 11.2018

Software Engineer

Oracle Cerner Corporation
06.2013 - 04.2015

Software Developer Intern

Information Processing Corporation
09.2012 - 02.2013

Software Engineer

Remoba Technologies
06.2008 - 07.2011

Master of Science - Computer Science

The University of Texas At Dallas

Bachelor of Science - Electrical, Electronics And Communications Engineering

Anna University
Rakesh Surendra