Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
Simi Joseph

Simi Joseph

Senior Data Engineer
San Antonio,TX

Summary

Senior Data Engineer with over 10 years of experience in data engineering and software development. Expert in optimizing data pipelines, reducing costs, and enhancing data quality. Achievements include reducing daily data acquisition costs by 90%, achieving a 75% cost reduction in Glue job environment, and architecting solutions that led to a significant growth in online sales. Skilled in AWS, Azure, ETL tools, data modeling, and distributed systems. Seeking a Data Engineer position to leverage expertise in building efficient data processing systems and driving analytical insights to support the company's mission of revolutionizing entertainment through data-driven decision-making.

Overview

20
20
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Amazon
05.2022 - Current
  • Generated over $228K in annual cost savings through comprehensive ETL optimization, including Glue job rewrite, data model improvements, and S3 storage optimization ($76K)
  • Reduced data acquisition costs by 90% for a 5 TB table, generating $912.5K in annual savings (from $1.095M to $182.5K) by redesigning the data pipeline to leverage cost-effective DynamoDB table exports to S3 and optimizing job scheduling
  • Achieved a 75% cost reduction in the pre-production Glue job environment and a 24% decrease in production costs by implementing cost tagging for individual jobs, identifying major cost contributors, and applying targeted optimizations
  • Streamlined data engineering by creating a unified Glue Job Framework, resulting in increased code reusability by 90% and reducing maintenance efforts
  • Optimized SQL queries, resulting in a 92.8% faster report generation (e.g.: from 3 hours 25 minutes to 7 minutes 18 seconds). This improvement enabled faster access to critical business insights

Lead Data Engineer

H-E-B San Antonio
09.2018 - 05.2022
  • Architected solutions across various business verticals during the pandemic, achieving a significant growth in online sales from $1M in 2018 to over $2B by 2021.
  • Led development projects using Agile methodology, maintained productivity with a reduced team from 8 to 3 members, ensuring projects met scope and timeline.
  • Utilized Informatica and TIBCO to engineer programs, enhancing architecture decisions and reducing project timelines from 6 to 2 months.
  • Facilitated successful on-prem to Cloud data archival by designing an efficient Azure data pipeline.
  • Optimized code implementation, drastically reducing processing times from ~20 minutes to ~3 seconds.
  • Coordinated successful deployment of new integrations, enabling dashboards that differentiated in-store from e-Commerce sales, increasing sales by 16.5% annually.
  • Provided biweekly 24/7 on-call technical support, troubleshooting system and data issues, improving usability and customer service.

Senior Data Engineer

H-E-B San Antonio
09.2014 - 09.2018
  • On boarded and mentored new members of the team (across US) providing day to day direction and regular status updates to the leadership
  • Drive overall execution of multiple initiatives and coordinate development of deliverables and execute against established timelines
  • Engineered and maintained scalable data pipelines and built out new integrations to support continuing increases in data volume and complexity
  • Led a cross-functional collaboration (analytics, business, engineering) to refine physical data models for BI tools. This increased data accessibility for at least 60% of our customer base, eliminating the need for individual data pipelines. This empowered data-driven decision-making across the organization
  • Delivered automated processes to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it
  • Establish testing protocols to ensure the accuracy of the data produced for the business and provided up to date well-defined documentation
  • Led incident analysis efforts, resulting in a 25% reduction in recurring issues through long-term resolution strategies

Senior Software engineer

UST Global
01.2006 - 09.2014
  • Designed, developed, implemented and supported Enterprise Application Integrations using TIBCO EAI suite of products
  • Developed business processes by configuring shared resources, creating process definitions, creating activities and configuring message transports using TIBCO Business Works
  • Worked on implementing Generic Error Handling framework for all TIBCO Business Processes
  • Managed and led a redesign of a web application project and successfully launched against the established tight deadlines that saw a 45% increase in user interaction
  • Led 24 x 7 BI production support team that managed over 300 workflows
  • Dynamic team member who has played multiple roles of Production Support lead, Technical Project Lead, Mentor, Developer and Tester, shifting between various roles with ease

Education

Bachelor of Technology - Computer Science and engineering

College of engineering
Chengannur, India
05.2005

Skills

Python programming

Data pipeline design

ETL development

Advanced SQL

Data warehousing

Performance tuning

Git version control

Data modeling

NoSQL databases

Big data processing

API development

Data quality assurance

Real-time analytics

Data integration

Linux administration

Continuous integration

SQL programming

Business intelligence

SQL and databases

Data analysis

Data migration

Relational databases

Database administration

Advanced analytics

Backup and recovery

Big data technologies

Data visualization

Problem-solving abilities

Teamwork and collaboration

Effective communication

Analytical thinking

Team building

Team collaboration

Data aggregation processes

Excellent communication

Data governance

Problem-solving aptitude

Amazon redshift

Data operations

Multitasking

Problem-solving

Statistical analysis

Adaptability

Organizational skills

Decision-making

Data programming

Data repositories

Adaptability and flexibility

Data acquisitions

Interpersonal skills

Time management abilities

Goal setting

Multitasking Abilities

Professionalism

Task prioritization

Written communication

Continuous improvement

Active listening

Self motivation

Critical thinking

Analytical skills

Certification

AWS Certified Cloud Practitioner

Timeline

AWS Certified Cloud Practitioner

12-2025

Senior Data Engineer

Amazon
05.2022 - Current

Lead Data Engineer

H-E-B San Antonio
09.2018 - 05.2022

Senior Data Engineer

H-E-B San Antonio
09.2014 - 09.2018

Senior Software engineer

UST Global
01.2006 - 09.2014

Bachelor of Technology - Computer Science and engineering

College of engineering
Simi JosephSenior Data Engineer