Saroj Banjara

Round Rock, TX

Summary

Experienced Data Engineer with expertise in IBM InfoSphere DataStage, SQL, UNIX scripting, and data analysis. Proficient in using Python for data processing and transformation, including preprocessing complex file formats, automating ETL workflows, and enhancing data integration. Skilled in BI reporting using SAP BusinessObjects Web Intelligence (WebI) and Tableau, job automation with Control-M, and implementing CI/CD pipelines through GitLab. Demonstrated proficiency in Snowflake for cloud-based data integration and transformation. Successfully led system upgrades, migrated legacy repositories, and optimized data extraction processes to enhance efficiency and integration capabilities.

Overview

8 years of professional experience

Work History

ETL Python Developer

Apex Systems | USAA
01.2020 - 12.2024
  • Orchestrated the development and management of ETL processes using IBM InfoSphere DataStage, ensuring seamless data flow across systems, enhancing reporting accuracy, and saving 15 hours weekly on manual tasks.
  • Optimized complex data transformations by utilizing Joins, Lookup, Aggregator, and Transformer stages in IBM InfoSphere DataStage, improving ETL performance by 25% and ensuring data consistency across multiple source systems.
  • Developed and optimized complex SQL queries involving joins (inner, outer, cross), subqueries, CTEs, and window functions to extract and transform large datasets, ensuring efficient data retrieval.
  • Partnered with business users and downstream teams to align data requirements, streamline reporting workflows, and enhance data accuracy for decision-making.
  • Leveraged UNIX scripting and IBM DataStage Designer to handle complex ETL workflows, transforming and loading data into Oracle and Snowflake databases, ensuring efficient, accurate, and seamless data integration.
  • Developed Python-based solutions for preprocessing complex data files (e.g., PDFs) using Pandas and NumPy, automating data loading processes into target systems and saving significant manual effort and time.
  • Collaborated with vendors, including Princeton Asset Management (PAM) for General Ledger and Accounting, and Charles River Investment Management System (CRIMS) for compliance and live trade operations, ensuring smooth financial and trading workflows.
  • Facilitated the transition from Charles River Investment Management System (CRIMS) to Broadridge Investment Management System (BIMS) and integrated BlackLine for reconciliation processes, streamlining financial operations and enhancing system efficiency.
  • Conducted data analysis, developed data models, and implemented data warehousing concepts to support efficient data storage and retrieval processes.
  • Designed and developed reports in SAP Web Intelligence, Crystal Reports, and Tableau to support the analytical needs of Investment and Accounting teams, enhancing decision-making processes.
  • Troubleshot ETL job errors and debugged data issues, delivering innovative solutions to complex problems to ensure seamless data processing and reliability.
  • Automated job schedules using Control-M and contributed to testing and deployment of new projects using GitLab and UrbanCode Deploy, ensuring efficient and reliable workflows.
  • Implemented CI/CD pipelines for automated deployments, significantly reducing deployment time and enhancing overall operational efficiency.
  • Led the upgrade of IBM DataStage, ensuring system compatibility, improved performance, and minimal disruption to ongoing operations.
  • Collaborated with cross-functional teams to migrate legacy repositories, streamline workflows, and enhance operational processes for improved efficiency.
  • Optimized data extraction processes and automated data flow, resulting in improved operational efficiency and reduced manual intervention.
  • Provided support for Business Intraday processes, ensuring timely and accurate data handling to meet business requirements.
  • Monitored IT data platforms and server health, ensuring optimal performance and proactive identification of potential issues to maintain system reliability.
  • Led a team of five onshore and offshore members, providing feature requirements and support during development, testing, and implementation, and drove workflow efficiency and workspace improvements through active participation in proof-of-concept (PoC) reviews.

Data Analyst

One Dollar Zone
12.2016 - 04.2018
  • Automated sales data extraction and reporting processes, reducing manual reporting time by 40% and providing real-time insights into daily sales performance.
  • Analyzed customer purchase data to create targeted marketing segments, boosting customer retention and conversion rates for promotional campaigns.
  • Analyzed pricing trends and competitor data to recommend pricing adjustments, resulting in an increase in overall profit margins.
  • Created interactive dashboards for senior management, enabling faster decision-making and reducing time taken to assess business performance metrics.
  • Analyzed customer purchase data to identify popular products, optimizing product assortment and driving sales.
  • Standardized data collection and reporting processes, reducing reporting errors by 25% and increasing transparency for stakeholders.
  • Leveraged data visualization tools like SAP WebI to effectively communicate business insights, enhancing decision-making and facilitating clear understanding of complex data.
  • Developed customized reports, summarizing and presenting data in a visually appealing format to enhance readability and support informed decision-making.
  • Created data models to support data-driven strategies and business outcomes, while providing technical support for troubleshooting analytics and reporting issues to ensure timely resolution and minimize disruptions.

Education

Bachelor of Science - Information Technology

Columbia Southern University
Orange Beach, AL
11.2023

Post-Graduate Certificate - Artificial Intelligence and Machine Learning: Business Applications

McCombs School of Business at the University of Texas at Austin and Great Learning
Austin, TX
11.2021

Associate of Science - Computer Programming

ASA College
Manhattan, NY
09.2019

Skills

  • ETL Development
  • Data Analysis
  • IBM InfoSphere DataStage
  • Python
  • UNIX Scripting
  • SQL Programming
  • Snowflake
  • AWS
  • SAP Web Intelligence
  • Crystal Reports
  • Tableau
  • BI Reporting
  • Control-M
  • GitLab CI/CD Pipelines
  • Agile Methodology

Education

  • Participate in systems and data implementation projects (requirements documentation, data models, dashboard design, test documentation/execution, issue identification and resolution).
  • Collaborate with business users to translate business requirements into technical data solutions.
  • Guide projects from concept (requirements, functional design) to launch (testing, sign offs).
  • Design and implement data models to promote connectivity and reuse of enterprise data concepts.
  • Collaborate with other teams within the organization to devise and implement data strategies, build models, and assess shareholder needs and goals.
