Saurav Paul

Software Architect, DS/ML
Herndon, VA

Summary

Accomplished Architect/Engineer with a proven ability to blend data architecture expertise with machine learning acumen. Recognized for designing and implementing end-to-end data-driven solutions that leverage advanced analytics and AI to drive business innovation. Proficient in leading cross-functional teams, ensuring data governance, and architecting scalable, high-performance machine learning systems. An advocate for best practices, security, and compliance in data and AI endeavors. Seeking to apply this skill set to lead transformative projects that elevate an organization's data and AI strategies.

Overview

15 years of professional experience
6 years of post-secondary education

Work History

Software Architect - DS & ML

Urban Outfitters, Inc.
Herndon, VA
08.2021 - Current
  • Leading technical strategy and setting multi-quarter roadmap for AI/ML Infrastructure.
  • Leading & building the ML-driven search initiative, based on a two-tower model inspired by a paper from Instacart, to solve the cold-start problem using Elasticsearch.
  • Forging strong partnerships with tech leaders in Data, Infra & Product organizations to develop a comprehensive technical roadmap that spans across all URBN brands.
  • Collaborating with internal clients, ML engineers, and data scientists to address their concerns regarding ML development velocity and enable successful implementation of customer use cases.
  • Improved performance of the recommendation engine data pipeline & reduced BigQuery cost from $14 to $0.44 per execution per brand (AN, FP & Urban Outfitters).
  • Defining the architecture & working on a POC for collecting clickstream data from various sources using GCP services.
  • Building an in-house Experimentation Platform using various open-source tools.
  • Mentoring engineers within the team and demonstrating technical leadership.
  • Driving end-to-end architecture of AI/ML on Google Cloud.

Staff Software Engineer

Synchronoss Technologies Inc.
Reston, VA
09.2015 - 08.2021
  • Took ownership of end-to-end feature delivery, i.e. design, development & deployment.
  • Experienced in working with and understanding data from multiple sources.
  • Collaborated with a cross-functional team of Product Managers, UI Team, Data Scientists, QA & SRE Team to deliver features & contribute to DevOps tasks.
  • Integrated the OWASP Enterprise Security API into the Service API Gateway.
  • Led & supervised 6+ software engineers for 4 years to increase the WIP limit of the Scrum team.
  • Worked closely with Data Scientists to push ML models & data pipelines to production using the Data Platform.
  • Mentored engineers, reviewed code & made sure software engineering practices were adopted while delivering end-to-end features.
  • Architected & migrated existing microservices & APIs to a mesh architecture using Docker, Kubernetes, Istio, Envoy & Spring Framework.
  • Architected & implemented a Service Mesh architecture using Docker & Spring Cloud to build REST APIs for data ingress & egress.
  • Architected & implemented a platform feature that helped deploy & scale services using Docker & HAProxy.
  • Set software development standards, best coding practices & data governance for the data platform & data science teams.
  • Refactored the in-house ETL framework code to adopt Lambda Architecture using Spark Streaming & Apache Kafka.
  • Researched and evaluated build vs. buy options for tooling and product building blocks & created prototypes to evaluate the right tools & technologies for the data platform.
  • Collaborated with Data Scientists to make Customer Segmentation & Forecaster compatible with Apache Spark, and built & automated a data pipeline for IoT & a photo album app using microservices, REST API, Elasticsearch, Apache Spark, Apache Kafka, HDFS, Ansible & AWS services like EC2, EMR, S3, Glue, Athena & QuickSight.
  • Implemented ML model monitoring using Elasticsearch, Logstash, Prometheus, Ansible & Grafana to identify data drift & track model performance.
  • Adopted Lambda Architecture & implemented reusable near-real-time in-house ETL components using Java, Apache Kafka & Spark Streaming to address use cases like Parsing, Inter/Intra transformation & Loader for building near-real-time data pipelines.
  • Developed a machine learning cookiecutter Python project for building ML libraries, which reduced boilerplate code by 45%, and integrated it with our multi-module Maven project.
  • Architected, designed, developed & automated major features of the Data platform, such as the “Data Ingestion”, “Data Consumption” & “Data Processing” modules, using an Ansible, Java, Docker & Python based ecosystem.
  • Customized the AWS Deequ library to develop a Data Cataloguer & adopted it in the machine learning pipeline using Java.
  • Architected, designed & developed a DSL microservice using Spring Boot on top of the Elasticsearch 7.x Query Language & Service API, which reduced custom approaches by 65% across all solutions.
  • Architected, designed & developed a Model Serving REST API microservice using Python, FastAPI & Gunicorn, and deployed it using Docker & Ansible.
  • Tuned the performance of the Elasticsearch cluster for the “Tag & Search” feature, reducing latency from 60ms to 10ms.
  • Tuned performance & cost of Spark-based data pipeline jobs by setting appropriate Spark parameters & identifying the columns to use for partitioning, which reduced job running time by 50% & AWS running cost by 40%.
  • Contributed to the Workbench module through v2 using AngularJS/TypeScript, served on top of Elasticsearch & Data Lake.

Senior Software Engineer

Razorsight Corp.
Reston, VA
11.2014 - 08.2015
  • Implemented a reusable, modern in-house batch-processing ETL framework using Java, Apache Spark & MapR-FS to address use cases like Parsing, Inter/Intra transformation & Loader, and built data pipelines using Apache Oozie as part of the Data Processing module of the in-house Data platform, reducing custom coding & improving delivery turnaround time.
  • Contributed to v1 of the Data Visualization module using AngularJS, served on top of Elasticsearch & Data Lake.
  • Automated publishing of Data platform API & User Guide documentation to Confluence using AsciiDoc, Swagger & the Maven build tool, and integrated it into the CI/CD pipeline using Bamboo.
  • Converted the existing HBase Binary Store implementation to MapR-DB JSON Store.
  • Set up CI/CD from scratch for our in-house Data Platform using Docker, Ansible, Git, Bamboo, Maven, Nexus & SonarQube.
  • Developed microservices with OAuth & JWT, and set up microservices monitoring using Prometheus & Grafana.
  • Implemented unit & integration test cases using REST Assured, stubs & mocks in a TDD environment with JUnit/TestNG to perform E2E testing.
  • Integrated code-quality, code-formatting, code-vulnerability & code-coverage checks as part of the build pipeline using Maven.
  • Implemented an Elasticsearch-based Fuzzy Matching Correlation Engine, which resulted in 70% auto-correlated records & helped the Professional Service team file disputes 3x faster.
  • Defined the data-lake structure based on the data & implemented a way in the ETL framework to apply PII logic based on business rules.
  • Researched and evaluated build vs. buy options for tooling and product building blocks & created proof-of-concept prototypes to evaluate the right tools & technologies for the data platform.

Senior Software Engineer

Razorsight Private India Limited
Bangalore, Karnataka
08.2010 - 11.2014
  • Refactored the core (ADRAP) engine of the SaaS product suite using Spring, Spring Data & ActiveMQ, which improved speed by 45%.
  • Developed a desktop Java toolkit application to automate adding component identifiers to JSF XHTML pages, which saved Automation QA Engineers 80% of the time compared with adding them manually.
  • Designed & developed a Business REST API following the multi-tenancy paradigm using JAX-RS, JAXB, Spring, Hibernate, Maven & JSF to support multiple customers on a single deployment.
  • Designed & developed a REST API Services Gateway to facilitate the SaaS platform, adhering to modern development practices (REST, OAuth, API), using Spring Framework & Spring Data to consolidate all API endpoints.
  • Migrated the Ant-based build system to a multi-module Maven project and implemented product versioning, packaging & deployment of WAR & JAR files to improve the release management life cycle.

Software Developer

ITC Infotech India Limited
Bangalore, Karnataka
07.2008 - 07.2010
  • Implemented the module of the Air Freight parcel delivery system that maintains the Center of Gravity, using JSF, Spring & Hibernate.
  • Contributed Air Freight code to the common platform group & helped 2 projects reduce their implementation effort by a substantial amount.
  • Developed an end-to-end project through v1.0 in production, ranging from prototyping to deployment, using Spring, Hibernate, JSF, Maven, GlassFish, Activiti BPM & Hudson (Jenkins).

Education

Bachelor of Engineering - Computer Science

Manipal Institute of Technology
Manipal, Karnataka, India
07.2005 - 05.2008

Diploma - Computer Science

KS Polytechnic
Bangalore, Karnataka, India
04.2002 - 05.2005

Accomplishments

  • Building a centralized experimentation platform.
  • Led a team of 6 engineers to consolidate 5 old Airflow clusters with 200 pipelines into 2 clusters, reducing cost by 60%.
  • Led a complex IoT-based near-real-time data pipeline project, from acquiring hardware and building the sensor gateway application through dashboard conception to deployment in production.
  • Built a data pipeline to provide weekend highlights & flashbacks for a mobile app that caters to 7+ million premium users.
  • Built an offshore & onsite team of 10 by hiring software engineers and mentoring them to improve their skills & their knowledge of our in-house data platform.

Additional Information

2023-09 Professional Machine Learning Engineer, Google Cloud
2022-08 Cloud Architect, Google Cloud
2019-04 Introduction to TensorFlow for AI, ML and DL, Coursera
2019-05 Neural Networks and Deep Learning, Coursera
2018-02 Python for Data Science, DataQuest
2012-09 Cloudera Certified Developer
2013-06 MongoDB for Java Developer
2015-03 Cloud Computing from Illinois
2015-07 MongoDB Certified DBA Associate

Skills

  • Java, SQL, Go
  • Python3, pytest, Sphinx, tox, PEP8, Shell, CentOS
  • Apache Spark, Apache Hadoop, Apache Kafka, Apache ActiveMQ, Apache Parquet, Spark SQL, PySpark, Spark Streaming, Apache Oozie
  • Spring Boot 2.0, FastAPI, Spring Cloud, Spring Framework, Hibernate, Spring Data, OWASP Enterprise Security
  • HAProxy, Nginx, JAX-RS, JAXB
  • Elasticsearch, MariaDB/MySQL, Oracle, MapR-DB/HBase
  • Logstash, Prometheus, Grafana, New Relic
  • Bazel, Git, Maven, Nexus, Jenkins, Bamboo, JIRA, SonarQube
  • Kubernetes, Ansible, Docker
  • AngularJS, TypeScript, JSON, REST API, OAuth, JWT, JSF
  • REST Assured, K6, JUnit/TestNG, Spring Test Framework
  • Amazon S3, EMR, Amazon SageMaker, Amazon EC2, Amazon Glue, Athena
  • GKE, GCR, BigQuery, Cloud Functions, Cloud Run, Pub/Sub, Kubeflow, Airflow, Dataproc, Dataflow, Vertex AI
  • AsciiDoc, Swagger, Confluence
  • Tomcat, Gunicorn, GlassFish