Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Sujayeendra Karanam

Jersey City,NJ

Summary

Data engineer with 4+ years of experience designing, developing, and optimizing data solutions to support data-driven decision-making and achieve business objectives. Adept in data integration, ETL processes, and database management for efficient and reliable data pipelines. Proficient in big data technologies like Hadoop, Spark, and Kafka for handling large volumes of data. Skilled in working with diverse database systems such as SQL Server, Oracle, and MySQL for data integrity and efficient retrieval. Experienced in leveraging cloud platforms like AWS, Azure, and Google Cloud to build scalable and cost-effective data infrastructures. Strong understanding of data modeling and warehousing concepts for efficient storage and retrieval. Proficient in using Tableau and Power BI to transform complex data into insightful visualizations. Demonstrates expertise in implementing data governance frameworks for compliance and data privacy. Skilled in performance tuning and optimization to enhance data processing speed. Collaborative team player with excellent communication and problem-solving skills.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Data Engineer

Intuit Business Solutions
07.2022 - Current
  • Identified and cataloged various data sources within the healthcare organization, including Electronic Health Records (EHR), laboratory results, billing data, and external data sources
  • Developed ETL processes to extract data from disparate sources, ensuring data integrity and consistency, and implemented incremental data extraction mechanisms to capture updates in real-time
  • Standardized data formats, resolved inconsistencies, handled missing or erroneous data during the transformation phase and applied data cleansing and validation procedures to ensure high data quality
  • Used tools like Apache NiFi or Apache Camel to facilitate smooth data flow between systems
  • Deployed machine learning models using Amazon SageMaker to analyze and predict clients’ fiscal data to forecast future spending and profits, and generated reports of recommendations to decrease hospital drug expenditure
  • Monitored and managed cloud resources using AWS CloudWatch, setting up alerts and dashboards for real-time visibility into system performance
  • Designed and implemented a scalable and optimized data warehouse schema to store integrated healthcare data on AWS Redshift
  • Implemented robust security measures to ensure compliance with healthcare data privacy regulations (e.g., HIPAA) also applied encryption, access controls, and audit trails to protect sensitive patient information
  • Used AWS Glue for data ingestion, enabling automatic discovery, cataloging, and transformation of healthcare data, and leveraged AWS DataSync for securely transferring healthcare data to and from AWS
  • Extracted large patient files from NoSQL database (MongoDB) and processed them with Spark using the Mongo Spark connector
  • Designed visually appealing and interactive dashboards in Tableau that present key healthcare metrics and utilized color coding, charts, and graphs to enhance data visualization
  • Designed high-availability and disaster recovery solutions using AWS Backup and AWS Elastic Disaster Recovery, ensuring data resilience
  • Worked very closely with healthcare interoperability and messaging standards, like HL7 2.x, FHIR, HIPAA, Radiology, Patient Access, etc.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Gathered, defined, and refined requirements, led project design, and oversaw the implementation.

Data Engineer

Aspire Systems
01.2020 - 07.2021
  • Involved in designing and implementation of a distributed data processing framework using Apache Spark, achieving a 50% reduction in data processing time for real-time analytics
  • Support data governance policies and security measures with Azure Security Center, implementing best practices for data protection and risk management to protect sensitive data
  • Conducted comprehensive data profiling and analysis, identifying, and resolving data inconsistencies to improve data reliability by 25%
  • Developed Spark/Scala, Python for regular expression (regex) project in the Hadoop/Hive environment with Linux/Windows for big data resources
  • Managed security groups on Azure, focusing on high-availability, fault-tolerance, and auto scaling using Terraform templates
  • Along with Continuous Integration and Continuous Deployment with Azure Functions and Azure DevOps
  • Developed data monitoring and alerting mechanisms using ELK stack, ensuring real-time data accuracy and anomaly detection
  • Worked with Data NoSQL Warehouse in the development and execution of data conversion, data cleaning and standardization strategies and plans as several small tables are combined into one single data repository system MDM (Master Data Management)
  • Used Azure Data Catalog with crawler to get the data from Azure Blob Storage and perform SQL query operations using Azure Synapse Analytics
  • Implemented data integration and analytics solutions using Snowflake, enhancing data processing capabilities and performance.

Education

Master in Information Systems -

Stevens Institute of Technology
Hoboken, NJ
05.2023

Bachelor’s in Computer Science Engineering -

SRM Institute of Technology
Chennai, India
06.2020

Skills

  • SDLC
  • Agile
  • Waterfall
  • Python
  • R
  • SQL
  • SAS
  • NumPy
  • Pandas
  • Matplotlib
  • SciPy
  • Scikit-learn
  • TensorFlow
  • Visual Studio Code
  • PyCharm
  • Eclipse
  • MySQL
  • Microsoft SQL Server
  • MongoDB
  • Oracle Database
  • Postgres
  • Elastic Search
  • Amazon Web Services (AWS)
  • Azure
  • Google Cloud
  • Windows
  • Linux
  • MacOS
  • Data curating
  • Data integration
  • ETL development
  • Data pipeline control

Certification

AWS Cloud Certification

Timeline

Data Engineer

Intuit Business Solutions
07.2022 - Current

Data Engineer

Aspire Systems
01.2020 - 07.2021

Master in Information Systems -

Stevens Institute of Technology

Bachelor’s in Computer Science Engineering -

SRM Institute of Technology
Sujayeendra Karanam