YEARS OF PROFESSIONAL EXPERIENCE
YEARS OF PROFESSIONAL EXPERIENCE
๐ Hi, I'm Aslesha Reddy, a passionate and performance-driven Data Engineer with over 4 years of experience in designing scalable data pipelines and robust ETL/ELT workflows for structured & unstructured data.
๐ก I specialize in Big Data tools like Apache Spark, Kafka, Airflow, and NiFi, and work extensively with cloud platforms including AWS, GCP, and Azure. My focus is building real-time and batch processing pipelines that empower data-driven decisions.
๐ง I bring expertise in:
๐ My mission is to make data fast, reliable, and usable at scaleโtransforming complex business problems into smart data solutions.
๐ Built scalable real-time and batch pipelines with Kafka, Spark, and AWS Glue for processing critical financial datasets
๐ง Developed Spark-based ETL workflows in Python, enhancing fraud detection & customer insights
๐ Reduced ETL latency by 35% via optimized deployment on AWS (S3, Athena, Redshift)
๐งช Integrated Databricks and Snowflake for downstream ML & analytics workflows
๐ก๏ธ Led implementation of data lineage, classification, and automated quality checks
โ๏ธ Automated deployment using CI/CD pipelines (Jenkins, GitHub) & IaC tools
๐ฐ Improved infrastructure performance by 40% through distributed computing on AWS EMR
๐ Partnered with finance teams to enforce privacy & compliance standards across datasets
๐งฉ Built end-to-end ETL/ELT pipelines with PySpark, Ab Initio, and Teradata, improving reliability by 25%
โ๏ธ Migrated data to AWS cloud (S3, Glue, Athena) for real-time analytics
๐๏ธ Designed modern data warehouses to support analytical workloads
๐ Streamlined deployments via CI/CD integration with Jenkins and Git
๐งฌ Implemented MDM and data quality frameworks, enabling trusted reporting via Tableau and Qlik
๐ Contributed to ML pipeline orchestration using Databricks and MLFlow
๐ ๏ธ Optimized SQL & Python scripts, reducing job runtimes by up to 30%
๐ Participated in agile sprints and documentation aligned with SDLC
Languages & Programming: Python ๐ PySpark โก SQL ๐๏ธ Java โ R ๐
Big Data: Apache Spark ๐ฅ Kafka ๐ก Hadoop ๐ HBase Pig ๐ท NiFi
Cloud & ETL: AWS โ๏ธ (Glue, S3, Redshift) GCP ๐ (BigQuery, Dataflow) Azure ๐ (Data Factory, Synapse) Databricks Snowflake โ๏ธ dbt
Databases: Teradata Oracle PostgreSQL MongoDB Cassandra DynamoDB
DevOps & Infra: Jenkins GitHub Docker ๐ณ Kubernetes โธ๏ธ Terraform ๐ Airflow ๐ฌ๏ธ CI/CD
Analytics & Visualization: Tableau ๐ QlikView QlikSense Jupyter ๐
Data Governance: MDM Data Quality ๐งน Lineage GDPR, HIPAA Compliance โ
ML & Orchestration: MLflow MLOps ๐ง Distributed Systems
Other Tools: Ab Initio Unix/Linux ๐ฅ๏ธ Confluence ๐