Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

VISWACHANDRIKA BADDULA

Khammam,Telangana

Summary

Motivated and results-driven Data Scientist with proven expertise in data analysis, ETL pipeline development, and business intelligence. Proficient in SQL, Python, Power BI, and Databricks, with a strong ability to extract actionable insights from large datasets and deliver impactful visualizations and reports. Skilled at collaborating with cross-functional teams and leveraging Agile methodologies to design data-driven solutions that drive informed decision-making and optimize business performance.

Overview

2023
2023
years of professional experience
1
1
Certification

Work History

Java Development Intern

Mindtree
03.2022 - 06.2022
  • Developed and debugged Java-based applications, ensuring seamless performance and functionality.
  • Gained expertise in Java frameworks, including Spring and Hibernate.
  • Contributed to code reviews, resolving application issues to improve efficiency and scalability.

Sales Data Analytics and Visualization

[SQL | Python (Pandas, Matplotlib) | VS Code]
  • Extracted, transformed, and analyzed sales data from a relational database using advanced SQL queries, optimizing performance through indexing, subquery refactoring, and efficient JOIN operations.
  • Developed interactive visualizations using Pandas and Matplotlib to highlight key business metrics such as monthly sales trends, customer segmentation patterns, and product-level performance across categories.
  • Conducted requirement analysis from a business perspective, identifying strategic use cases for marketing and sales teams, and documenting insights to support data-driven planning.
  • Applied statistical techniques to compute unique product counts, evaluate fiscal year-over-year performance changes, and identify seasonal variances in sales behavior.
  • Created visually compelling dashboards using Excel and Python libraries (e.g., Seaborn, Plotly), incorporating pie charts, bar graphs, and time-series plots to communicate insights clearly to a non-technical audience.

Customer Segmentation and Insights – RFM & KMeans

[SQL | Databricks | Delta Lake| ETL]
  • Engineered robust, scalable ETL pipelines in Databricks using PySpark to ingest, clean, and transform large volumes of Kickstarter campaign data from raw CSVs into structured formats, improving data quality and readiness for analysis.
  • Designed and implemented data workflows to eliminate duplicates, standardize inconsistent records, and perform complex data mapping and validation, streamlining the integration process into a star schema with clearly defined dimension and fact tables.
  • Built incremental data loading solutions using Delta Lake MERGE operations, ensuring efficient updates to large datasets while maintaining historical accuracy and minimizing compute costs.
  • Leveraged advanced SQL analytics to derive business insights such as campaign success rates, country-wise funding trends, rolling averages of backers over time, and key performance metrics across categories.
  • Automated end-to-end reporting pipelines that populate Gold layer Delta tables, supporting real-time dashboarding, stakeholder reporting, and executive-level decision-making.
  • Ensured pipeline reliability through data quality checks, schema validations, and exception handling mechanisms, enabling seamless deployment and monitoring of ETL workflows.

Weather Insights:Unveiling Precipitation Dynamics

[Excel | MongoDB | Power BI]
  • Led data cleaning and integration efforts in Excel, ensuring data integrity before loading into MongoDB.
  • Designed and documented a robust MongoDB data architecture, creating comprehensive design documents to support system scalability and ease of access.
  • Utilized Agile methodology and contributed to user stories to enhance project organization.
  • Developed optimized queries for efficient data retrieval, creating dynamic dashboards in Power BI to support informed, data-driven decisions.
  • Tested data accuracy through validation techniques, ensuring transparent, error-free visualizations.

Education

Master's - Data Modeling/Warehousing and Database Administration

Rowan University
12-2024

Bachelor of Technology - Electronics and Instrumentation Engineering

VNR VJIET
06-2022

Skills

Data Analysis & Visualization: SQL, Python (Pandas, Matplotlib), Power BI

Machine Learning: Scikit-learn, KMeans Clustering

Data Engineering: Databricks, ETL Pipelines

Programming: Python, Java

Databases: MongoDB, SQL

Cloud & Big Data: AWS, Spark

Certification

  • Coursera: Python for Everybody, Gained hands-on experience in Python programming for data analysis and automation.
  • AWS Essential Training for Developers, Amazon Web Services (AWS)

Timeline

Java Development Intern

Mindtree
03.2022 - 06.2022

Customer Segmentation and Insights – RFM & KMeans

[SQL | Databricks | Delta Lake| ETL]

Weather Insights:Unveiling Precipitation Dynamics

[Excel | MongoDB | Power BI]

Sales Data Analytics and Visualization

[SQL | Python (Pandas, Matplotlib) | VS Code]

Master's - Data Modeling/Warehousing and Database Administration

Rowan University

Bachelor of Technology - Electronics and Instrumentation Engineering

VNR VJIET
VISWACHANDRIKA BADDULA