ANTENEH TEFERA

ATLANTA, GA

Summary

Dynamic data architecture professional with a proven ability to develop comprehensive strategies aligned with business objectives. Expertise in advanced data modeling techniques and architectural frameworks enhances system efficiency and reliability, fostering a robust environment for data-driven decision-making. Strong track record in optimizing data management practices ensures seamless integration and drives innovative solutions that propel business growth. Recognized for collaborative teamwork and adaptability in fast-paced environments, with proficiency in data modeling, database design, and cloud-based architecture to meet evolving project demands.

Overview

21 years of professional experience
1 Certification

Work History

Lead Data Architect

Florida Department of Health – TB Control Section
Tallahassee, FL
07.2023 - Current
  • ETL Design & Data Pipelines: Designed and developed robust ETL solutions to load data from staging environments into data marts, ensuring seamless integration across multiple heterogeneous sources.
  • Data Integration & Transformation: Built and maintained scalable data pipelines using Azure Databricks, Azure Data Factory (ADF), SSIS, and Python, streamlining data ingestion, transformation, and orchestration workflows.
  • Advanced Data Visualization & Reporting: Created actionable visualizations (line graphs, bar charts, heatmaps) using PySpark in Azure Databricks to analyze trends in STD, TB, and HIV datasets. Developed interactive dashboards and reports using Power BI, Qlik Sense, and SSRS.
  • Data Analysis & Experimentation: Conducted A/B testing and experimental design to assess program impacts. Supported data profiling, validation, and cleansing to ensure high data accuracy and integrity.
  • Data Warehousing & OLAP: Developed and deployed OLAP solutions using SQL Server Analysis Services (SSAS) and MDX, enabling multidimensional data analysis and performance-optimized reporting.
  • Health Data Formats & Integration: Processed and transformed structured healthcare data formats such as HL7 and XML using ADF, SSIS, and Azure Databricks for integration into centralized data repositories.
  • Database Management & Query Optimization: Wrote and optimized complex SQL queries, stored procedures, and views. Monitored performance of ETL and query operations and implemented optimizations as needed.
  • Team Leadership & Mentorship: Mentored junior data engineers and interns, promoted knowledge sharing, and led team collaboration to ensure high-quality deliverables and agile project execution.
  • BI & Reporting Solutions: Designed and delivered BI solutions using SSRS, Logi Analytics, SAP Crystal Reports, and Qlik Sense. Administered Qlik Sense Server via Qlik Management Console (QMC) to manage user roles, reload tasks, and app deployments.
  • Infrastructure Optimization: Continuously improved system performance, scalability, and cost-efficiency. Implemented high availability strategies and proactively resolved infrastructure-related issues.
  • Public Health Data Systems: Extensive experience with CAREWARE CTLS and HMS for managing HIV/AIDS, STD, and SSuN-related care data, including RSR submissions.
  • Designed data models to support TB surveillance and reporting initiatives.
  • Collaborated with cross-functional teams to enhance data integration processes.

Senior Data Engineer

Hudson Advisors
Dallas, TX
01.2023 - 06.2023
  • Assisted in developing ETL pipelines to extract, transform, and load commercial real estate occupancy data from various global sources. Supported integration of data related to underwriting, trading, financing, and management of structured credit assets, including RMBS, CDOs, and mortgage portfolios.
  • Executed data integration and standardization processes to transform complex heterogeneous data into uniform formats.
  • Designed and implemented data pipelines for efficient ingestion and processing of environmental, social, and governance (ESG) data.
  • Enhanced performance of Apache Spark applications within Hadoop environments by optimizing Spark context, SQL DataFrames, and RDD configurations.
  • Engineered dynamic dashboards and interactive visualizations using Tableau to enhance asset performance reporting.
  • Assisted in generating regular reports on asset performance through data capture, transformation, and visualization. Supported leadership in formulating data-driven recommendations that align with strategic objectives.
  • Assisted in managing data integration platforms. Supported environment monitoring to maintain system performance. Coordinated deployment schedules and performed backups for system reliability.
  • Engaged with asset managers, analysts, and business teams to clarify data requirements, enhancing the efficiency of report delivery processes.
  • Engineered and executed scalable data pipelines to optimize data integration workflows.
  • Facilitated collaboration among diverse teams to optimize data architecture, supporting enhanced system performance and reliability.

Senior Data Engineer

Capital One
Plano, TX
01.2022 - 12.2022
  • Assisted in developing record-level analysis processes to identify inconsistencies between SHAW and PFC exit program datasets. Organized mismatches by type and root cause to enhance data reliability.
  • Optimized root cause analysis (RCA) framework utilizing Scala Quantum, AWS EMR, and shell scripting to identify and rectify data discrepancies across diverse platforms.
  • Data Comparison Platform: Stabilized and optimized a data comparison system supporting SHAW and PFC data sources, improving detection and remediation of inconsistencies.
  • ETL Pipeline Development with AWS Glue: Developed ETL jobs to migrate campaign and Adobe data from S3, ORC, Parquet, and text files into Redshift, using AWS Glue and PySpark for data aggregation and consolidation.
  • Data Lake Architecture: Created and managed external partitioned tables using Hive, AWS Athena, and Redshift, optimizing data structure and query performance.
  • Analyzed data trends and patterns to uncover discrepancies, driving recommendations for improvements in upstream systems.
  • Data Visualization with Tableau: Built heatmaps and column-level reports to illustrate table-level mismatches between SHAW and PFC datasets stored in Snowflake.
  • Developed and managed AROW jobs to ensure seamless operations through effective database object registration and exchange processes.
  • Onboarding & Ad-Hoc Support: Provided onboarding, training, and ad-hoc troubleshooting for SHAW systems, scripts, reports, AROW jobs, and RCA tools.
  • Architected and deployed scalable data pipelines to facilitate seamless data integration across systems.
  • Led initiatives with cross-functional teams to refine data architecture, improving system performance and reliability.

Data Engineer

Florida Department of Health
Tallahassee, FL
01.2017 - 06.2022
  • ETL Design & Automation: Designed and implemented ETL workflows using SSIS to load data from staging servers into data marts and the central data warehouse. Leveraged data flow transformations such as Lookup, Fuzzy Lookup, and Derived Column, along with Script Tasks, for efficient transformation.
  • Process Scheduling & Monitoring: Automated SSIS package execution using SQL Server Agent, configured SQL Mail Agent for alerts, and scheduled FTP-based file transfers. Developed master packages for centralized execution and monitoring.
  • Error Handling & Logging: Implemented checkpoints, event handling, and detailed logging mechanisms to ensure robustness and simplify troubleshooting of ETL processes.
  • Database Engineering & Integrity: Developed stored procedures, triggers, views, and indexes while ensuring referential integrity across all schema objects. Managed DML/DDL changes to maintain data consistency.
  • Data Modeling & Documentation: Led data modeling initiatives using tools like Erwin and documented technical designs, transformation logic, and architectural decisions to support data governance and scalability.
  • BI Reporting & Visualization: Created and deployed parameterized reports using SSRS, Qlik Sense, SAP Crystal Reports, and Logi Analytics. Administered Qlik Sense Server via QMC and supported NPrinting operations.
  • Performance Optimization: Monitored and enhanced the performance of HIV/AIDS data systems and SSIS interfaces, resolving bottlenecks and improving ETL speed and reliability.
  • New Interface Development: Developed new data exchange interfaces using SSIS, Azure Data Factory, and File Mover to accommodate evolving reporting and data sharing requirements.
  • Data Exploration & Visualization: Led exploratory data analysis (EDA) initiatives and built insightful visualizations to support evidence-based public health decision-making.
  • Technical Documentation: Maintained thorough documentation of system modifications, ETL processes, and integration workflows to streamline onboarding and compliance.
  • Developed and implemented ETL processes to streamline data integration across multiple health databases, enhancing data quality and reducing processing time.
  • Designed and maintained data pipelines using Apache Kafka for real-time data processing.
  • Designed and implemented data architecture frameworks and data models for TB control initiatives, reporting, and analytics.
  • Collaborated with cross-functional teams to integrate data solutions, ensure data integrity and usability, and enhance reporting capabilities.

Data Scientist (Course Designer & Consultant)

ICATT Consulting Inc.
Jacksonville, FL
09.2016 - 09.2017
  • Developed a comprehensive course catalog emphasizing emerging data science and analytics technologies.
  • Conducted market research to align content with current industry needs.
  • Designed and implemented training materials focused on predictive modeling, machine learning, and statistical methods.
  • Implemented advanced algorithms including logistic regression, clustering, PCA, SVM, random forest, and neural networks for educational case studies.
  • Conducted comprehensive data analysis and generated reports to support various internal project initiatives.
  • Utilized Python and R to support data analysis tasks. Assisted in developing visual representations using RStudio and Visio. Contributed to database management with T-SQL and prepared documentation in LaTeX.
  • Designed and implemented predictive models to support informed decision-making across multiple initiatives.
  • Executed analysis of large datasets utilizing Python and R to enhance data interpretation efficiency.

Research Scientist

Florida A&M University
Tallahassee, FL
02.2015 - 08.2016
  • Engineered Monte Carlo algorithm to simulate charge-carrier conductivity in molecular-doped semiconductors.
  • Executed ab-initio molecular dynamics simulations and density functional theory (DFT) calculations to analyze molecular behavior.
  • Analyzed physicochemical properties of nanoclusters and quantum dots to enhance electronic and electrostatic characterization.
  • Facilitated undergraduate students' mastery of research methodology and scientific computing techniques.
  • Utilized R, Python, C++, Fortran, and LaTeX to support data analysis and project documentation. Collaborated with team members to enhance programming skills and improve project outcomes. Assisted in the development of software solutions using various programming languages.
  • Designed and conducted experiments to advance knowledge in biochemistry and molecular biology.
  • Analyzed complex data sets using advanced statistical software to derive actionable insights.

Research Scientist & Business Analyst (Internship)

TLC Precision Wafer Technology
Minneapolis, MN
05.2014 - 01.2015
  • Created use case diagrams and workflows to model system interaction.
  • Tested high and low band voltage-controlled oscillator (VCO) modules.
  • Conducted gap analyses and requirements engineering for software systems.
  • Collaborated with clients to develop tailored solutions and system enhancements.
  • Key Tools: SQL Server 2012, T-SQL, Excel, Access, R, Python

Research Assistant (Ph.D. Program)

Florida A&M University
Tallahassee, FL
08.2009 - 12.2014
  • Engineered and validated Monte Carlo simulations to model conductivity in molecular-doped semiconductors.
  • Executed DFT-based analysis on nanomaterials, including CdSe quantum dots and bimetallic nanoclusters.
  • Provided mentorship to undergraduate students, focusing on advanced concepts in computational physics and scientific analysis techniques.
  • Assisted in programming tasks using R, Python, C++, Fortran, and LaTeX. Supported team members in developing data analysis and modeling solutions. Contributed to documentation and reporting efforts for project deliverables.
  • Conducted comprehensive literature reviews to inform and support research projects.
  • Assisted in data collection and analysis using statistical software to ensure accuracy and reliability.

Business Intelligence Analyst / Data Analyst / Research Associate

FHE Consultant
Addis Ababa, Ethiopia
02.2005 - 07.2009
  • Provided supplementary information upon request to enhance decision-making processes.
  • Analyzed complex data sets to identify trends and inform strategic business decisions.
  • Developed and maintained interactive dashboards to optimize real-time performance tracking.
  • Streamlined data entry processes by automating routine tasks through SQL scripts and ETL tools.

Education

Master of Science - Data Science

Eastern University
PA
12-2025

Ph.D. - Computational Physics

Florida Agricultural and Mechanical University
Tallahassee, FL
12-2014

Master of Science - Computational Statistical Physics

Addis Ababa University
Addis Ababa, Ethiopia
07-2007

Bachelor of Science - Applied Physics

Addis Ababa University
Addis Ababa, Ethiopia
07-2004

Skills

  • SQL
  • ETL development
  • Security protocols
  • Data quality management
  • Data migration
  • Database design
  • Data analytics
  • Data lake management
  • System architecture design
  • Data governance
  • Data security
  • Machine learning
  • Data modeling
  • Data cataloging
  • Data warehousing
  • Performance tuning
  • Data mining
  • Big data management
  • Excellent communication
  • Critical thinking
  • Decision-making
  • RDBMS
  • ETL processes
  • Database management
  • Data visualization
  • Microsoft SQL Server
  • Data acquisition
  • Master data management
  • Real-time analytics
  • Key performance indicators

Certification

  • Microsoft Certified Professional – Querying Data with Transact-SQL (Mar 2018)
  • DevOps Foundation Certification – DevOps Institute (Mar 2018)
  • Big Data & Hadoop Architect Certification – Intellipaat Software Solutions (Jan 2018)
  • Certificate in Big Data Analytics – IBM Career Education Program (Jan 2018)
  • Data Science Specialization – Johns Hopkins University, Coursera (Dec 2016)

Work Preference

Job Search Status

Not actively looking

Salary Range

$45,000/yr - $200,000/yr
