Summary
Overview
Work History
Education
Skills
Work Authorization
Degrees
Timeline
Generic

Ram Chandra Tiwari

Euless,United States

Summary

Platform engineer with full end-to-end responsibility for enterprise-scale, multi-petabyte Hadoop ecosystems (Cloudera CDP Private/Public Cloud, CDH, HDP, AWS EMR, Databricks) in on-premise and cloud environments (AWS, Azure, Comcast CCP). Proven cluster-level expertise in performance engineering, Linux kernel optimization, security governance, high availability, disaster recovery, and large-scale automation using Ansible, Chef, Terraform, Bash, and Infrastructure-as-Code. Expert in Cloudera Manager, Cloudera Data Hub (CDH), and CDP deployments with deep hands-on experience in SDX (Shared Data Experience), Ranger, Atlas, and Ozone administration Owned and operated multiple petabyte-scale production clusters (HDFS, Ozone, YARN, Hive, Spark, HBase, Zookeeper) on Cloudera CDP/CDH, Hortonworks HDP, AWS EMR, and Databricks Led full-lifecycle Ozone deployments including installation, upgrades, Ratis HA configuration, multi-tenant Organizations/Volumes/Buckets with quotas, and Ranger fine-grained ACLs Optimized cluster performance through Linux kernel tuning, JVM GC tuning, YARN queue design, memory management, and workload co-location Designed and implemented enterprise observability solutions using Cloudera Observability, Prometheus, Grafana, and automated alerting Built cluster-wide automation using Ansible, Chef, Terraform, Python, and Bash for 1000+ node fleets Owned end-to-end platform security: Kerberos, Ranger RBAC, Knox Gateway, TLS encryption, encryption zones, and MFA Designed secure, highly available AWS environments and automated infrastructure as code using CloudFormation Administered large-scale Kafka and streaming platforms with enterprise-grade performance and high availability Proven ability to manage Cloudera licenses, support ticket resolution, and implement Cloudera Data Visualization (CDV) and Cloudera Machine Learning (CML)

Overview

11
11
years of professional experience

Work History

SENIOR BIG DATA PLATFORM ENGINEER

Mastercard
St. Louis, Missouri
11.2023 - Current
  • Led end-to-end administration, installation, upgrades, and migrations (CDH/HDP → CDP Private & Public Cloud) for multi-petabyte Cloudera environments; oversaw Cloudera Data Services (ECS) deployments
  • Managed Cloudera License usage and worked with Cloudera Support for critical issue resolution
  • Primary Ozone Administrator: designed, implemented, installed, upgraded, and operated petabyte-scale Ozone clusters; managed OM, SCM, datanodes, and Ratis HA
  • Implemented Ozone S3 Gateway for cloud-native application integration
  • Primary Kafka Administrator for large-scale, multi-region production clusters: proactively monitored, tuned, and optimized performance, health, and availability
  • Drove Hive & Spark performance optimization on multi-PB datasets, achieving 40-70% faster queries and 50%+ storage reduction
  • Integrated Apache Iceberg for transactional data lake management and time-travel queries
  • Optimized core ecosystem components: HDFS (block placement, replication, NameNode HA), HBase (region splits, compaction throttling), YARN (queue design, fair scheduler), and Hive (parameter tuning, partitioning)
  • Owned platform-wide security and governance: Knox Gateway, Ranger RBAC, Kerberos + LDAP/AD, TLS encryption, Atlas metadata management
  • Implemented tag-based policies and classification workflows in Atlas
  • Integrated Cloudera Telemetry Publisher with AWS for cost allocation and proactive capacity planning
  • Built and enforced infrastructure-as-code and automation using Chef, Ansible, Terraform, Bitbucket CI/CD, Python, and Bash
  • Deployed and managed Cloudera Data Visualization (CDV) and Cloudera Machine Learning (CML) for analytics and data science teams

SENIOR HADOOP PLATFORM ENGINEER

Automobile Club of Southern California
Coppell, Texas
03.2022 - 11.2023
  • Led periodic cluster patching, parcel upgrades, hotfixes, and vulnerability remediation (Log4j, Spring4Shell) across Cloudera CDP/CDH fleets
  • Managed Cloudera Manager configurations and service restarts in coordinated fashion
  • Owned end-to-end platform hardening and security: Kerberos + LDAP/AD integration, MFA, Ranger fine-grained authorization, Knox Gateway
  • Troubleshot Kerberos ticket renewals, SPNEGO, and cross-realm trust configurations
  • Enforced TLS/SSL everywhere, data encryption at rest, full-disk encryption, and network security
  • Proactively monitored, tuned, and optimized cluster performance through OS-level tuning, JVM heap sizing, and continuous research
  • Recommended and guided Data Engineering/Data Science teams on Hive, Spark, and Impala optimization techniques
  • Developed custom tools, utilities, and automation (Python, Ansible, Bash, PowerShell) for job management and performance monitoring
  • Built and maintained comprehensive alerting (Cloudera Manager + custom scripts) for host/service availability
  • Owned disaster recovery and high availability planning: NameNode HA, HDFS federation & replication, HBase multi-cluster replication
  • Implemented cross-cluster HDFS snapshots and distcp for backup and migration

BIG DATA ADMINISTRATOR

Comcast
Philadelphia, PA
07.2019 - 02.2022
  • Deployed and administered multi-cloud big data platforms across AWS EMR, Azure Databricks, Comcast Cloud Platform, Cloudera CDP, and Hortonworks HDP
  • Executed HDP → CDP migrations including data, workloads, security policies with zero downtime
  • Managed Cloudera Manager deployment templates and parcel distributions
  • Performed major and minor platform upgrades (Cloudera CDH/CDP, HDP) for security patches and performance enhancements
  • Implemented enterprise governance and security stack: Cloudera Ranger, Apache Atlas, SSL/TLS encryption, SSO, LDAP/S + Active Directory
  • Designed and enforced Ranger policies for Hive, HDFS, and Kafka
  • Established hybrid-cloud disaster recovery with HDFS & Hive cross-cluster replication and snapshot-based backups
  • Configured and tuned YARN Queue Manager for Capacity Scheduler and Fair Scheduler hierarchies
  • Built infrastructure automation using Ansible playbooks, Terraform modules, and Crontab-based scheduling
  • Administered Databricks workspaces in AWS and Azure: Unity Catalog setup, S3/ADLS integration, cluster policies, IAM role management
  • Developed and supported streaming and batch pipelines using Spark Structured Streaming, Sqoop, SparkSQL, PySpark, and SparkR
  • Conducted performance benchmarking and bottleneck analysis in hybrid on-prem/cloud environments

DATA ANALYST

The University of Tulsa
Tulsa, Oklahoma
08.2014 - 07.2019
  • Worked on data cleansing and data integration to prepare data for analysis
  • Managed databases for both structured and unstructured data
  • Visualized data using Tableau, Python, and R
  • Installed and configured MySQL, PostgreSQL, and Hadoop for big data analysis
  • Assisted undergraduate students in data manipulation, analysis, and visualization
  • Analyzed optical, electronic, and structural properties of Quantum Dots
  • Involved in technical writing - research reports, manuscripts, and external grants
  • Presented research findings at colloquia and conferences

Education

MASTERS - ELECTRICAL AND COMPUTER ENGINEERING

The University of Tulsa
Tulsa, Oklahoma
12.2018

Skills

  • Cloudera CDP Private/Public Cloud
  • CDH
  • HDP
  • AWS EMR
  • Databricks
  • HDFS
  • MapReduce
  • Apache Ozone
  • Hive
  • YARN
  • Kafka
  • Zookeeper
  • Spark
  • Impala
  • Hue
  • Ranger
  • HBase
  • Phoenix
  • Kudu
  • Zeppelin
  • SparkSQL
  • NiFi
  • Apache Iceberg
  • Apache Atlas
  • SQL Server
  • AWS RDS
  • Teradata
  • MySQL 55/56/57
  • Microsoft SQL
  • PostgreSQL
  • RDS
  • MongoDB
  • Hadoop HBase
  • Apache Cassandra
  • PySpark
  • SQL
  • Scala
  • R
  • MATLAB
  • Java
  • Amazon Web Services (AWS)
  • Microsoft Azure
  • Google Cloud Platform (GCP)
  • SAS Data Management
  • Sqoop
  • Qlik
  • Red Hat Linux
  • Unix
  • Ubuntu
  • CentOS
  • Windows
  • MacOS
  • Tableau
  • Matplotlib
  • Seaborn
  • Microsoft Power BI
  • QlikView
  • Cloudera Data Visualization (CDV)
  • CyberArk
  • SSL/TLS
  • LDAPI/DAPS
  • Quest
  • Kerberos
  • Knox
  • RSA Token
  • MFA
  • Puppet
  • Ansible
  • Chef
  • Terraform
  • CloudFormation
  • Cloudera Manager
  • Ambari
  • Prometheus
  • Grafana
  • Cloudera Observability
  • Nagios
  • Organizational skills
  • Analytical skills
  • Decision-making
  • Production
  • Multitasking Abilities
  • Continuous improvement
  • Disaster recovery planning
  • Database administration
  • Virtualization technologies
  • Collaboration and communication
  • DevOps methodologies
  • Incident response
  • Load balancing
  • Linux system administration
  • Continuous integration and deployment
  • Capacity planning
  • Network security
  • Time management
  • Team collaboration
  • Adaptive learning
  • Problem solving
  • Platform security protocols
  • Cluster performance tuning
  • Big data architecture
  • Cloudera administration
  • Data visualization tools

Work Authorization

Authorized to work in the United States (Green Card)

Degrees

  • ME
  • MS
  • M.Sc

Timeline

SENIOR BIG DATA PLATFORM ENGINEER

Mastercard
11.2023 - Current

SENIOR HADOOP PLATFORM ENGINEER

Automobile Club of Southern California
03.2022 - 11.2023

BIG DATA ADMINISTRATOR

Comcast
07.2019 - 02.2022

DATA ANALYST

The University of Tulsa
08.2014 - 07.2019

MASTERS - ELECTRICAL AND COMPUTER ENGINEERING

The University of Tulsa
Ram Chandra Tiwari