Spearheaded the automation of J.Crew's e-commerce platform deployment using Ansible, reducing deployment times and minimizing errors during releases.
Developed and maintained CI/CD pipelines using Jenkins and GitLab CI, integrating automated testing and rollback strategies for faster and more reliable feature releases.
Enhanced infrastructure monitoring and observability using Prometheus, Splunk, and AppDynamics, improving incident detection and reducing response times by 40%.
Managed the design, configuration, and maintenance of both cloud-based and privately hosted environments, ensuring high availability and scalability for J.Crew's online platforms.
Optimized e-commerce infrastructure for peak traffic events, reducing downtime by 30% during major sales campaigns and improving overall system performance.
Managed AWS infrastructure, including EC2, S3, and RDS, and configured VPCs, Security Groups, and IAM roles to secure and streamline access control.
Led the migration of key services to AWS, leveraging ECS for containerized applications and RDS for database management, improving platform performance by 30%.
Implemented CI/CD pipelines using Jenkins, enhancing deployment efficiency.
Managed container orchestration with Kubernetes, streamlining application scalability.
DevOps Engineer
Chase
01.2021 - 01.2023
Managed observability and monitoring for complex environments using Datadog, ensuring proactive issue detection and resolution to maintain high system availability.
Automated certificate renewals with an internal CKMS tool, significantly reducing manual intervention and minimizing the risk of security lapses in production environments.
Managed Active Directory across Linux, Windows, and AWS platforms, handling ID/UID management to ensure secure access control and compliance with organizational standards.
Collaborated closely with developers and SREs on migration projects, providing expertise in IAM, access control, and deployment processes to ensure smooth transitions and secure application operations.
Conducted vulnerability assessments and led remediation efforts, working with application owners, project managers, and monitoring teams to address security concerns effectively.
Implemented a blue-green deployment strategy using AWS and Kubernetes, reducing downtime during updates and enabling smooth, reliable transitions between application environments.
Utilized Terraform to automate the provisioning of infrastructure, creating parallel environments that facilitated testing and deployment with consistency and efficiency.
Managed CI/CD pipelines with Spinnaker, ensuring secure, controlled deployments across multiple environments and enhancing the overall deployment process through automation.
Led application migrations from internal infrastructure to AWS, building environments from scratch and deploying key services such as EKS, Kafka, MongoDB, and PostgreSQL, driving cloud adoption and improving scalability.
Integrated Google Cloud Platform (GCP) services to manage and deploy scalable applications, leveraging tools like GKE, BigQuery, and Cloud Storage for optimized performance, multi-cloud integration, and advanced data analytics.
Managed deployments on Azure, implementing CI/CD pipelines and infrastructure automation for scalable and secure cloud solutions.
Developed infrastructure as code solutions with Terraform, improving environment consistency.
Automated system monitoring and alerting with Prometheus, ensuring operational reliability.
System Administrator
Medstrat
07.2020 - 12.2020
Managed 700+ remote servers, ensuring system reliability and performance.
Installed and configured RAID, LVM, encrypted volumes (LUKS), MySQL servers, and OpenVPN clients to optimize system security and storage.
Resolved customer tickets related to Linux server administration, network connectivity, DICOM image issues, and Joints web/native application problems.
Configured Let's Encrypt SSL certificates for PACS API to secure medical data transfers.
Provided end-user support for Joints/Echoes software solutions and Linux systems via remote management tools.
Monitored server performance using tools like sar, top, free, vmstat, and iostat.
Skills
Cloud platform expertise: AWS, Microsoft Azure, GCP
Proficient in Terraform, Ansible, Chef, and an internal CKMS tool
Freelance English - Vietnamese Translator (Remote) at B.I.N. Translation Company