Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

Wei Wang

TX

Summary

Innovative Problem Solver, Linux Enthusiast and Technology Integration Specialist.

Overview

26
26
years of professional experience
6
6
years of post-secondary education

Work History

Lead Technology Integration Engineer

Infinidat
07.2022 - Current
  • Company Overview: High-performance, scalable, and cost-effective enterprise storage solutions
  • Currently leading Infinidat GenAI Toolkit development, enabling Infinidat customers to securely augment public data with proprietary data
  • Single-handedly migrated Infinidat's enterprise storage appliance to AWS and Azure (intro, news1, news2, news3), utilizing extensive Linux kernel modification, CI/CD pipeline creation, advanced networking design, and programming in Python, C/C++, and Terraform
  • Designed a virtual lab leveraging Linux KVM, LXC, Docker, and a range of open-source technologies, significantly improved lab usability, flexibility, and operation&cost efficiency
  • Delivered an OpenStack NFSv3 to iSCSI migration solution for a major client, Fidelity Investments, resolving technology compliance issues and retaining their business
  • Created the IMX Exporter, a Prometheus metrics exporter tailored to address customer monitoring requirements within the Infinidat Metrics system
  • High-performance, scalable, and cost-effective enterprise storage solutions

Senior SRE Engineer

Kraken
10.2021 - 06.2022
  • Company Overview: One of the largest and most secure Cryptocurrency Exchanges
  • Joined Kraken Bank, a startup within a startup project, as the second SRE engineer, responsible for architecting and constructing intricate AWS infrastructure
  • Orchestrated and deployed Kubernetes clusters in AWS with a security-centric approach, emphasizing monitoring, alerting, and seamless integration with Vault
  • Established a Kafka cluster using the Strimzi Operator on Kubernetes to facilitate transactional data processing
  • Formulated a comprehensive logging and metrics monitoring platform grounded in USE and RED methodologies
  • Managed and optimized CI/CD pipelines utilizing GitLab and ArgoCD
  • One of the largest and most secure Cryptocurrency Exchanges

Principal System Engineer

AT&T / Xandr
08.2018 - 10.2021
  • In this Lead Engineer role, I directed a small team in designing, engineering, deploying, and supporting an API/application platform, eventually transitioning to a Data Platform with expanded responsibilities
  • Architected and engineered AWS Kubernetes (EKS) infrastructure for Xandr Dataview applications and APIs
  • Coded infrastructure using Terraform on AWS and Azure
  • Developed CI/CD framework using Jenkins, Tekton, and ArgoCD, with automation and configuration management via Ansible, Python, and Shell
  • Defined best practices for development workflow using standard tools such as Docker, Vagrant, and LXD/KVM
  • Designed, engineered, deployed, and operated an end-to-end monitoring framework for the API/application platform, which later expanded to the entire Data Platform
  • Created system tracing and profiling tools for in-depth analysis of complex issues, including performance problems
  • Served as a troubleshooter, supporting and mentoring the Data Platform team on challenging issues and providing internal knowledge transfer sessions on Linux, networking, and problem-solving skills

Principal Member of Technical Staff

Principal Member of Technical Staff AT&T
09.2016 - 08.2018
  • Technology research and review, focus on cloud storage for AT&T private cloud (AIC)

Senior Member of Technical Staff

RiftIO
05.2015 - 09.2016
  • Company Overview: A Intel & North Bridge funded start-up company in Networking/Cloud/NFV field
  • Customized and automated OpenStack installation for NFV/MANO development, gaining hands-on experience with major OpenStack flavors, including Mirantis (Fuel), RHEL (Director), Canonical (Autopilot), Packstack, and Devstack
  • Engineered, hands-on created and maintained an end-to-end development lab environment, including server racking, cabling, switch and router configuration, DNS, firewall, and networking, as well as automated server installation using Cobbler
  • Utilized virtualization methods (KVM, Virtual Bridge) to set up labs, optimizing the use of available hardware resources
  • Studied and integrated new technologies in computing and networking industries, providing cloud environment integration solutions
  • Example projects: trusted compute with TXT technology, open flowenabled switches in cloud environments, and hybrid cloud GPU provisioning for high-performance computing
  • Packaged Riftware and used Docker to maintain various build and test environments
  • Packaged Riftware, built Cloud-In-A-Box for customer demos, created OpenStack installer VM images, and developed custom Linux kernels
  • Resolved complex issues in cloud environments
  • A Intel & North Bridge funded start-up company in Networking/Cloud/NFV field

Principal System Engineer

Fidelity Investments
02.2012 - 03.2015
  • Customized and maintained internal Linux release, yum repository, and vendor hardware support tools, providing technical solutions such as hardware fault monitoring and alerting
  • Engineered cloud storage (ZFS-based NAS, SAN) by collaborating with vendors and architects to define platform standards, select hardware, test product prototypes, develop performance monitoring frameworks, and automate installation/configuration tasks for the operations team
  • Focused on performance monitoring, alerting, and troubleshooting in large-scale cloud engineering; evaluated and selected open-source projects for the design and implementation of cloud instrumentation and TSD frameworks for Fidelity's private cloud monitoring
  • Developed an automation framework using Chef and other open-source tools, automating complex infrastructure provisioning and application deployment with Chef cookbooks, Python, Shell, and PowerShell scripts
  • Evaluated new hardware and technologies, assisting management in hardware and technology selection through active performance benchmarking and defining performance baselines for multiple products and environments
  • Served as a troubleshooter and technical escalation point for Linux and cloud storage operation teams, acting as an SME for performance, networking, kernel, and driver-related issues

Converge Core & Application Lead Engineer

Nokia Siemens Networks
01.2009 - 02.2012
  • Tier 3 support to T-Mobile Subscriber Data Management product (include over 400 Linux/Unix servers)
  • Perform system update/hotfix testing/installation to Linux/Unix servers in SDM product
  • QA test for new releases
  • Technical escalation for external and internal customers
  • Disaster recovery
  • Develop automating scripts(shell/perl) for system monitoring/audit/update

Operations Technical Team Lead

Nokia Networks
07.2004 - 06.2009
  • Provide 2G/3G network operation/project solutions to network operators
  • Develop method/procedures/tools for Managed Service projects
  • Lead trouble shooting when there is a network problem during project implementation
  • Technical/tool support to operation team
  • Host technical team meetings and seminar
  • Customer interface
  • Answer technical questions from customer in daily base and provide proactive solutions based on understanding customer's network and challenges
  • Sample of projects I worked as technical lead: AT&T North Florida Network Consolidation Automate technical procedures
  • Lead implementation team (10+ members) for BSS rehosting, Cut-overs and trouble shootings activities
  • Hurricane Emergency Support Develop emergency monitoring script
  • Support network recoveries

Radio Access Project Technical Lead

Nokia Networks
07.2007 - 01.2009
  • Provide technical procedures for radio access projects
  • Develop MOP for internal or external customers
  • Develop automation tools/scripts for project implementation, network monitoring, configuration auditing
  • Provide on job training to project engineers
  • Won Employees Choice Award Winner for Q3 and Q4 2008, Innovate Role Models Award
  • Won NSN Top Performers 2009, Innovation value

BSC Engineer

MyCom North America
05.2002 - 06.2004
  • Group lead for AT&T Wireless S10 upgrades project team
  • Nokia Radio Network on-site support for Cingular Northeast region
  • Mentor Cingular Switch Technicians on deploying and maintaining Nokia GSM network
  • Provided on job training on how to use Nokia DX200/Linux platform and OSS/Unix tools
  • Automating switch operational tasks for AT&T

BSC Engineer

Nokia Networks, China
07.1998 - 08.2001
  • Commission and integration embed Linux systems
  • Implemente Software upgrade and Hardware upgrade or retrofit in customer's live networks
  • Automate Radio network parameter database creation/analysis/modification
  • Helpdesk support for China Mobile and Unicom

Education

MS - Computer Science

University of Bridgeport
Bridgeport, CT
01.2001 - 01.2003

Bachelor - EE

Nanchang University
Nanchang, Jiangxi
01.1994 - 01.1998

Skills

Cloud

Openstack

AWS

Azure

Kubernetes

Linux/Unix

RHCE/RHCSA

Certified Redhat Performance Tuning

Storage

Xen/KVM/VMWARE

Networking/CCNA

Scripting with Python

Scripting with Perl

Scripting with Shell

Scripting with Powershell

NFV

Openflow

MySQL

RabbitMQ

Kernel tracing

Automating

LaTex

Accomplishments

  • Created InfuzeOS Cloud Edition that won CRN USA 2024 Tech Innovator Award., 01/01/24
  • Migrated storage appliance Infinibox to AWS and Azure, a requirement for Infinibox to stay on Gartner Leader list 2023., 01/01/23
  • Pioneered the team responsible for establishing the online banking infrastructure in AWS for Kraken, the world's most secure cryptocurrency exchange.
  • Engineered the Vantrix/Artesyn OpenStack solution, showcased at MWC 2016 and BCE 2016., 01/01/16
  • Designed, engineered, and implemented a Linux performance monitoring framework for Fidelity's private cloud.
  • Optimized Linux kernel settings for Oracle Database, increasing query load capacity by 300%.
  • Diagnosed and resolved multiple latency-related performance issues for Fidelity's trading platform.
  • Received Nokia Siemens Networks Top Performers Award in 2009 for outstanding engineering work., 01/01/09

Timeline

Lead Technology Integration Engineer

Infinidat
07.2022 - Current

Senior SRE Engineer

Kraken
10.2021 - 06.2022

Principal System Engineer

AT&T / Xandr
08.2018 - 10.2021

Principal Member of Technical Staff

Principal Member of Technical Staff AT&T
09.2016 - 08.2018

Senior Member of Technical Staff

RiftIO
05.2015 - 09.2016

Principal System Engineer

Fidelity Investments
02.2012 - 03.2015

Converge Core & Application Lead Engineer

Nokia Siemens Networks
01.2009 - 02.2012

Radio Access Project Technical Lead

Nokia Networks
07.2007 - 01.2009

Operations Technical Team Lead

Nokia Networks
07.2004 - 06.2009

BSC Engineer

MyCom North America
05.2002 - 06.2004

MS - Computer Science

University of Bridgeport
01.2001 - 01.2003

BSC Engineer

Nokia Networks, China
07.1998 - 08.2001

Bachelor - EE

Nanchang University
01.1994 - 01.1998
Wei Wang