Dynamic AI Systems Architect and Linux Engineer with a robust background in machine learning infrastructure, cloud computing, and system administration. Proven success in designing and deploying AI-driven micro-cloud architectures while optimizing AI inference workflows and leading cross-functional development teams. Expertise in leveraging containerization, virtualization, and GPU-accelerated computing to enhance system performance. Committed to integrating automation, data analytics, and scalable AI solutions to drive efficiency and innovation, particularly in high-stakes environments demanding regulatory compliance and operational excellence.
Built and scaled a cross-functional team of full-stack and ML engineers from 1 to 18 members within 18 months.
Led the design and implementation of specialized inference pipelines for retrieval and augmentation in a generative AI application for religious content.
Architected and deployed multiple ML applications on custom cloud infrastructure tailored for on-premises workloads.
Conducted advanced research into novel search and retrieval techniques for generative AI use cases.