AI Systems Architect & Technology Leader
Writing about AI infrastructure, edge intelligence, and the engineering realities that hype cycles miss. 20+ years building the systems AI actually runs on.
Chhavi Jain is an AI systems architect and technology leader with 20+ years building large-scale telecommunications and machine learning platforms. She has led enterprise AI platform initiatives across financial services and technology, spent a decade at Qualcomm driving AI inference optimization and on-device ML powering billions of devices — partnering with Apple, Meta, Google, and Samsung — and founded Live AI Dream, a GenAI inference platform company built in partnership with Microsoft Azure and NVIDIA.
She holds an MS in Data Science & AI from UIUC, an MBA from IIM Calcutta, and is a recipient of the President of India Award, and a TinyML certification from Harvard. She serves as IEEE SPS Vice Chair and mentors at UCSD at the intersection of AI research and real-world impact.
Her technical work spans LLMs, RAG, agentic AI, quantization, and distributed systems. She has published IEEE research, holds multiple granted patents in 5G + AI systems, and has spoken at TinyML, IEEE Women in Engineering, and industry AI conferences globally.
Published on Substack, Medium, and LinkedIn. Writing from 20 years inside the infrastructure AI depends on.
Speaking at the intersection of on-device AI, TinyML, wireless intelligence, and engineering leadership.
The engineering problems that don't make headlines — but determine whether AI works at scale.
Notable open-source contributions from my decade at Qualcomm — tools used by researchers and engineers worldwide to optimize AI models for deployment on edge devices.
Advanced quantization and compression library for trained neural network models. AIMET enables INT8 and INT4 quantization with less than 1% accuracy loss — making large models deployable on edge devices like smartphones without retraining. Techniques include AdaRound, SeqMSE, Cross-Layer Equalization, and Quantization-Aware Training for PyTorch and ONNX models.
3 granted patent families in 5G + AI systems. 5 provisional families in Edge AI. Filed across the United States, Europe, China, and PCT.
Patent citations from industry leaders across telecommunications and device manufacturing — reflecting the influence of these innovations on later generations of wireless and AI systems.
Cited by
Media inquiries, speaking engagements, collaboration, or a conversation about edge AI and the infrastructure layer.