Description
Deep learning-powered retrieval systems are the backbone of modern LLM applications, enabling efficient semantic search, recommendation, and knowledge retrieval. This talk unpacks the journey behind building, training, and scaling the Nomic Embed model series, one of the most widely used open-source, multilingual, multimodal embedding families on Hugging Face, with more than 35M downloads. We'll explore the data curation pipeline, training infrastructure, and inference-time objectives that make these models state-of-the-art, including innovations like Matryoshka resizability, quantization, and sparse mixture-of-experts training. We'll also discuss the unique challenges of pushing applied deep learning forward in an industry lab, where real-world constraints drive advances in embedding model design.
Instructor's Bio

Max Cembalest
Developer Advocate at Nomic AI
Max Cembalest is a developer advocate at Nomic AI who combines a rigorous math and CS foundation (Wesleyan B.A., 3.95; Harvard M.S. in Data Science) with five years of building ML products and educational technology. He previously worked as an ML engineer at Arthur and taught mathematics and computer science at African Leadership Academy, using classroom practice to shape practical, teacher-centered AI tools. Max's hands-on experience ranges from Cozmo robotics and Scratch integration to shipping ML systems, giving him a rare mix of developer, educator, and product intuition. A self-described "creative quant" with a love of puzzles and a theatrical background, he brings clear communication, playful curiosity, and a healthy skepticism of undue faith in AI, plus the conviction that logarithms are underrated.
Webinar
Talk "Building State-of-the-Art, Open-Source Embedding Models"
Ai+ Training