Get this course for free with Premium Ai+ subscription
Description
If you have worked with AI in any capacity, you'll know that AI is only as valuable as the data in can leverage. Data is the cornerstone of AI, and developers need better ways to transform complex documents into structured data ready for model training and inference.
In this session we will learn how to turn common, real-world documents and scans into structured data for search and RAG. In this 90-minute, code-along workshop, you’ll learn the strengths of Docling, an open-source toolkit for advanced document conversion, allowing you to leverage your data more effectively into AI workflows. We’ll complete three labs; Conversion, Chunking, and RAG, and you’ll leave with runnable notebooks from a public GitHub repo.
Instructor's Bio

Mingxuan Zhao
Developer Advocate at IBM
Ming Zhao is an open source developer and Developer Advocate at IBM Research, where he helps IBM leverage open technologies while building impactful tools and growing vibrant open-source communities. He’s passionate about making open tech accessible to all and ensuring developers have the tools they need to succeed in the rapidly developing AI space. Ming now leads community efforts around Docling, IBM’s fastest-growing open source project, recently welcomed into the LF AI & Data Foundation.
Webinar
-
1
Workshop "Complex Data Ingestion with Open Source AI"
-
Ai+ Training
FREE PREVIEW -
Workshop recording
-
Additional information
-
Dozens of Free Courses with Premium
-
All Courses
ODSC 2025: 6-Week Winter AI Bootcamp
69 Lessons $499.00
-
All Courses
Agentic AI Summit 2025
38 Lessons $399.00
-
All Courses
ODSC East 2025 - All Recordings
61 Lessons $299.00
-
All Courses, RAG
ODSC AI Builders 2025 Summit - Mastering RAG
26 Lessons $299.00
-
All Courses
ODSC AI West 2025 - All Recordings
56 Lessons $299.00
-
All Courses
Deep Learning Bootcamp with Dr. Jon Krohn
7 Lessons $699.00