Description
Supervised machine learning projects typically require labeled data to train their models. That data needs to be high quality and match the problem you are trying to solve. But how do you obtain this kind of data?
In this session, Magda will show you how to build data labeling pipelines through crowdsourcing. Crowdsourcing is a scalable approach that can be applied to a variety of domains. Magda will share some examples of real-life labeling projects and show you what best practices to apply in the process.
Local ODSC chapter in Dublin, Ireland
Use code ODSCCommunity2022 to get 70% off your ODSC East pass: https://hubs.li/H0_t0tT0
Instructor's Bio
Magdalena Konkiewicz
Data Evangelist at Toloka
Magdalena holds a Master's degree in Artificial Intelligence from Edinburgh University. She’s worked as an NLP Engineer, Developer, and Data Scientist for businesses in Europe and America. She now teaches and mentors Data Scientists, and regularly contributes to publications like Towards Data Science.
Webinar
ON-DEMAND WEBINAR: How to set up an ML data labeling pipeline: best practices and examples