This session is a hands-on workshop (with coding) that demonstrates how to gain observability (monitoring and alerting) for production machine learning pipelines. We will provide background on why observability is important for running MLOps successfully, then walk through in detail how to set up a robust observability system.

Without a proper observability system, it is impossible to scale a successful machine learning effort. The session will provide ML engineering teams with the tools they need (all available in the open source ecosystem) to solve major visibility gaps in the machine learning lifecycle, including monitoring data quality, job statuses, ML model performance, and retraining.

The session will cover the end-to-end process, from data prep jobs running in Airflow, to model development and experimentation in Jupyter notebooks, to model serving in production.

The content will be of interest to data engineers, data scientists, and anyone else working on machine learning projects.

We recommend that participants have a strong background in Python and at least high-level knowledge of job orchestrators like Airflow, which are used to run automated data pipelines.


Workshop Overview

  • 1. ODSC East 2020: Gaining Machine Learning Observability

    • Workshop Overview and Author Bio

    • Before you get started: Prerequisites and Resources

    • Gaining Machine Learning Observability

Instructor Bios:

Co-founder | Databand.ai

Josh Benamram

Josh is Co-founder of Databand, an APM and observability solution for data engineering teams. Prior to founding Databand, Josh was a Product Manager at Sisense, a business analytics software startup. Josh led product for Sisense's ETL and database integration technologies as the startup scaled to over 700 team members and over 1,000 clients. Before Sisense, Josh worked in venture capital at Bessemer Venture Partners, where he focused on cloud infrastructure and machine learning investments.

CTO & Co-founder | Databand.ai

Evgeny Shulman

Evgeny is Co-founder of Databand, an APM and observability solution for data engineering teams. Evgeny is a data architect and engineer by background. Prior to Databand, Evgeny was the first employee, data architect, and team lead at Crosswise, a big data startup acquired by Oracle Data Cloud. Before Crosswise and ODC, Evgeny was a senior developer, software engineering team lead, and researcher at various startups.