Live Training Overview

Duration: 3 Hours, November 15h, 1 PM ET

If you can write a model in sklearn, you can make the leap to Bayesian inference with PyMC3, a user-friendly intro to probabilistic programming (PP) in Python. PP just means building models where the building blocks are probability distributions! And we can use PP to do Bayesian inference easily. Bayesian inference allows us to solve problems that aren't otherwise tractable with classical methods. The intention is to get hands-on experience building PyMC3 models to demystify probabilistic programming / Bayesian inference for those more well versed in traditional ML, and, most importantly, to understand how these models can be relevant in our daily work as data scientists in business. Let's build up our knowledge of probabilistic programming and Bayesian inference! All you need to start is basic knowledge of linear regression; familiarity with running a model of any type in Python is helpful.

30% discount ends soon:

  • 00 Days
  • 00 Hours
  • 00 Minutes
  • 00 Seconds

DIFFICULTY LEVEL: BEGINNER

Instructor Bio:

Risk Management Specialist | Federal Reserve Bank of Chicago

Lara Kattan

Lara is a risk specialist with the Federal Reserve Bank of Chicago and occasional adjunct at the University of Chicago's Booth School of Business, teaching Python and R. Previously she's taught a data science bootcamp and built risk models for large financial institutions at McKinsey & Co.

Course Outline

Module 1. What is probabilistic programming?

  • PP is the idea that we can use computer code to build probability distributions

  • Theory of the primitives in probabilistic programming and how we can build models out of distributions

Module 2: What is Bayesian inference and why should I add it to my toolbox on top of classical ML models?

  • Classically, we had simulations, but they run in only one direction: get data input and move it according to assumptions of parameters and get a prediction

  • Bayesian inference adds another direction: use the data to go back and pick one of many possible parameters as the most likely to have created the data (posterior distributions)

  • Use Bayes' theorem to find the most likely values of the model parameters

Module 3. What is PyMC3 and how can I start building and interpreting models using it?

  • We'll work through actual examples of models using PyMC3, including hierarchical models

  • Solving Bayes’ theorem in practice requires taking integrals, and if we don’t want to do integrals by hand, we need to use numerical solution methods

  • From the package authors: "[PyMC3 is an ]open source probabilistic programming framework written in Python that uses Theano to compute gradients via automatic differentiation as well as compile probabilistic programs on-the-fly to C for increased speed"

Background knowledge

  • Basic Python and machine learning, sklearn, some stats and probability