Take The Course

Gain a thorough understanding of the math and intuition behind Gradient Boosting.

Learn to avoid overfitting by implementing improvements to basic Gradient Boosting algorithms.

Get hands-on experience with real-world examples, code, and best practices.


Still pondering? If you're keen to bring your predictive modeling game to the next level, this course is the stepping stone you've been searching for.

Instructor Spotlight

Principal | Numeristical

Brian Lucena,PhD

Brian Lucena is Principal at Numeristical and the creator of StructureBoost, ML-Insights, and SplineCalib. His mission is to enhance the understanding and application of modern machine learning and statistical techniques. He does this through academic research, open-source software development, and educational content such as live stream classes and interactive Jupyter notebooks. Additionally, he consults for organizations of all sizes from small startups to large public enterprises. In previous roles, he has served as SVP of Analytics at PCCI, Principal Data Scientist at Clover Health, and Chief Mathematician at Guardian Analytics. He has taught at numerous institutions including UC-Berkeley, Brown, USF, and the Metis Data Science Bootcamp.

Course Curriculum

  • 1

    Welcome to the course!

    • Welcome to the course!

    • What you'll learn in this course

    • Course Notebook

    • How to use this course

  • 2

    Section 1: Decision Trees

    • Decision Trees Video Lesson

  • 3

    Section 2: Random Forests

    • Random Forests Video Lesson

  • 4

    Section 3: Gradient Boosted Trees

    • Gradient Boosted Trees - Video Lesson

  • 5

    Section 4: Details of Hyperparameters

    • Details of Hyperparameters - Video Lesson

Learning Objectives

By the end of this course, you should be able to:

  • Understand how to analyze network traffic, including what features to extract and how to analyze them.

  • Use Python and Scapy to analyze network traffic in packet capture files and live captures.

  • Develop custom Python scripts to answer questions with network traffic data.

Course Outline

What you'll learn in this course?

This course starts with a single decision tree and progresses to random forests and gradient-boosted tree models.  We cover the various hyperparameters involved in gradient-boosted tree models and characterize their importance.  We also discuss why gradient boosting can generally outperform linear/logistic regression.

  •   Module 1: Decision Trees
  •   Module 2: Random Forests
  •   Module 3: Gradient Boosted Trees
  •   Module 4: Details of Hyperparameters


Background knowledge needed

Helpful, but not required to know.

  • Knowledge of Python

  • Network Traffic Analysis

  • Data Science

Enroll now!

Accelerate your journey to gradient boosting by enrolling in our program today!

Is this course for you?

Is the "Fundamentals of Gradient Boosting" course right for you? If you're keen on delving into machine learning algorithms and understanding how to sequentially ensemble weak predictive models into stronger ones, this course is your go-to resource.