Course Abstract

Training duration : 4 hours

Modeling is a fundamental aspect of any science. This fact is particularly apparent in data science. The key aspects of modeling that make it important for science are: (1) models are representations of things that cannot be fully understood or known (e.g., predictive models are essential to predict a future outcome, unless you have access to a time machine that we don’t know about); (2) models can give us new insights into those things, including their behaviors, responses, and characteristics (especially in previously unseen conditions), thereby potentially revealing causal factors for observed outcomes and informing prescriptive actions to optimize outcomes; (3) models provide testable predictions to validate our assumptions and hypotheses about things (otherwise, it’s not science); and (4) models can help answer questions that are not otherwise answerable (e.g., we can pose “what if” scenarios safely in a model environment that we would not be able or allowed to test in a real life situation). In data science, we use observation (data, evidence) to inform and inspire our models, we use machine learning (algorithms that learn from patterns in the data) to build testable models, and we use the scientific method to verify, validate, and/or refine our models. The ideal goal of these activities is discovery from data, specifically actionable insights discovery. This two-part course (DS 101 and DS 102) on modeling and machine learning in data science follows two main threads: foundational concepts and practical examples "

DIFFICULTY LEVEL: BEGINNER

Learning Objectives

  • Identify modeling opportunities and categorize them (Detect, Discover, Predict, Optimize)

  • Select appropriate modeling components and ML algorithms for specific use cases

  • Design your own analytics solutions

  • Solve a broad range of problems

  • Generate value from the data assets in your organization

  • Communicate to your stakeholders the importance and meaning of models in data-intensive environments

  • Attendees will also work on data literacy exercises and study exploratory data analysis use cases that broaden and deepen one’s understanding and abilities in insights discovery and value creation from data.

Instructor

Instructor Bio:

Course Outline

Module 1: 
Training Overview and Data Science Preliminaries
Introduction to Modeling Concepts
Supervised vs. Unsupervised Modeling

Module 2: 
Insights Discovery and Generalization
Supervised Learning Concepts 
Predictive vs. Prescriptive Modeling
What does Cognitive have to do with it?

Module 3:
The Two Most Important Things in Data Science
Optimization and Feedback Loops in Modeling
Cold-Start Modeling: When the Data Becomes the Model (Unsupervised ML)
Machine Learning vs. Deep Learning

Module 4:
Common Business Modeling Examples
The OODA Loop in Decision Science and Data Science
When Predictive Modeling Fails
Ethical Modeling 
Enriching Your Models with Smart Data (Semantic Tags, Labels, Annotations)
Exploiting High-Variety Data to Achieve Better Model Outcomes
Steps to Data Analytics Mastery

Background knowledge

  • Data scientists, data analysts, business intelligence practitioners, data users, and other analytics-related professionals are the target audience for this training. Generally, this training is for anyone:

  • Who seeks to understand how machine learning works and how ML models can deliver actionable insights, decision support, and value to their organization.

  • Who wants to become more knowledgeable and proficient in identifying machine learning opportunities and in contributing to ML modeling applications.

  • Who seeks to learn the power of machine learning models in thought and action, in order to progress in your own career journey (e.g., from data analyst to data scientist).

  • Some experience with machine learning will make this workshop easier to follow, but all that is required is basic knowledge of the concepts.