Data Science and Machine Learning in the Cloud for Cloud Novices
This course is only available as a part of subscription plans.
Training duration: 90 minutes
DIFFICULTY LEVEL: BEGINNER
Understand Practical use of cloud computing resources in data science and machine learning
Create a new GCP account and explore documentation and tutorials offered
Explore public datasets hosted on GCP’s BigQuery service
Use SQL to do data analysis on a public dataset
Create a Jupyter notebook on a free-tier compute environment and use Python to analyze data
Create an RStudio Community server environment on a free-tier compute environment and use R to analyze data
Create a machine learning predictive model on public data
Instructor Bio:
Joy Payton
• Cloud computing concepts and vocabulary
• Cloud providers
• Free tier and cost considerations
• Public datasets and citizen science
• Redundancy, security, and privacy
• Continuum of management levels
• Cloud data storage and analytics
• Machine learning in the cloud
This course is for current or aspiring Data Engineers, Machine Learning Engineers, Software Engineers and Data Analysts
Knowledge of following tools and concepts:
Background R or Python will be helpful, although it is not rigorously required
This training will be useful for those considering cloud adoption, interested in data engineering, or interested in working with public data as citizen scientists.