Year 2 is the heart of the program. This is where you go from understanding data to mastering it — building predictive models, reasoning under uncertainty, engineering data pipelines at scale, and deploying your work into the real world. By the end of this year, you won’t just be analyzing data; you’ll be building production-ready systems that make decisions automatically.

The seven data science courses in Year 2 cover the full spectrum of modern data science: from machine learning and deep learning, to cloud engineering, probabilistic modeling, and MLOps. This is the year that turns a student into a practitioner.

What You’ll Be Able to Do

Build, evaluate, and select machine learning models for classification, regression, and clustering problems

Design and train deep learning architectures — CNNs, RNNs, LSTMs — for images, sequences, and more

Apply Bayesian reasoning to quantify uncertainty and build probabilistic models

Design rigorous surveys and apply sound sampling methods for data collection

Process massive datasets using Apache Spark and PySpark on cloud infrastructure

Architect cloud systems on AWS for storage, compute, and big data workloads

Deploy machine learning models as scalable APIs and cloud services — and monitor them in production

Data Science Courses

Course What You’ll Learn Key Skills & Tools
Machine Learning Master the core toolkit of supervised and unsupervised learning — from decision trees and SVMs to ensemble methods and recommendation systems. Learn how to prepare data, select the right algorithm, and rigorously evaluate results. Scikit-learn, classification, regression, clustering, ensemble learning, model evaluation
Bayesian Data Analysis Learn to reason probabilistically. Move beyond point estimates and understand uncertainty — using Bayesian inference, conjugate priors, MCMC sampling, and hierarchical models to build models that tell you how confident to be in your results. Bayesian inference, MCMC (Metropolis, Gibbs), posterior distributions, Bayesian regression, model comparison
Survey and Sampling Methods Good data doesn’t collect itself. Learn how to design surveys, choose sampling strategies (simple random, stratified, cluster), and make statistically valid inferences from samples — foundational for any research or data collection project. Survey design, probability sampling, statistical inference, bias assessment
Data Management and Organization Work with data at scale using Apache Spark and PySpark. Learn data governance principles, implement ETL workflows, and run machine learning pipelines on distributed systems — skills essential for any enterprise data environment. Apache Spark, PySpark, ETL, data governance, Scala, structured streaming, feature engineering at scale
Big Data Infrastructure and Technology Move into the cloud. Learn how to design and deploy cloud architectures on AWS — covering compute, storage, databases, networking, and security — and apply them to real big data workloads using services like S3, EC2, and AWS big data tools. AWS (EC2, S3, RDS, CloudWatch), cloud architecture design, ETL/ELT on cloud, cost optimization, AWS certification preparation
Deep Learning Go beyond classical machine learning into neural networks. Build and train deep architectures for image classification, object detection, sequence modeling, and generation. Explore CNNs, RNNs, LSTMs, GANs, and autoencoders — and learn how to explain what your models are doing with Explainable AI. TensorFlow, Keras, CNN, RNN, LSTM, GAN, autoencoder, transfer learning, explainable AI (XAI)
Model Deployment Building a model is only half the job. Learn to take models from notebooks to production — packaging them as REST APIs with FastAPI, orchestrating ML pipelines, deploying to AWS SageMaker, and monitoring model performance over time. FastAPI, Scikit-learn pipelines, AWS SageMaker, Docker, MLOps, model monitoring, scalable training

Beyond the Data Science Courses

Year 2 also deepens your computer science foundations with courses in algorithm design, database technology, computer networks, and operating systems — giving you the engineering intuition to build robust, efficient data systems.

Year 2 is where ambition gets engineering behind it. You’ll finish it with the skills to build, ship, and scale real data science solutions.