Applied Machine Learning

Learn Python programming and write programs that implement machine learning in business. *The course requires an undergraduate knowledge of statistics, calculus, linear algebra, and probability.

Course Dates


May 25, 2021

Course Duration


5 months, online
8-10 hours per week

Course Information: Flexible payment available

Who is this course for?

The Applied Machine Learning course teaches you a wide range of supervised and unsupervised machine learning techniques, using Python as the programming language.

Since this course requires an intermediate knowledge of Python, you will spend the first part of this course learning Python for Data Analytics taught by Emeritus. This will provide you with the programming knowledge required to do the assignments and application projects that are part of the Applied Machine Learning course.

If you want to implement or lead a machine learning project, or to incorporate machine learning capability into your software application, this course is appropriate for you. This is a programming course: you will be required to write code, but no prior programming knowledge is required, because the first part of the course teaches the Python you need.


The course requires an undergraduate knowledge of statistics (descriptive statistics, regression, sampling distributions, hypothesis testing, interval estimation, etc.), calculus (derivatives), linear algebra (vectors and matrix transformations), and probability (conditional probability and Bayes' theorem).

*Assessment: Students will be given an assessment to test their math skills prior to commencement of the course. Sample questions and learning resources covering the assessment topics are available to help you prepare.


anticipated spending growth on AI and ML by 2021.



expected wage growth for data scientists (vs. <2% average wage increase across all occupations).



decrease in ‘click-to-ship’ time by Amazon using ML algorithm.


Course Highlights

240+ Faculty Video Lectures

45 Quizzes / Assignments

18 Moderated Discussion Boards

20+ Q&A Sessions with Course Leaders

12 Application Projects

Includes Live Online Teaching

Learning Journey

Going beyond the theory, our approach invites participants into a conversation in which learning is facilitated by live subject matter experts and enriched by practitioners in the field of machine learning. In this course, you will learn to:

  • Define a model for your data and make the model learn.
  • Build regression models to predict an unknown output from a given set of inputs.
  • Create classification models to categorize datasets such as email messages as spam or non-spam.
  • Develop unsupervised models, such as topic models or recommender systems, to extract hidden patterns from large amounts of data.
  • Determine hidden parameters in data to improve the accuracy of your model's predictions.
  • Create probabilistic data models to predict a range of possible outcomes that account for real-world risks and uncertainties.


Part 1: Python for Data Analytics (Video content and delivery by Emeritus)

  • Module 1: Introduction to Data Science
  • Module 2: Working with Data Types and Operators in Python
  • Module 3: Writing Functions in Python
  • Module 4: Popular Data Science Packages in Python
  • Module 5: Advanced Functions
  • Module 6: Data Manipulation and Analysis with Pandas
  • Module 7: Data Visualization with Matplotlib
  • Module 8: Random Variables and Statistical Inferences
  • Module 9: Statistical Distributions and Hypothesis Testing
  • Module 10: Data Cleaning
  • Module 11: Exploratory Data Analysis
  • Module 12: Getting Started with Linear Algebra for Machine Learning

Part 2: Applied Machine Learning (Video content from Columbia Engineering and delivery by Emeritus)

Supervised Learning

  • Module 1: Regression
    Introduction to Data Science
  • Module 2: Linear Regression
  • Module 3: Bayesian Methods
    Bayes Rule, MAP Inference, Active Learning
  • Module 4: Foundational Classification Algorithms – Part 1
    Nearest Neighbors, Perceptron, Logistic Regression
  • Module 5: Foundational Classification Algorithms – Part 2
    Refinements to classification: kernel methods, Gaussian processes
  • Module 6: Intermediate Classification Algorithms – Part 1
    SVM, trees, forests, and boosting
  • Module 7: Intermediate Classification Algorithms – Part 2
    Clustering methods: k-means clustering, expectation-maximization (EM), Gaussian mixtures

Unsupervised Learning

  • Module 8: Clustering Methods
    K-means clustering, expectation-maximization (EM), Gaussian mixtures (cont.)
  • Module 9: Recommendation Systems – Part 1
    Collaborative Filtering, Topic Modeling, PCA
  • Module 10: Recommendation Systems – Part 2
    Sequential data models - Markov and hidden Markov models, Kalman filters
  • Module 11: Sequential Data Models
    Markov and Hidden Markov Models, Kalman Filters
  • Module 12: Association Analysis Clustering methods
    Model comparisons, analysis considerations

Application Projects

Movie Recommendation Engine

You will build a movie recommendation engine by applying collaborative filtering and topic modeling techniques. You will use a dataset that contains 20 million viewer ratings of 27,000 movies.

House Price Prediction

You will write code to predict house prices using least squares linear regression and Bayesian linear regression, based on several parameters available in the Ames City dataset compiled by Dean De Cock.
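To give a flavor of the least squares part of this project, here is a minimal sketch that fits a straight line to one feature using only the Python standard library. The function names, the single `living_area` feature, and the toy numbers are assumptions made for illustration; they are not taken from the course materials or from the actual dataset.

```python
# Minimal sketch: ordinary least squares with a single feature.
# The data below are made-up toy values, not rows from the real dataset.

def fit_least_squares(xs, ys):
    """Return (slope, intercept) that minimize the sum of squared residuals."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel out).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """Predict the target for a new feature value using the fitted line."""
    return slope * x + intercept


living_area = [1000.0, 1500.0, 2000.0, 2500.0]  # hypothetical square feet
sale_price = [150.0, 200.0, 250.0, 300.0]       # hypothetical price in $1000s

slope, intercept = fit_least_squares(living_area, sale_price)
estimate = predict(slope, intercept, 1800.0)
print(f"slope={slope:.3f}, intercept={intercept:.1f}, estimate={estimate:.1f}")
```

A real project would use many features at once, which is usually handled with matrix operations rather than the one-feature formulas shown here.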

Human Activity Prediction

You will predict the human activity (walking, sitting, standing) that corresponds to the accelerometer and gyroscope measurements by applying the nearest neighbours technique.
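As a rough illustration of the nearest neighbours idea, the sketch below labels a new measurement with the majority label among its k closest training points. The two-number feature vectors, the sample values, and the name `knn_predict` are hypothetical and chosen only for this example; real accelerometer and gyroscope data would have many more features.

```python
import math
from collections import Counter

# Minimal sketch of k-nearest-neighbours classification (standard library only).
# Each point is a hypothetical (acceleration, rotation) summary of one reading.

def knn_predict(train_points, train_labels, query, k=3):
    """Return the most common label among the k training points closest to query."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


train_points = [
    (2.0, 1.5), (2.2, 1.7), (1.9, 1.6),   # walking
    (0.1, 0.1), (0.2, 0.0), (0.1, 0.2),   # sitting
    (0.9, 0.1), (1.0, 0.2), (1.1, 0.1),   # standing
]
train_labels = ["walking"] * 3 + ["sitting"] * 3 + ["standing"] * 3

print(knn_predict(train_points, train_labels, (2.1, 1.6)))
print(knn_predict(train_points, train_labels, (0.15, 0.1)))
```

The choice of k and of the distance measure (Euclidean here) are design decisions that would normally be tuned on held-out data.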

Credit Card Fraud Detection

You will detect potential fraud in credit card transaction data by applying the random forest method to identify fraudulent transactions.

Marketing Segmentation

You will create market segments by applying the k-means clustering method to the US Census dataset.
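The core loop of k-means can be sketched in a few lines: assign each point to its nearest centroid, then move each centroid to the mean of its assigned points, and repeat until nothing changes. The example below is an illustrative sketch only; the (age, income) pairs, the fixed starting centroids, and the function name `kmeans` are assumptions, not part of the course materials or the census data.

```python
import math

# Minimal sketch of k-means clustering (standard library only).
# Starting centroids are passed in explicitly so the result is deterministic.

def nearest_index(point, centroids):
    """Index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))


def kmeans(points, initial_centroids, max_iterations=10):
    """Return (centroids, assignments) after alternating assign and update steps."""
    centroids = [tuple(c) for c in initial_centroids]
    for _ in range(max_iterations):
        # Assignment step: group points by their nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            clusters[nearest_index(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for old, members in zip(centroids, clusters):
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: centroids no longer move
        centroids = new_centroids
    assignments = [nearest_index(p, centroids) for p in points]
    return centroids, assignments


# Hypothetical (age, income in $1000s) pairs.
customers = [(25, 30), (27, 35), (23, 32), (55, 90), (60, 95), (58, 88)]
centroids, assignments = kmeans(customers, initial_centroids=[(25, 30), (55, 90)])
print(assignments)
```

In practice, the features would usually be rescaled first so that one large-valued column does not dominate the distance, and several random initializations would be compared.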

Note: All product and company names are trademarks™ or registered® trademarks of their respective holders. Use of them does not imply any affiliation with or endorsement by them.


Dr. John W. Paisley

Associate Professor, Electrical Engineering, Columbia University; Affiliated Member, Data Sciences Institute

John has a PhD from Duke and has been a postdoctoral researcher in the Computer Science departments at Princeton University and UC Berkeley. His research focuses on developing models for large-scale text and image processing applications. He is particularly interested in Bayesian models and posterior inference techniques that address the big data problem.

Course FAQs

  • 1. Review the sample math assessment to see whether you feel confident with the material, which covers calculus, linear algebra, statistics, and probability.

    2. Review the learning resources to familiarize yourself with the assessment quiz content.

    3. Submit your application.

    4. Take the math assessment. You will have two attempts. Participants who pass the assessment will receive access to the course. Any deposit fees will be refunded to participants who do not pass the assessment.

  • The course familiarizes you with machine learning algorithms and applications. It will also help you understand how to approach a business problem and give you the tool knowledge needed to transition to a machine learning or data science role.

  • The course familiarizes you with machine learning algorithms and applications and provides a solid foundation in statistics, mathematics, and problem-solving skills to help you solve enterprise-level problems. The Applied Machine Learning course augments your existing knowledge of various tools and expands your skill set as a data science or machine learning professional.

  • The course familiarizes you with machine learning algorithms and applications while providing a solid foundation in statistics and mathematics and enhancing your business acumen. It augments your existing programming knowledge and expands the technologies you are familiar with, helping you further develop your skill set as a data science or machine learning professional.

  • Absolutely! Knowledge of data science and machine learning (ML) has quickly become a requisite across industries, and all businesses will eventually need to use these techniques to thrive. While your current role may not require machine learning knowledge, it is almost certain that ML skills will be in high demand in nearly every industry in the future.

  • The course is a blend of theory, tools, and case studies (datasets) that are easy to assimilate and implement. For instance, students work on application projects that require them to apply the Machine Learning concepts they’ve learned to datasets and derive inferences. These application projects are intentionally made to be challenging, and students are expected to spend substantial time and effort solving them. At the end of the course, students will be able to apply Machine Learning to solve many of the business problems they face in their workplace.

  • Columbia Engineering Executive Education is collaborating with online education provider Emeritus to offer a portfolio of high-impact online courses. These courses leverage Columbia’s thought leadership in management practice developed over years of research, teaching, and practice.

  • Recommended System Requirements

    • Processor: 2.60 GHz
    • RAM: 8 GB
    • Disk space: 2 to 3 GB
    • Operating systems: Windows 10, macOS, and Linux
    • Python download link
    • Compatible tools: Any text editor, command prompt

    Minimum System Requirements

    • Processor: 1 GHz
    • RAM: 1 GB
    • Disk space: 1 GB
    • Operating systems: Windows 7 or later, macOS, and Linux
    • Python versions: 2.7.X, 3.6.X
    • Compatible tools: Any text editor, command prompt



Upon successful completion of the course, participants will receive a verified digital certificate from Emeritus in collaboration with Columbia Engineering Executive Education.

Early registrations are encouraged. Seats fill up quickly!

Flexible payment options available.