Machine learning is a subfield of computer science that gives computers the ability to learn without being explicitly programmed. It covers the study and construction of algorithms that can learn from data and make predictions on it. It is a growing, cross-functional field that draws on concepts from statistics, mathematics, software engineering, and more, and it focuses on teaching computers to do work that was traditionally reserved for humans. The R language is widely used among statisticians and data miners to develop statistical software and perform data analysis.
In this course, you will start by organizing your data and then using it to make predictions. You will then work through several examples. The first example uses linear regression to predict the murder arrest rate for a given state from arrest data. Here you will explore RStudio and its libraries, and learn how to apply linear regression, how to score test sets, and how to plot test results on a Cartesian plane. The next example uses logistic regression to solve a classification problem on automobile data: predicting the number of engine cylinders from performance features. This example demonstrates how to label and scale data, how cross-validation works, and how to apply logistic regression. Finally, you will move on to the last example, which uses medical data about diabetes, where you will use the caret package in R to simplify some of these steps.
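To give a flavor of the first example, here is a minimal sketch in base R. It assumes that R's built-in `USArrests` data set stands in for the course's arrest data, and the 70/30 split, the seed, and the choice of RMSE as the score are illustrative assumptions rather than the course's actual settings.

```r
# Assumption: the built-in USArrests data set stands in for the course's arrest data.
data(USArrests)

set.seed(42)  # arbitrary seed so the random split is reproducible

# Hold out roughly 30% of the states as a test set (assumed split ratio).
n <- nrow(USArrests)
test_idx <- sample(n, size = round(0.3 * n))
train <- USArrests[-test_idx, ]
test  <- USArrests[test_idx, ]

# Fit a linear regression of the murder arrest rate on the other columns.
fit <- lm(Murder ~ Assault + UrbanPop + Rape, data = train)

# Score the test set: predict, then summarize the error as RMSE.
pred <- predict(fit, newdata = test)
rmse <- sqrt(mean((test$Murder - pred)^2))
print(rmse)

# Plot predicted against actual values; points near the dashed
# identity line correspond to accurate predictions.
plot(test$Murder, pred,
     xlab = "Actual murder arrests (per 100,000)",
     ylab = "Predicted murder arrests (per 100,000)")
abline(0, 1, lty = 2)
```

A lower RMSE means the predictions sit closer to the actual values, which shows up in the plot as points clustering around the dashed line.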
By the end of this course, you will have mastered preparing data and the tools involved, namely regression and classification. You will also have learned to make predictions on new observations.
Organize and set up your data, and make predictions
Apply a variety of tools: regression and classification
Label and scale data, and understand how cross-validation works
Make predictions on new observations
Use the caret package to apply and score a model
You should be familiar with the basics of the R language and data frames, and have a basic knowledge of statistics.
Who is this course intended for?
If you are an aspiring data scientist who is familiar with the basics of the R language and data frames and has a basic knowledge of statistics, then this is the course you need. You are not expected to have any prior knowledge of developing artificial intelligence or machine learning systems. If you want to understand how the R programming environment and its packages can be used to develop machine learning systems, then this is the perfect course for you.