Abstract
Statistical learning refers to a set of tools for modelling and understanding complex datasets. It is a recently developed area in statistics and blends with parallel developments in computer science and, in particular, machine learning. This paper aims to outline some of the key statistical learning methods in the areas of prediction and classification of data. The goal is to discuss the theory and methodology of Ordinary Least Squares Regression, Ridge Regression, Lasso Regression, Logistic Regression, K-Nearest Neighbours method of classification, Linear and Quadratic Discriminant analysis, and Classification Trees. We then discuss the idea of Cross Validation, and demonstrate these methods by applying them to two real-life datasets.
Primary Advisor
M. Belalia
Program Reader
M. Hlynka
Degree Name
Master of Science
Department
Mathematics and Statistics
Document Type
Major Research Paper
Convocation Year
2022