Course Description

This course is the second part of a two-semester data analytics course (MIS 301). The topics covered are: Review of Data Analytics I course content, Review of programming SW, Cross-Validation, dimension reduction approaches, Feature Selection, Similarity, clustering methods.

Learning Objectives

By the end of this course, students will be able to:

  1. Apply linear regression to analyze relationships between variables.

  2. Implement generative models for classification problems.

  3. Use cross-validation and resampling methods to assess model performance.

  4. Apply dimension reduction techniques.

  5. Understand Feature Selection briefly.

  6. Explore and interpret clusters in unsupervised data.

Course Schedule

Tentative Course Schedule

Week Topic Chapters Key Activities/Assignments
1 Introduction to Statistical Learning 1, 2 Overview, R Setup, Basic Commands
2 Simple and Multiple Linear Regression: Review 3 Lab: Linear Regression
3 Evaluation of Linear Regression 3 Lab: Linear Regression Evaluations
4 Model for Classification: Review 4 Lab: Logistic Regression, KNN
5 Cross-Validation 5 Lab: Cross-Validation Methods
6 Model Selection (Left For ML Course) 6 Lab: Model Selection Methods
7 Dimension Reduction (Left For ML Course) 6 Lab: Principal Components
8 Before Midterm Exam Review
9 Midterm Exam
10 Support Vector Machines 9 Lab: SVMs for Classification
11 Similarity Measures 12 Lab: Similarity Measures
12 Clustering Methods 12 K-Means, Hierarchical Clustering
13 Presentations
14 Recap: What have we learned

Course Materials

Evaluation Criteria

Policies

Academic integrity is fundamental to the academic mission of the university. Acts of academic dishonesty, including but not limited to plagiarism, cheating, fabrication, or unauthorized collaboration, undermine the learning process and violate university policies.

Specific guidelines include:

  1. Plagiarism: Using someone else’s work, ideas, or words without proper attribution is strictly prohibited. This includes copying and pasting from any source, paraphrasing without citation, or submitting another person’s work as your own.

  2. Cheating: Unauthorized use of materials, devices, or information during exams or assignments, including sharing or receiving answers, is not allowed.

  3. Fabrication: Falsifying or inventing data, citations, or research is a breach of academic integrity.

  4. Collaboration: While collaboration on group assignments may be permitted, sharing answers or work on individual tasks is not acceptable unless explicitly authorized.

  5. Consequences: Violations of academic integrity will be addressed following the university’s academic policies, potentially leading to penalties such as assignment failure, course failure, or further disciplinary actions.