Course Syllabus

Course Details

Course Name: Statistical Machine Learning for Data Science

Course Number: Stat 447 (Stat 846)

Course Code (CRN): 82213

Year & Term: 2022-2023 Term 1

 

Required Text: Lantz, B. (2019). Machine Learning with R. Packt Publishing Ltd.

Readings/Reference:

[1] Wickham, H., & Grolemund, G. (2016). R for data science: import, tidy, transform, visualize, and model data. O'Reilly Media, Inc.

[2] Grolemund, G. (2015) Hands-On Programming with R: Write Your Own Functions and Simulations. O'Reilly Media, Inc.

[3] James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning. New York: Springer.

 

Course Website: on Canvas

Prerequisites: STAT 344 or STAT 345 or CMPT 317 or CMPT 318.

(For graduate students, students should have basic statistical theoretical knowledge, a good understanding of linear regression, and basic R coding skills.)

 

Instructor Details

Dr. Li Xing

Office: Room 215 McLean Hall

Email: li.xing@math.usask.ca

 

Schedule

Course Delivery:

Lecture Section: Monday & Thursday 11:30am-1:00pm (online via zoom)

Lab Section: Friday 2:00pm-3:20pm (online via zoom for outside USask students, or onsite in room 41 in the Arts and Science building basement).

Office Hours: Friday 4:00pm-5:00 pm, and by appointment (online via zoom, or onsite by appointment)

 

Invited Lectures:

Sponsored by the Pacific Institute for the Mathematical Sciences (PIMS), as one of the PIMS network courses, we have tentatively scheduled five invited lectures with details below.

  1. (September 12, 2022) Topics on Reproducible Research

Invited Speaker: Dr. Wesley Burr, Associate Professor at Mathematics Department of Trent University, Canada. He is also the President-Elect (2021-22) for the Statistical Education Section of the Statistical Society of Canada.

  1. (September 29, 2022) Topics on R Visualization

Invited Speaker: Dr. Jiayun Angela Yao, Epidemiologist at BC Centre for Disease Control, and Adjunct Professor, School of Population and Public Health, University of British Columbia, Canada.  

  1. (October 24, 2022) Machine Learning and Survival Analysis

Invited Speaker: Dr. Lihui Zhao, Associate Professor of Biostatistics in the Department of Preventive Medicine at Northwestern University.

  1. (November 14, 2022) Novel Machine Learning for Genomics Data

Invited Speaker: Dr. Jian Hu, Assistant Professor, Department of Human Genetics, School of Medicine, Emory University, US.

  1. (November 24, 2022) Advanced Unsupervised Learning

Invited Speaker: Dr. Lynn Lin, Assistant Professor in Biostatistics and Bioinformatics and Associate Director of the Quantitative Sciences Core of Duke Center for AIDS Research, at Duke University.

Please note that the schedule of invited talks might be changed later and more confirmed dates will be released during the class.  

 

Catalogue Description

The course provides learning opportunities on statistical software, R, with some focus on data management and wrangling, reproducibility, and visualization. On top of that, there are basic introductions to Machine Learning such as k-NN, Naive Bayes, regression methods, etc. The focus is on hands-on skills with R and applications to real data.   

 

Learning Objectives

By the completion of this course, students will be expected to

  1. learning statistical software R on data management and visualization.
  2. identify and apply the right tools from a critical statistical learning toolkit provided in the course to extract useful information from real data.
  3. given a real data problem, specify an appropriate research hypothesis and then manage a proper data analysis process using R software.
  4. demonstrate and explain these skills in writing and through an oral presentation.
  5. (For Stat 846 only, demonstrate the ability to employ the methods in research problems, understand the pros and cons of the method utility, and provide deeper thoughts in algorithm design and application.)  

Content Overview

  1. Introduction to machine learning.
  2. Managing and understanding data with R.
  3. Reproducibility and visualization.
  4. Supervised and unsupervised learnings.

 

Tentative Schedule:

Week

Date

Topic/Section

Assignments, Term Tests, and

Holidays

1

Sept 1

Introduction to Machine Learning

 

2

Sept 5

Data Management and Wrangling

Quiz 1

3

Sept 12

Reproducibility

Quiz 2

4

Sept 19

Visualization & EDA

Assignment 1 due

5

Sept 26

Data Splitting

Quiz 3

6

Oct 3

Regression Method

Quiz 4 & Assignment 2 due

7

Oct 10

Classification Evaluation

Quiz 5

8

Oct 17

KNN

Assignment 3 due

9

Oct 24

Naïve Bayes

 

10

Oct 31

Decision Tree

Project Proposal Due

11

Nov 7

No School

Reading Break

12

Nov 14

Advanced Learning

 

13

Nov 21

Unsupervised Learning

Presentation Due

14

Nov 28 & Later

Student Presentation

Presentation and Final Report Due

 

Midterm and Final Examination Scheduling

There are no midterm and final exams for this course.

 

Grading Scheme

5 Invited Lecture Attendance

5% (1% for each attendance)

10 Lab Assignments

10% (1% for each lab assignment)

5 In-class Quizzes

15% (3% for each quiz)

3 Lecture Assignments

30% (10% for each assignment)

1 Course Project including Proposal + Report + Presentation

40% (2% for proposal, 18% for final presentation and 20% for final report)

Total

100%

 

Evaluation Components

Invited Lecture Attendances 1-5

Description: Credit for Participants. TA will check your participation during the course session and confirm your attendance.     

Value: 5% of final grade

Submission: No submission is required. You should expect TA to communicate with you to confirm your attendance.   

 

Lab Assignments 1-10

Description: Problem based assignments.

Value: 10% of final grade

Submission: Lab assignment submission is due during its designated lab session.

 

In-Class Quizzes 1-5

Description: Problem based assignments.

Value: 15% of final grade

Submission: The quiz submission is due within 15-minutes time window during its designated lecture session.

 

Lecture Assignments 1-3

Description: Problem based assignments.

Value: 30% of the final grade

Due Date: (A1) Sept 25, 2022; (A2) Oct 9, 2022; and (A3) Oct 25, 2022.

Submission: Assignment submission is online. Detailed instruction will be provided during the course.

 

Course Project

Value:  40% of the final grade

Date:   Proposal Due Nov 6, 2022 

             Presentation Due Nov 27, 2022

             Final Report Due Dec 18, 2022

Type:   Online submission of proposal; in-class presentation, and take-home project.

Submission: Project related coursework should be directly submitted to the course instructor via email li.xing@math.usask.ca.

 

Late Coursework

I will not accept late lab assignment submission. I will accept late assignments only for seven (7) days beyond the due date. The penalty for your delay is 10 percent per day of lateness from the value of the assignment, including weekend days. Extensions may be granted only in exceptional circumstances (such as significant illness or emergency). 

 

Criteria That Must Be Met to Pass

Students must complete at least six lab assignments, at least two assignments, submit the project proposal, conduct the presentation, and submit the report to be eligible to pass the course.

 

Recommended Technology for Remote Learning

Students can access course materials via the course platform on Canvas. Zoom will be used for office hours, online discussions, and personal meetings. 

Students are reminded of the importance of having the appropriate technology for remote learning. The list of recommendations can be found at https://students.usask.ca/remote-learning/tech-requirements.php.

 

Recording of the Course

Use of video and recording of the course:

Please note that the pre-recorded course videos are available online, which belong to the instructor and the University and are protected by copyright. Do not download, copy, or share recordings without the explicit permission of the instructor.

For questions about recording and use of sessions in which you have participated, including any concerns related to your privacy, please contact your instructor. More information on class recordings can be found in the Academic Courses Policy https://policies.usask.ca/policies/academic-affairs/academic-courses.php#5ClassRecordings.

 

Copyright

Course materials are provided to you based on your registration in a class, and anything created by your professors and instructors is their intellectual property, unless materials are designated as open education resources. This includes exams, PowerPoint/PDF slides and other course notes. Additionally, other copyright-protected materials created by textbook publishers and authors may be provided to you based on license terms and educational exceptions in the Canadian Copyright Act (see http://laws-lois.justice.gc.ca/eng/acts/C-42/index.html)

Before you copy or distribute others’ copyright-protected materials, please ensure that your use of the materials is covered under the University’s Fair Dealing Copyright Guidelines available at https://library.usask.ca/copyright/general-information/fair-dealing-guidelines.php. For example, posting others’ copyright-protected materials on the open web is not covered under the University’s Fair Dealing Copyright Guidelines, and doing so requires permission from the copyright holder.  

For more information about copyright, please visit https://library.usask.ca/copyright/index.phpwhere there is information for students available at https://library.usask.ca/copyright/students/rights.php, or contact the University’s Copyright Coordinator at mailto:copyright.coordinator@usask.ca or 306-966-8817.

 

Integrity in a Remote Learning Context

Although the face of teaching and learning has changed due to covid-19, the rules and principles governing academic integrity remain the same. If you ever have questions about what may or may not be permitted, ask your instructor. Students have found it especially important to clarify rules related to exams administered remotely and to follow these carefully and completely.

The University of Saskatchewan is committed to the highest standards of academic integrity and honesty.  Students are expected to be familiar with these standards regarding academic honesty and to uphold the policies of the University in this respect.  Students are particularly urged to familiarize themselves with the provisions of the Student Conduct & Appeals section of the University Secretary Website and avoid any behavior that could potentially result in suspicions of cheating, plagiarism, misrepresentation of facts and/or participation in an offence.  Academic dishonesty is a serious offence and can result in suspension or expulsion from the University.

All students should read and be familiar with the Regulations on Academic Student Misconduct (https://secretariat.usask.ca/student-conduct-appeals/academic-misconduct.php) as well as the Standard of Student Conduct in Non-Academic Matters and Procedures for Resolution of Complaints and Appeals (https://secretariat.usask.ca/student-conduct-appeals/academic-misconduct.php#IXXIIAPPEALS)

For more information on what academic integrity means for students see the Academic Integrity section of the University Library Website at: https://library.usask.ca/academic-integrity#AboutAcademicIntegrity

You are encouraged to complete the Academic Integrity Tutorial to understand the fundamental values of academic integrity and how to be a responsible scholar and member of the USask community - https://library.usask.ca/academic-integrity.php#AcademicIntegrityTutorial

 

Access and Equity Services (AES) for Students

Students who have disabilities (learning, medical, physical, or mental health) are strongly encouraged to register with Access and Equity Services (AES) if they have not already done so. Students who suspect they may have disabilities should contact AES for advice and referrals at any time. Those students who are registered with AES with mental health disabilities and who anticipate that they may have responses to certain course materials or topics, should discuss course content with their instructors prior to course add / drop dates. In order to access AES programs and supports, students must follow AES policy and procedures. For more information or advice, visit https://students.usask.ca/health/centres/access-equity-services.php, or contact AES at 306-966-7273 or aes@usask.ca.

Students registered with AES may request alternative arrangements for mid-term and final examinations. Students must arrange such accommodations through AES by the stated deadlines. Instructors shall provide the examinations for students who are being accommodated by the deadlines established by AES.

For information on AES services and remote learning please visit https://updates.usask.ca/info/current/accessibility.php#AccessandEquityServices

 

Student Supports

Academic Help for Students

The University Library offers a range of learning and academic support to assist USask undergrad and graduate students. For information on specific services, please see the Learning page on the Library web site https://library.usask.ca/support/learning.php.

 

Teaching, Learning and Student Experience

Teaching, Learning and Student Experience (TLSE) provides developmental and support services and programs to students and the university community. For more information, see the students’ web site http://students.usask.ca.

 

College Supports

Students in Arts & Science are encouraged to contact the Undergraduate Student Office and/or the Trish Monture Centre for Success with any questions on how to choose a major; understand program requirements; choose courses; develop strategies to improve grades; understand university policies and procedures; overcome personal barriers; initiate pre-career inquiries; and identify career planning resources. Contact information is available at: (http://artsandscience.usask.ca/undergraduate/advising/)

 

Financial Support

Any student who faces challenges securing their food or housing and believes this may affect their performance in the course is urged to contact Student Central (https://students.usask.ca/student-central.php).

 

Aboriginal Students’ Centre

The Aboriginal Students’ Centre (ASC) is dedicated to supporting Aboriginal student academic and personal success. The centre offers personal, social, cultural and some academic supports to Métis, First Nations, and Inuit students. The centre is also dedicated to intercultural education, brining Aboriginal and non-Aboriginal students together to learn from, with and about one another in a respectful, inclusive and safe environment. Students are encouraged to visit the ASC’s Facebook page (https://www.facebook.com/aboriginalstudentscentre/) to learn more.

 

International Student and Study Abroad Centre

The International Student and Study Abroad Centre (ISSAC) supports student success and facilitates international education experiences at USask and abroad.  ISSAC is here to assist all international undergraduate, graduate, exchange and English as a Second Language students in their transition to the University of Saskatchewan and to life in Canada.  ISSAC offers advising and support on matters that affect international students and their families and on matters related to studying abroad as University of Saskatchewan students.  Please visit students.usask.ca or updates.usask.ca for more information.

 

Acknowledgements

The instructor would like to thank the Pacific Institute for the Mathematical Sciences (PIMS) for generous support for facilitating remote teaching and enhancing invited sessions. She appreciates that PIMS has selected the course as a PIMS network course, which provides opportunities for a more general student audience among our PIMS network universities.    

The instructor also would like to thank the Gwenna Moss Centre for Teaching and Learning, Mathematics and Statistics Department, and University Service for their support on course preparation and delivery.

 

Land Acknowledgement 

Land Acknowledgement I acknowledge that I live and work on Treaty 6 Territory and the homeland of the Métis. I pay my respect to the First Nations and Métis ancestors of this place past and present and reaffirm our relationship with one another. I respect the treaties that were made on these Territories, I acknowledge the harms and mistakes of the past, I recognize the ongoing present-day colonial violence that is faced by Indigenous peoples within healthcare, education, justice, child welfare and government systems and I dedicate myself to moving forward in partnership towards decolonization in the spirit of reconciliation and collaboration.

Course Summary:

Date Details Due