Tentative Schedule - MCDA 5580
Data Mining 

Week 01
January 8, 2020 

Data mining overview 

Week 02
January 13 and 15, 2020 

Clustering algorithm, data preparation 

clustering implementation

Assignments 1 discussed in class 

Week 03
January 20 and 22, 2020 

Classification

ID3 algorithm

Use of rpart, random forest, SVM, ten-fold cross validation

Assignments 2 discussed in class 

Week 04
January 27 and 29, 2020 

Association mining

a priori algorithm

Assignments 3 discussed in class 

Weeks 05 and 06
January 24 and January 31, 2020 

Note: This is scheduled on two Fridays from 5:30pm to 8:15pm

Instructor: Mohammed Shameer Iqbal (Data Scientist rein.ai)

Python for data and text mining

Week 07
February 3 and 5, 2020 

Read and process from API's (such as Twitter etc) 

Learn about Naive Bayes Classifiers and other text classifications

Instructor: Sreejata Chatterjee, Leadsift - Chief of Products

Week 08
February 10 and 12, 2020 

NLP and various concepts 

How to build a Search Engine (indexing, ranking, cleaning of data, query engine)

Instructor: Sreejata Chatterjee, Leadsift - Chief of Products

Week 09
February 24 and 26, 2020 

Social network and sentiment analysis 

Discuss some real life problems

Instructor: Sreejata Chatterjee, Leadsift - Chief of Products

Week 10
March 3 and 5, 2020

Perceptrons, Neural networks

Prediction

regression

neural networks - mathematical foundations

time-series predictions

Week 11
March 10 and 12, 2020

Prediction continued (Assignment 4 substituted by Statistics Canada Competition)

Week 12+13
April 03 and 10, 2020 

Note: Special timing 5:30 - 8:15 pm (Friday).

Instructor: Chris Malloy

Deep Learning workshop

Bonus Weeks 14 and 15
April 8-20, 2020 

One on one meetings with the instructor

Schedule of the meetings will be announced on March 18th.

You will be expected to know all the assignments that you have submitted.

Instructor may ask to see any data file and to run one of the scripts mentioned in your reports.