CMP SC 8370: Data Mining and Knowledge Discovery

Instructor: Dr. Jianlin Cheng

Location: Cornell Hall 219, Time: MoWe 11:00 am - 12:15 pm, Office Hours: Mo 1:30 - 2:30, We 1:30 - 2:00 Semester: Spring 2011

Syllabus

Lecture Slides

Acknowledgements: these slides are largely customized and adapted from the text book's slides.

1. Data Mining Concepts and Process

2. Data Preprocessing

3. Frequent Pattern Mining

4. Classification and Prediction

5. Cluster Analysis

6. Sequential Data Mining

Text Book

Han and Kamber. Data Mining: Concepts and Techniques (second edition). Morgan Kaufman, 2006. 

Reading Materials and Other Resources

1. A portal web site of the data mining community (news, tools, data, jobs, trends)
2. Chapters of the text book covered in the class (self-reading, not graded)
3. R Statistics Computing Software

Assignments

Assignment 1 (each question is worth 10 points), due 1/31/2011.
Assignment 2, due 2/16/2011. (Here are a couple of examples about how to use R to draw plots, which may be useful)
Assignment 3, due 3/2/2011.

Projects

The description of project 1 - customer relation prediction
The description of project 2 - new customer recognition
The description of project 3 - internet query classification