06CS755 Data mining syllabus for CS


Part A
Unit-1 INTRODUCTION DATA 1 6 hours

What is Data Mining? Motivating Challenges; The origins of data mining; Data Mining Tasks. Types of Data; Data Quality.

Unit-2 DATA 2 6 hours

Data Preprocessing; Measures of Similarity and Dissimilarity

Unit-3 CLASSIFICATION 8 hours

Preliminaries; General approach to solving a classification problem; Decision tree induction; Rule-based classifier; Nearest-neighbor classifier.

Unit-4 ASSOCIATION ANALYSIS 1 6 hours

Problem Definition; Frequent Itemset generation; Rule Generation; Compact representation of frequent itemsets; Alternative methods for generating frequent itemsets.

Part B
Unit-5 ASSOCIATION ANALYSIS 2 6 hours

FP-Growth algorithm, Evaluation of association patterns; Effect of skewed support distribution; Sequential patterns.

Unit-6 CLUSTER ANALYSIS 7 hours

Overview, K-means, Agglomerative hierarchical clustering, DBSCAN, Overview of Cluster Evaluation.

Unit-7 FURTHER TOPICS IN DATA MINING 7 hours

Multidimensional analysis and descriptive mining of complex data objects; Spatial data mining; Multimedia data mining; Text mining; Mining the WWW. Outlier analysis.

Unit-8 APPLICATIONS 6 hours

Data mining applications; Data mining system products and research prototypes; Additional themes on Data mining; Social impact of Data mining; Trends in Data mining.

Last Updated: Tuesday, January 24, 2023