Machine learning and data mining 1 [ KMI/MLDM1 ]

The course is the first part of the two semester course devoted to principles and main methods of data mining and machine learning. After problem introduction with defining these notions and looking at data and their preprocessing the basic data minig methods of classification, association analysis and clustering used (not only) in machine learning are discussed, from the algorithmic point of view.

Lectures

  1. Intro: Data mining: extracting information from data, KDD, typical tasks. Machine learning: learning from information from data, phases and types.
  2. Data: Types of data and attributes, quality a preprocessing (sampling, normalization, discretization), similarity and dissimimlarity of objects, summary statistics a visualization.
  3. Classification: Decision trees, overfitting problem, evaluation of performance, rule-based, nearest neighbor, naive Bayes, support vector machines (SVM), regression.
  4. Association analysis: Itemsets, rules, Apriori algorithm, interestingness evaluation.
  5. Cluster analysis: types of clusters, K-means, hierarchical, density-based, expectation-maximization (EM), quality evaluation.

Literature