IS372: Data Warehouse and Data Mining
مستودع البيانات والتنقيب

Course Description
This course covers the concepts of building a data-warehouse to be used through applying the main data mining algorithms. The Knowledge Discovery in Large Database techniques will be explained. Then the computational techniques and algorithms for analyzing and mining patterns in large-scale datasets it will be covered. The considerable amount of effort will also be put on computational aspects of algorithm implementation. The importance of data mining is explored by introducing principal ideas in statistical learning. This course focuses on mining process which includes data selection and cleaning, machine learning techniques to learn knowledge that is hidden in data, and the reporting and visualization of the resulting knowledge.

Course Objectives
After completing this course the student will be able to:

  1. Build a Data Warehouse and apply OLAP technology for data mining: Data preparation, design multi domination data model
  2. Explore the novel way of extraction of new knowledge from large amounts of data.
  3. Examine the concepts, issues, tasks and techniques of data mining and how to apply data mining tools to real-world problems.
  4. Discover hidden patterns, finding predictive information that experts may miss because it lies outside their expectations.
  5. Develop appropriate technologies to solve problems related to data mining.

Course Outline
  1. Introduction to Data Mining, Data Mining Goals, Stages of the Data Mining Process, Data Mining Techniques, Knowledge Representation Methods.
  2. Data Warehouse and OLAP technology
  3. Data preprocessing.
  4. Data mining algorithms: Classification/Prediction.
  5. Clustering algorithms: Basic issues in clustering, Partitioning methods: Rule mining; association rule mining and sequential Mining Patterns.
  6. Advanced techniques, Data Mining Techniques:
  7. Advanced techniques, Data Mining software and applications
Lab
A practical session is used in a lab environment in the form of a case study (or a mini-project). The above course outline is applied in the lab using a tool such as R visual studio or Rapid Miner or Weka.

Textbooks:
[1] Jiawei Han, Micheline Kamber (2012 ): Data Mining: Concepts and Technique, rd Edition, Elsevier [Download PDF]
[2] Shmueli, Patel and Bruce (2010 ): Data Mining for Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner. nd Edition, Wiley [Download PDF]
[3] Ian Witten, Data mining Practical Machine Learning Tools and Techniques [Download PDF]
[4]PAULRAJ PONNIAH, DATA WAREHOUSING FUNDAMENTALS FOR IT PROFESSIONALS [Download PDF]

Slides:
  • Lecture 1: Introduction [PDF]
  • Lecture 2: Data Pre-processing [PDF
  • Lecture 3: OLAP and Cube Technology [PDF]
  • Lecture 4: Classification[PDF]
  • Lecture 5: Association Rule [PDF]
  • Lecture 6: Model Evaluation [PDF]

No comments:

Post a Comment