In this course we make an introduction into data-driven methods and technologies that are used for retrieving, storing, querying and analyzing data, be it structured or unstructured. In particular, this course will delve into the following subjects:
• Relational Model.
• The SQL language.
• Relational Database Management Systems (RDBMSs).
• Introduction to Query Processing in RDBMSs.
• Introduction to Data Integration. Schema Matching and Schema Mapping.
• Data Preprocessing: Data Quality, Data Cleaning, Data Integration, Data Reduction, Data Transformation and Data Discretization.
• Introduction to Data Warehouses and On-line Analytical Processing (OLAP). Schemas, Data Cubes, cube materialization.
• Introduction to Big Data, the MapReduce programming model, Hadoop, HDFS.
Description
Semester
Winter Semester
Category
Obligatory
Lecture Hours
2 hours
Lab Hours
1 hour
Credits
5