Database Systems II

Semestr: Winter

Range: 2+2

Completion:

Credits: 6

Programme type:

Study form: Fulltime

Course language: Czech

Summary:

The aim is to introduce new trends in database systems to students. We will focus primarily on the current issues of Big Data and the associated problems of distributed storage and processing of data. We will introduce a so-called basic types of NoSQL databases and also the related issue of cloud computing, data storage and distributed computations over large data files.

Keywords:

Course syllabus:

1. Introduction to Big Data and NoSQL databases
2. MapReduce
3. Basic principles of Big Data Management (CAP theorem, distribution,
scaling, replication)
4. Comparison and classification of NoSQL databases
5. The database type of key - value
6. Column database
7. Document Database
8. Graph database - basic principles
9. Graph databases - advanced aspects of indexing, querying
10. Advanced Principles of Big Data management (transaction performance,
polyglot persistence)
11. NewSQL database
11. Cloud computing
13. Data warehouses and Big Data
14. Other problems associated with Big Data - data analysis, data
generators, benchmarking.

Seminar syllabus:

Literature:

1. http://nosql-database.org/
2. Pramod J. Sadalage - Martin Fowler: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence
3. Eric Redmond - Jim R. Wilson: Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement
4. Sherif Sakr - Eric Pardede: Graph Data Management: Techniques and Applications
5. Shashank Tiwari: Professional NoSQL
6. Jeffrey Dean and Sanjay Ghemawat: MapReduce: Simplified Data Processing on Large Clusters, Google, Inc.
7. Google Code: Introduction to Parallel Programming and MapReduce
8. Hadoop: The Definitive Guide, by Tom White, 2nd edition, Oreilly?s, 2010

Examiners:

Lecturers:

Instructors: