Data Modelling (2021/2022) - Departamento de Informática
Description

This course address data modelling and query languages for semantically enriched data, big data and/or open data applications in the Web. The core of the course will cover graph models, in particular those allowing the representation of relationships between resources in the Web. Other alternative models like documentg and column stores are also discussed and compared to. Besides representing interconnections among data, it is necessary to model and explore multidimensional data for online analytical processing, in particular for representing time and space dimensions for understanding how data evolves and moves. The representation of information in novel (spatial)temporal databases will be addressed and applied.

Objectives

Knowledge:

Application:

Soft-Skills

Syllabus

1. NoSQL data models

Alternative models for storing big volumes ofdata. Column, document and graph models. Relational, semi-structured and graph data. Data modelling with graphs. Querying graph models. Graph databases. Relationship to NoSQL movement and key-value stores.

2. Semantic Web

Motivation. Linked Open Data. Language and semantics of the Resource Description Framework (RDF) and SPARQL query language. Ontologies in the Semantic Web: RDF Schema and Web Ontology Language (OWL).

3. Online Analytical Processing (OLAP)

Data Warehouses. (Conceptual) multidimensional data models. Typical OLAP operations and OLAP query languages. Metadata. Spatial and temporal dimensions. Interaction in the data analysis process.

4. Exercises and final project

Use of tools (graph database, temporal databases, RDF and OWL API, OLAP and multidimensional)

Bibliography

• Ian Robinson, Jim Webber, and Emil Eifrem. Graph Databases. O''''''''Reilly Media, Inc, 2013.

• Grigoris Antoniou, Paul Groth, Frank van Harmelen and Rinke Hoekstra . A Semantic Web Primer, 3rd Edition. MIT Press, August 2012.

• The Description Logic Handbook. Theory, Implementation and Applications. Edited by Franz Baader, Diego Calvanese, Deborah McGuinness, Daniele Nardi and Peter Patel-Schneider. Cambridge University Press, June 2010.

• The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling (Third Edition) - Ralph Kimball, Margy Ross. Wiley, 2013.

• Guy Harrison. Next Generation Databases: NoSQL, NewSQL and Big Data. Apress, 2015.
ISBN:978-1484213308.

• Dan Sullivan. NoSQL for Mere Mortals. Addison-Wesley, 2015.
ISBN:978-0134023212

• Ted Hills. NoSQL and SQL data modeling. Technics Publications, 2016.
ISBN:978-1634621090
Prerequisites

To take this curricular unit you should first get approval in Database Systems.

Student work
  Hours per credit 28
  Hours per week Weeks Hours
Aulas práticas e laboratoriais   24.0
Aulas teóricas   26.0
Avaliação   6.0
Self study   40.0
Project   66.0
Total hours 162
ECTS 6.0