Data Analytics and Mining (2021/2022) - Departamento de Informática
Description

This course addresses principles, methods and practical recommendations for extracting interesting and meaningful patterns from structured and unstructured data (numeric and textual data), from a perspective at the interface between Computer Science and Statistics. The course covers fundamental topics and computational methods for the growing field of Data Analytics and Mining.

The course is organized in two modules:

(i) Module I is about data pre-processing, dimensionality reduction and data-driven clustering, to induce models from data and their interpretation aids.

(ii) Module II is about Relevant Information Extraction, symbolic and statistical analysis of texts, document descriptors, document classification and distribution of words and multi-words in Big Data context.

Objectives

Knowledge:

Skills

Competences

Syllabus

Introduction

Data Analytics

What is data? Examples of data analytic tasks and various perspectives of them

Text Mining

Structured or unstructured data? Why mining texts?

What types of problems can be solved?

Data Understanding

Descriptive Modeling I

Principal Component Analysis(PCA): Model and Method

PCA: Applications

Descriptive Modeling II

Interpreting Descriptive Models

Data Analytics Case Studies

Relevant Information Extraction

Symbolic and Statistical Analysis of texts

Document Descriptors

Document Classification

Text Mining Case Studies(some examples)

Bibliography
Student work
  Hours per credit 28
  Hours per week Weeks Hours
Aulas práticas e laboratoriais   24.0
Aulas teóricas   24.0
Avaliação   6.0
Self study   54.0
Orientação tutorial   6.0
Project   54.0
Total hours 168
ECTS 6.0