Data Analytics and Mining (2017/2018) - Departamento de Informática
Description

This course addresses principles, methods and practical recommendations for extracting interesting and meaningful patterns from structured and unstructured data (numeric and textual data), from a perspective at the interface between Computer Science and Statistics. The course covers fundamental topics and computational methods for the growing field of Data Analytics and Mining.

The course is organized in two modules:

(i) Module I is about data understanding and pre‐processing with exploratory data analysis, and also on data-driven clustering algorithms to induce models from data and on their interpretation.

(ii) Module II is about Relevant Information Extraction, symbolic and statistical analysis of texts, document descriptors, document classification and distribution of words and multi-words in Big Data context.

Objectives

Knowledge

Skills

Competences

Autonomy and self-reliance in the application and furthering studies in Data Analytics and Text Mining.

Syllabus

Introduction

Data Analytics

What is data: Examples of data analytic tasks and various perspectives of them

Visualization as a convenient tool for business analytics

Text Mining

Structured or unstructured data? Why mining texts?

What types of problems can be solved?

Data Understanding

Data Preparation

Descriptive Modeling I

Principal Component Analysis(PCA): Model and Method

PCA: Applications

Descriptive Modeling II

Interpreting Descriptive Models

Data Analytics Case Studies

Relevant Information Extraction

Symbolic and Statistical Analysis of texts

Document Descriptors

Document Classification

Text Mining Case Studies(some examples)

Bibliography
Student work
  Hours per credit 28
  Hours per week Weeks Hours
Total hours 0
ECTS 6.0