This course addresses principles, methods and practical recommendations for extracting interesting and meaningful patterns from structured and unstructured data (numeric and textual data), from a perspective at the interface between Computer Science and Statistics. The course covers fundamental topics and computational methods for the growing field of Data Analytics and Mining.
The course is organized in two modules:
(i) Module I is about data pre-processing, dimensionality reduction and data-driven clustering, to induce models from data and their interpretation aids.
(ii) Module II is about Relevant Information Extraction, symbolic and statistical analysis of texts, document descriptors, document classification and distribution of words and multi-words in Big Data context.
Knowledge:
Skills
Competences
Introduction
Data Analytics
What is data? Examples of data analytic tasks and various perspectives of them
Text Mining
Structured or unstructured data? Why mining texts?
What types of problems can be solved?
Data Understanding
Descriptive Modeling I
Principal Component Analysis(PCA): Model and Method
PCA: Applications
Descriptive Modeling II
Interpreting Descriptive Models
Data Analytics Case Studies
Relevant Information Extraction
Symbolic and Statistical Analysis of texts
Document Descriptors
Document Classification
Text Mining Case Studies(some examples)
Hours per credit | 28 | ||
Hours per week | Weeks | Hours | |
Aulas práticas e laboratoriais | 24.0 | ||
Aulas teóricas | 24.0 | ||
Avaliação | 6.0 | ||
Self study | 54.0 | ||
Orientação tutorial | 6.0 | ||
Project | 54.0 | ||
Total hours | 168 | ||
ECTS | 6.0 |