Vai al contenuto principale
Oggetto:
Oggetto:

INTRODUCTION TO DATA SCIENCE - STATISTICAL LEARNING AND DATA ANALYTICS

Oggetto:

INTRODUCTION TO DATA SCIENCE - STATISTICAL LEARNING AND DATA ANALYTICS

Oggetto:

Anno accademico 2021/2022

Codice dell'attività didattica
SEM0125B
Docenti
Alessandra Cauli (Titolare del corso)
Pierpaolo De Blasi (Titolare del corso)
Matteo Ruggiero (Titolare del corso)
Insegnamento integrato
Corso di studi
ECONOMIA - percorso in Economia e Data Science
Anno
3° anno
Periodo didattico
Secondo semestre
Tipologia
Caratterizzante
Crediti/Valenza
6
SSD dell'attività didattica
SECS-S/01 - statistica
Modalità di erogazione
Tradizionale
Lingua di insegnamento
Inglese
Modalità di frequenza
Facoltativa
Tipologia d'esame
Orale
Oggetto:

Sommario insegnamento

Oggetto:

Obiettivi formativi

The course introduces to the fundamental techniques of statistical learning aimed at building a model for predicting a response variable based on one or more independent variables (or covariates). Special attention will be devoted to computer-based implementation of such techniques using a statistical software and to the interpretation of the analyses' results.

Oggetto:

Risultati dell'apprendimento attesi

- Knowledge and understanding
The student will learn the most common methodologies for analyzing a data set together with their implementation through the software R. The student will also be able to interpret the results of the analysis and present them through both visual and numerical summaries.

- Applying knowledge and understanding
The student will have the ability to discuss various methods and techniques for statistical learning.
- Making judgements
The student will be able to select the appropriate statistical method for analyzing a datasets with the support the R software in supervised learning.
- Communication skills.
Students will properly use statistical language to comunicate the results of their findings.

Oggetto:

Modalità di insegnamento

Roughly half of the course will be delivered through lectures and half through labs for model implementation.

Oggetto:

Modalità di verifica dell'apprendimento

The final examination consists in a written test. Homeworks may be assigned during the course, whose evaluation may contribute to the final grade.

Oggetto:

Programma

Statistical learning
- Goals
- Accuracy vs. interpretability
- Bias-variance trade off
Linear regression
- Simple linear regression
- Multiple linear regression
- Discussion and comparisons
Validation and resampling
- Cross-validation
- The bootstrap
Model selection and regularization
- Subset selection
- Shrinkage methods (ridge, lasso)
- Dimension reduction
Non-linear models
- Logistic regression for binary dependent variables
- Polynomial regression
- Regression Splines
- Generalized additive models

Testi consigliati e bibliografia

Oggetto:

Testi adottati
James, Witten, Hastie, Tibshirani. An Introduction to Statistical Learning. Springer (2013)
Testi per approfondimento
Hastie, Tibshirani, Friedman. The Elements of Statistical Learning. Springer (2009)



Oggetto:
Ultimo aggiornamento: 10/03/2022 16:36
Location: https://www.ecocomm.unito.it/robots.html
Non cliccare qui!