Table des matières

Course unit: Data and Decisions

Course metadata

Brief description

This course unit is divided into three parts:

Learning outcomes

Course content

Statistical learning

  1. Introduction
    1. Classical problems: regression, classification
    2. Supervised, unsupervised and semi-supervised learning
    3. Curse of dimensionality
  2. Regression
    1. Multiple linear regression, OLS method
    2. Shrinkage-type methods (LASSO, Ridge)
    3. k-nearest neighbors
  3. Classification
    1. Logistic regression
    2. k-nearest neighbors
    3. SVM
    4. Rosenblatt perceptron and neuronal networks

Python for data science

  1. Dataframe: data exploration and data description
  2. Spotting patterns using factor
    1. Principal Component Analysis
    2. Correspondence analysis
  3. Prediction using trend analysis
    1. Linear regression
    2. Logistic regression
  4. Data classification
    1. Classification using partitions
    2. Hierarchical methods

Data-driven decision making

  1. What is data?
  2. How do we take decision?
  3. Data governance and data quality
  4. How to develop data-based decision making?
  5. Data platform and data architecture

Bibliography

Check the availability of the books below at Centrale Marseille library.

  1. Statistical Learning
    • James G., Witten D., Hastie T. and al. (2013). An introduction to statistical learning: with applications in R. New York: Springer
    • Hastie T., Tibshirani R. and Friedman J. (2013). The elements of statistical learning: data mining, inference, and prediction. New York: Springer.
    • Cornillon P-A., Matzner-Løber E. et al. (2010). Régression avec R. Paris: Springer.
  2. Python for data science
    • Jannach, D., Zanker, M., Felfernig, A. and Friedrich, G. (2010). Recommender Systems: An Introduction. Cambridge.
  3. Advising using data
    • tba