Table des matières

Course unit: Data science project

Course metadata

Brief description

The course consists of a theoretical part and a practical part, simulating a business project.

Learning outcomes

Course content

  1. Data science in business
    • The main issues
    • Examples of data project
  2. Starting a data science project
    • The constraints of data science projects
    • Finding data
    • Acquiring information
    • Playing with data
  3. Lifecycle of a project
    • The Bias-Variance tradeoff
    • Feature Selection
    • Feature Engineering
    • Defining a metric
  4. The basic models
    • Regressions (linear, polynomial, penalized et logistic)
    • Decision trees (random forest and gradient boosting)
  5. Focus Natural Language Processing (NLP)
    • Word Embedding
    • Example: Sentiment analysis

Bibliography

Check the availability of the books below at Centrale Marseille library.