"Overview of methods for handling missing values".

  • Category: LabEx NUMEV monthly seminar
  • Dates : February 3, 2023
  • Timetable: 11 a.m. to 12 p.m.
  • Location: Campus St Priest, bât 2, amphi Moreau - 860, rue de St Priest 34090 Montpellier

The NUMEV Seminars are open to a wide audience of students and researchers from all disciplines, who wish to find out more about the current research areas of the NUMEV-MIPS (Mathematics, Informatics, Physics and Systems) community, or about opportunities to develop their skills and know-how.

Abstract

The problem of missing values exists since the earliest attempts of exploiting data as a source of knowledge as it lies intrinsically in the process of collecting, recording, and preparing the data itself. It is all the more unavoidable as vast amounts of data are currently collected from different sources: "One of the ironies of Big Data is that missing data plays an increasingly important role". There is a vast literature on this topic, and a recent survey even identified more than 150 different implementations.

In this presentation, I will share my experience on the topic. I will start by the inferential framework and then show how missing values create additional challenges to the task of supervised learning, as traditional machine learning algorithms can not handle incomplete data. Finally, I will illustrate the impact of the methods developed in the causal inference field to estimate treatment effects from clinical data.

Receive a weekly summary of the UM agenda

* By entering your e-mail address you agree to receive the weekly UM agenda summary by e-mail and you acknowledge our privacy policy. You can unsubscribe at any time using the unsubscribe link or by contacting us by e-mail.