Fall 2025

Data Science Seminar: Time Series Imputation

Lecturer: Mourad Khayati

Teaching assistant: Quentin Nater

Level: Master

Academic year: Fall 2025

Overview

Structure

Evaluation and Expectations

Schedule

List of Papers

Overview

The data science seminar involves presentations covering recent topics in data science. The area of this year’s seminar is time series imputation. As part of the seminar, we will study research papers that propose algorithms for imputing missing values. These papers present methods for reconstructing incomplete sensor data by applying various replacement strategies to estimate missing segments.

Imputation offers benefits on two levels. At the data processing level, the completed time series can be adequately utilized in a wide range of Machine Learning (ML) tasks, such as classification and forecasting. At the data management level, properly imputed time series can be more effectively stored and maintained, one reason why many Time Series Database Systems (TSDBs) have begun to incorporate native support for missing value imputation.

Structure

The goal for the students is to learn how to critically read and study a research paper, describe it in a report, and present it in front of an audience. Under supervision, students will select one paper to study and compare it with related work. This seminar aims to help students gather in-depth knowledge of an advanced topic and develop the skills required to describe a complex problem from the time series field in the form of both a presentation and a written report.

Students will also learn how to integrate an algorithm into a library, benchmark it against other algorithms, and perform the required unit tests. For this purpose, we will use ImputeGAP, a comprehensive library designed for time series imputation analysis.

IMPORTANT NOTE: The papers will be distributed on a first-come, first-served basis.

Evaluation and Expectations

The final grade depends on the quality of the report, presentation, integration quality, and active participation during the seminar. Each participant prepares a self-contained report of at least five pages and gives a presentation of 30 minutes. The report should describe the proposed algorithm in detail. The report might contain a small running example, counterexample(s), and should explore the extreme cases where the studied algorithm would perform best and worst. The reproducibility consists of reproducing the same set of experiments introduced in the paper using a different setup (dataset, metric, parameters, etc.).

Advice on how to:

Write the report.
Prepare the presentation.

IMPORTANT NOTE: Attendance is mandatory for the two-class seminar sessions. The total number of participants will be limited to 20.

Schedule

Kickoff Meeting. Date: Tue, 23.09.2025, 14:15-16:00, room: E230

Organization of the seminar and paper assignment

----------------------------------------------------------------------

Additional Introductory Meeting. Date: Tue, 07.10.2025, 14:15-16:00, room: C421

Organization of the seminar and library introduction

----------------------------------------------------------------------

Date: Tue, 11.11.2025
Report deadline Batch 1

Date: Tue, 18.11.2025, all day, room: C433

Office meeting with students from Batch 1

First Presentation Session. Date: Tue, 25.11.2025, 14:15-19:00, room: PER17, 001

Presentations of Batch 1

----------------------------------------------------------------------

Date: Tue, 25.11.2025
Report deadline of Batch 2

Date: Tue, 02.12.2025, all day, room: C433
Office meeting with students from Batch 2

Second Presentation Session. Date: Tue, 09.12.2025, 14:15-19:00, room: PER17, 001

Presentations of Batch 2

----------------------------------------------------------------------

Deadline Final Report. Date: Tue, 13.01.2026

----------------------------------------------------------------------

Code Inspection. Date: Tue, 20.01.2026, all day, room: C433

Paper Assignment

The papers will be distributed on a first-come, first-served basis. To select one paper from the list of papers, please use the following link.

Paper	Presentation Date	Presenter
SAITS: Self-attention-based imputation for time series, ESWA'23	25.11.2025	Kachuriak Volodymyr
Missing Value Imputation on Multidimensional Time Series, VLDB'21	25.11.2025	Sinthuja Vijayananthan
BayOTIDE: Bayesian Online Multivariate Time Series Imputation with Functional Decomposition, ICML'24	25.11.2025	Daksh Patel
Biased Temporal Convolution Graph Network for Time Series Forecasting with Missing Values, ICLR 2024	25.11.2025	Nicolas Wyss
NuwaTS: a Foundation Model Mending Every Incomplete Time Series, arXiv'24	25.11.2025	Carla Malo
Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural Networks, ICLR'22	25.11.2025	Jue Wang
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation, NeurIPS'21	09.12.2025	Emma Chambers
ReCTSi: Resource-efficient Correlated Time Series Imputation via Decoupled Pattern Learning and Completeness-aware Attentions, KDD'24 (No repro)	09.12.2025	Alejandro Nardo González
Gp-vae: Deep probabilistic time series imputation, AISTATS'20	09.12.2025	Sebastian Käslin
Networked Time Series Imputation via Position-aware Graph Enhanced Variational Autoencoders, KDD'23 (No repro)	09.12.2025	Sanika Deore
Mining of Switching Sparse Networks for Missing Value Imputation in Multivariate Time Series, KDD'24	09.12.2025	Flaminia Trinca
Gp-vae: Deep probabilistic time series imputation, AISTATS'20	09.12.2025	Alba Adili
TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis, ICLR'23 (No repro)	09.12.2025	Ghulam Hussain Wabil
BRITS: Bidirectional Recurrent Imputation for Time Series, NeurIPS’18	09.12.2025	Majlinda Blaca
SSD-TS: Exploring the potential of linear state space models for diffusion models in time series imputation, KDD'25	09.12.2025	Maurice Amon
Missing Value Imputation on Multidimensional Time Series, VLDB'21	-	-
Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation, VLDB'24	-	-
Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural Networks, ICLR'22	-	-
PriSTI: A Conditional Diffusion Framework for Spatiotemporal Imputation, ICDE'23	-	-
DIM-SUM: Dynamic IMputation for Smart Utility Management, VLDB'25	-	-

* Papers marked with (no repro) require an additional compatibility adaptation and are exempt from the reproducibility task.