Medical Time-Series Data Generation using Generative Adversarial Networks - Laboratoire de recherche en informatique. Équipe: Apprentissage et optimisation Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Medical Time-Series Data Generation using Generative Adversarial Networks

Résumé

Medical data is rarely made publicly available due to high deidentification costs and risks. Access to such data is highly regulated due to it's sensitive nature. These factors impede the development of data-driven advancements in the healthcare domain. Synthetic medical data which can maintain the utility of the real data while simultaneously preserving privacy can be an ideal substitute for advancing research. Medical data is longitudinal in nature, with a single patient having multiple temporal events, influenced by static covariates like age, gender, comorbidities, etc. Extending existing time-series generative models to generate medical data can be challenging due to this influence of patient covariates. We propose a workflow wherein we leverage existing generative models to generate such data. We demonstrate this approach by generating synthetic versions of several time-series datasets where static covariates influence the temporal values. We use a state-of-the-art benchmark as a comparative baseline. Our methodology for empirically evaluating synthetic timeseries data shows that the synthetic data generated with our workflow has higher resemblance and utility. We also demonstrate how stratification by covariates is required to gain a deeper understanding of synthetic data quality and underscore the importance of including this analysis in evaluation of synthetic medical data quality.
Fichier principal
Vignette du fichier
AIME_2020_Submission.pdf (945.67 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03158549 , version 1 (04-03-2021)

Identifiants

  • HAL Id : hal-03158549 , version 1

Citer

Saloni Dash, Andrew Yale, Isabelle Guyon, Kristin P Bennett. Medical Time-Series Data Generation using Generative Adversarial Networks. AIME 2020 - International Conference on Artificial Intelligence in Medicine, Aug 2020, Minneapolis, United States. pp.382-391. ⟨hal-03158549⟩
152 Consultations
1321 Téléchargements

Partager

Gmail Facebook X LinkedIn More