Learning robust patient representations from multi-modal electronic health records: A supervised deep learning approach

  • Xianli Zhang
  • , Buyue Qian
  • , Yang Li
  • , Yang Liu
  • , Xi Chen
  • , Chong Guan
  • , Chen Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Scopus citations

Abstract

Predicting patients’ future outcomes by analyzing Electronic health records (EHRs) is a hot topic in machine learning. The key challenge in this area is how to transform high dimensional, redundant, and heterogeneous EHRs into appropriate representations. In this paper, we argue for four desired properties of ideal patient representation learning, which are completeness, cross-modality invariance, anti-nuisance, and personality maintenance. To obtain such properties, We propose a Supervised Deep Patient Representation Learning Framework (SDPRL) to learn patients’ representations that incorporates complete semantics of health conditions by using multi-modal EHR data. Furthermore, we propose to maximize the mutual information (MI) among each pair of different modal representations, as well as minimizing the task-specific loss function. This not only keeps the task-relevant semantic information into the learned representations, but also makes the resulting representations to be relatively invariant across the modalities, anti-nuisance, and maintain the personality. With experiments conducted on the publicly available MIMIC-III dataset on the mortality prediction and forecasting the length of stay (LOS) tasks, we empirically demonstrate that the proposed SDPRL achieves higher prediction performance than baseline frameworks. Moreover, we demonstrate that SDPRL can yield the desired properties we argued. It can well-handle the modal-missing issue in the test phase, as well as getting advance to the goal of personalized medicine.

Original languageEnglish
Title of host publicationSIAM International Conference on Data Mining, SDM 2021
PublisherSiam Society
Pages585-593
Number of pages9
ISBN (Electronic)9781611976700
StatePublished - 2021
Event2021 SIAM International Conference on Data Mining, SDM 2021 - Virtual, Online
Duration: 29 Apr 20211 May 2021

Publication series

NameSIAM International Conference on Data Mining, SDM 2021

Conference

Conference2021 SIAM International Conference on Data Mining, SDM 2021
CityVirtual, Online
Period29/04/211/05/21

Fingerprint

Dive into the research topics of 'Learning robust patient representations from multi-modal electronic health records: A supervised deep learning approach'. Together they form a unique fingerprint.

Cite this