Skip to main navigation Skip to search Skip to main content

Temporal difference learning to detect unsafe system states

  • Huazhong Ning
  • , Wei Xu
  • , Yue Zhou
  • , Yihong Gong
  • , Thomas Huang
  • University of Illinois at Urbana-Champaign
  • NEC Corporation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

This paper proposes a general framework to detect unsafe states of a system whose basic realtime parameters are captured by multi-sensors. Our approach is to learn a danger level function which can be used to alert the users in advance of dangerous situations. The main challenge to this learning problem is the labelling issue, i.e., it is difficult to assign an objective danger level at each time step to the training data, except at the collapse points where a penalty can be assigned and at the successful ends where a certain reward can be assigned. In this paper, we treat the danger level as expected future reward (penalty is regarded as negative reward) and use temporal difference (TD) learning [2] to learn a function to approximate the expected future reward. The TD learning obtains the approximation by propagating the penalty/reward observable at collapse points or successful ends to the entire feature space following some constraints. Our approach is applied to, but not limited to, the application of monitoring of driving safety and the experimental results demonstrate the effectiveness of the approach.

Original languageEnglish
Title of host publication2008 19th International Conference on Pattern Recognition, ICPR 2008
StatePublished - 2008
Externally publishedYes
Event2008 19th International Conference on Pattern Recognition, ICPR 2008 - Tampa, FL, United States
Duration: 8 Dec 200811 Dec 2008

Publication series

NameProceedings - International Conference on Pattern Recognition
ISSN (Print)1051-4651

Conference

Conference2008 19th International Conference on Pattern Recognition, ICPR 2008
Country/TerritoryUnited States
CityTampa, FL
Period8/12/0811/12/08

Keywords

  • Driving safety
  • Multi-sensor
  • Temporal difference learning
  • Unsafe system state

Fingerprint

Dive into the research topics of 'Temporal difference learning to detect unsafe system states'. Together they form a unique fingerprint.

Cite this