MKPLS: Multiple kernel partial least squares for transcription factor binding site identification

  • Ling Chai
  • , Qinke Peng
  • , Xiongpan Zhang
  • , Laiyi Fu
  • , Shiquan Sun

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The core of computational identification of transcription factor binding sites (TFBSs) is to deal with high dimensional and small sample size data and to handle the complex nonlinear relationships between features. Partial least squares (PLS) performs well in reducing dimensionality as well as explaining relations between multiple variables. Besides, kernel methods are widely applied to non-linear relationship process. Therefore, we reasonably introduce kernel partial least squares(KPLS) as a feature selection method for TFBS identification. Moreover, to lower the instability caused by the choice of kernel functions in conventional kernel-based methods, we combine multiple kernel methods with KPLS to develop a new method named multiple kernel PLS (MKPLS) to perform feature selection. PSO is utilized to estimate the parameters of linear combination of multiple kernels, furthermore, SVM is applied here to build an identification framework. 52 Escherichia coli k-12 TFBS datasets are used to test MKPLS as well as KPLS and Mutual Information(MI). Results demonstrate that MKPLS acquires a better ability to pick out the key features and can obtain a noticeable identification accuracy.

Original languageEnglish
Title of host publicationProceedings - 2017 Chinese Automation Congress, CAC 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2939-2944
Number of pages6
ISBN (Electronic)9781538635247
DOIs
StatePublished - 29 Dec 2017
Event2017 Chinese Automation Congress, CAC 2017 - Jinan, China
Duration: 20 Oct 201722 Oct 2017

Publication series

NameProceedings - 2017 Chinese Automation Congress, CAC 2017
Volume2017-January

Conference

Conference2017 Chinese Automation Congress, CAC 2017
Country/TerritoryChina
CityJinan
Period20/10/1722/10/17

Keywords

  • feature selection
  • high dimensional and small sample size data
  • multiple kernel PLS
  • transcription factor binding site

Fingerprint

Dive into the research topics of 'MKPLS: Multiple kernel partial least squares for transcription factor binding site identification'. Together they form a unique fingerprint.

Cite this