Skip to main navigation Skip to search Skip to main content

Global feature selection from microarray data using Lagrange multipliers

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

In microarray-based gene expression analysis, thousands of genes are involved to monitor their expression levels under a particular condition. In fact, however, only few of them are highly expressed, which has been proven by Golub et al. How to identify these discriminative genes effectively is a significant challenge to risk assessment, diagnosis, prognostication in growing cancer incidence and mortality. In this paper, we present a global feature selection method based on semidefinite programming model which is relaxed from the quadratic programming model with maximizing feature relevance and minimizing feature redundancy. The main advantage of relaxation is that the matrix in mathematical model only requires symmetric matrix rather than positive (or semi) definite matrix. In semidefinite programming model, each feature has one constraint condition to restrict the objective function of feature selection problem. Herein, another trick in this paper is that we utilize Lagrange multiplier as proxy measurement to identify the discriminative features instead of solving a feasible solution for the original max-cut problem. The proposed method is compared with several popular feature selection methods on seven microarray data sets. The results demonstrate that our method outperforms the others on most data sets, especially for the two hard feature selection data sets, Beast(Wang) and Medulloblastoma.

Original languageEnglish
Pages (from-to)267-274
Number of pages8
JournalKnowledge-Based Systems
Volume110
DOIs
StatePublished - 15 Oct 2016

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Gene selection
  • High-dimensional and small sample size
  • Lagrange multiplier
  • Max-cut
  • Semidefinite programming

Fingerprint

Dive into the research topics of 'Global feature selection from microarray data using Lagrange multipliers'. Together they form a unique fingerprint.

Cite this