Class-Specific Joint Feature Screening in Ultrahigh-Dimensional Mixture Regression

Research output: Contribution to journalArticlepeer-review

Abstract

Finite mixture of regression models are ubiquitous for analyzing complex data. They aim to detect heterogeneity in the effects of a set of features on a response over a finite number of latent classes. When the number of features is large, a direct fitting of mixture regressions can be computationally infeasible and often leads to a poor interpretative value. One practical strategy is to screen out most irrelevant features before an in-depth analysis. In this article, we propose a novel method for feature screening in ultrahigh-dimensional Gaussian finite mixture of regressions. The new method is built upon a sparsity-restricted expectation-approximation-maximization algorithm, which simultaneously removes varying sets of irrelevant features from multiple latent classes. In the screening process, joint effects between features are naturally accounted and class-specific screening results are produced without ad hoc steps. These merits give the new method an edge to outperform the existing screening methods. The promising performance of the method is supported by both theory and numerical examples including a real data analysis. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

Original languageEnglish
Pages (from-to)2473-2483
Number of pages11
JournalJournal of the American Statistical Association
Volume120
Issue number552
DOIs
StatePublished - 2025

Keywords

  • Class-specific sure screening
  • Feature screening
  • Finite mixture of regressions
  • Heterogeneous data
  • Joint feature screening
  • Ultrahigh-dimensional data

Fingerprint

Dive into the research topics of 'Class-Specific Joint Feature Screening in Ultrahigh-Dimensional Mixture Regression'. Together they form a unique fingerprint.

Cite this