Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation

  • Yifan Feng
  • , Jiangang Huang
  • , Shaoyi Du
  • , Shihui Ying
  • , Jun Hai Yong
  • , Yipeng Li
  • , Guiguang Ding
  • , Rongrong Ji
  • , Yue Gao

Research output: Contribution to journalArticlepeer-review

131 Scopus citations

Abstract

We introduce Hyper-YOLO, a new object detection method that integrates hypergraph computations to capture the complex high-order correlations among visual features. Traditional YOLO models, while powerful, have limitations in their neck designs that restrict the integration of cross-level features and the exploitation of high-order feature interrelationships. To address these challenges, we propose the Hypergraph Computation Empowered Semantic Collecting and Scattering (HGC-SCS) framework, which transposes visual feature maps into a semantic space and constructs a hypergraph for high-order message propagation. This enables the model to acquire both semantic and structural information, advancing beyond conventional feature-focused learning. Hyper-YOLO incorporates the proposed Mixed Aggregation Network (MANet) in its backbone for enhanced feature extraction and introduces the Hypergraph-Based Cross-Level and Cross-Position Representation Network (HyperC2Net) in its neck. HyperC2Net operates across five scales and breaks free from traditional grid structures, allowing for sophisticated high-order interactions across levels and positions. This synergy of components positions Hyper-YOLO as a state-of-the-art architecture in various scale models, as evidenced by its superior performance on the COCO dataset. Specifically, Hyper-YOLO-N significantly outperforms the advanced YOLOv8-N and YOLOv9-T with 12% APval and 9% APval improvements.

Original languageEnglish
Pages (from-to)2388-2401
Number of pages14
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume47
Issue number4
DOIs
StatePublished - 2025

Keywords

  • Hypergraph
  • hypergraph computation
  • hypergraph nerual networks
  • object detection

Fingerprint

Dive into the research topics of 'Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation'. Together they form a unique fingerprint.

Cite this