FiGraph: A Dynamic Heterogeneous Graph Dataset for Financial Anomaly Detection

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Graph anomaly detection (GAD) detects anomalous nodes in real-world networks by capturing topological and attributive information. Although a few of benchmark datasets are publicly available, there is a lack of dynamic heterogeneous graph datasets for advanced GAD research. To address this issue, this paper presents FiGraph, a real-world dynamic heterogeneous graph with ground-truth labels for financial anomaly detection. It consists of nine graph snapshots from 2014 to 2022 and comprises 730, 408 nodes and 1, 040, 997 edges. There are five types of nodes and four types of edges. Only partial nodes (target nodes) need to be identified, and these nodes have multimodal attributes that incorporate tabular data and textual input. The target nodes that correspond to the same entity in different snapshots may have different labels. The remaining nodes do not need to be categorized, serving as background nodes without attributes. In addition, multiple relations can exist simultaneously between the same pair of nodes. For example, two companies may share investment and supply chain relations, while a company and a human may share both investment and related-party transaction relations. These characteristics make FiGraph more realistic and complex than existing GAD datasets, encouraging the development of more effective GAD models. This paper details the construction and properties of FiGraph and discusses promising use cases. The dataset is publicly available at: https://github.com/XiaoguangWang23/FiGraph.

Original languageEnglish
Title of host publicationWWW Companion 2025 - Companion Proceedings of the ACM Web Conference 2025
PublisherAssociation for Computing Machinery, Inc
Pages813-816
Number of pages4
ISBN (Electronic)9798400713316
DOIs
StatePublished - 23 May 2025
Event34th ACM Web Conference, WWW Companion 2025 - Sydney, Australia
Duration: 28 Apr 20252 May 2025

Publication series

NameWWW Companion 2025 - Companion Proceedings of the ACM Web Conference 2025

Conference

Conference34th ACM Web Conference, WWW Companion 2025
Country/TerritoryAustralia
CitySydney
Period28/04/252/05/25

Keywords

  • Anomaly Detection
  • Dynamic Heterogeneous Graphs
  • Financial Dataset
  • Financial Fraud Detection

Fingerprint

Dive into the research topics of 'FiGraph: A Dynamic Heterogeneous Graph Dataset for Financial Anomaly Detection'. Together they form a unique fingerprint.

Cite this