URL-SemCom: An Alert Identification Model Based on URL Semantic Comprehension

  • Hao Huang
  • , Jin'ao Shang
  • , Zi'an Luo
  • , Xiaozhi Deng
  • , Yunfan Yang
  • , Qinqin Wu
  • , Chenwei Yang
  • , Yang Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As power systems continue to expand, the number of security alerts generated by intrusion detection system (IDS) has surged, making it increasingly challenging for security operation analysts in identifying genuine network intrusions among the vast number of alerts. Existing methods typically rely on machine learning to classify alerts, but such models often lack interpretability. To address this issue, we propose a novel framework called URL-SemCom, which employs a language model to understand the semantic information within the Uniform Resource Locator (URL) in alerts. We expand the language model's vocabulary with commonly used URL terms and design a specialized enhancement task. Additionally, we propose a cost-sensitive strategy to mitigate the poor performance caused by the imbalance of positive and negative samples in real-world power system data during the model training process. Finally, we employ an Adaptive boosting (Adaboost) classifier to improve the model's accuracy in classifying high-dimensional vectors. Comprehensive experiments demonstrate that our method significantly enhances the effectiveness of alert identification, providing a robust tool for improving cybersecurity measures in power systems.

Original languageEnglish
Title of host publicationICNSC 2024 - 21st International Conference on Networking, Sensing and Control
Subtitle of host publicationArtificial Intelligence for the Next Industrial Revolution
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350365221
DOIs
StatePublished - 2024
Event21st International Conference on Networking, Sensing and Control, ICNSC 2024 - Hangzhou, China
Duration: 18 Oct 202420 Oct 2024

Publication series

NameICNSC 2024 - 21st International Conference on Networking, Sensing and Control: Artificial Intelligence for the Next Industrial Revolution

Conference

Conference21st International Conference on Networking, Sensing and Control, ICNSC 2024
Country/TerritoryChina
CityHangzhou
Period18/10/2420/10/24

Keywords

  • Alert Identification
  • Ensemble Learning
  • Intrusion Detection Systems (IDS)
  • Large Language Model (LLM)
  • Uniform Resource Locator (URL)

Fingerprint

Dive into the research topics of 'URL-SemCom: An Alert Identification Model Based on URL Semantic Comprehension'. Together they form a unique fingerprint.

Cite this