SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model

  • Ensheng Shi
  • , Yanlin Wang
  • , Fengji Zhang
  • , Bei Chen
  • , Hongyu Zhang
  • , Yanli Wang
  • , Daya Guo
  • , Lun Du
  • , Shi Han
  • , Dongmei Zhang
  • , Hongbin Sun

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Software development plays a crucial role in driving innovation and efficiency in modern societies. To meet the demands of this dynamic field, there is a growing need for an effective software development assistant. However, existing large language models represented by ChatGPT suffer from limited accessibility, including training data and model weights. Although other large open-source models like LLaMA have shown promise, they still struggle with understanding human intent. In this paper, we present SoTaNa, an open-source software engineering instruction-tuned model. SoTaNa utilizes ChatGPT to generate high-quality instruction-based data for the domain of software engineering and employs a parameter-efficient fine-tuning approach to enhance the open-source foundation model, LLaMA. We evaluate the effectiveness of SoTaNa in answering Stack Overflow questions and demonstrate its capabilities. Additionally, we discuss its capabilities in code summarization and generation, as well as the impact of varying the volume of generated data on model performance. Notably, SoTaNa can run on a single GPU, making it accessible to a broader range of researchers. Our code, model weights, and data are publicly available at https://github.com/DeepSoftwareAnalytics/SoTaNa.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE/ACM 2nd International Conference on AI Foundation Models and Software Engineering, FORGE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages26-37
Number of pages12
ISBN (Electronic)9798331502119
DOIs
StatePublished - 2025
Event2nd IEEE/ACM International Conference on AI Foundation Models and Software Engineering, FORGE 2025 - Ottawa, Canada
Duration: 27 Apr 202528 Apr 2025

Publication series

NameProceedings - 2025 IEEE/ACM 2nd International Conference on AI Foundation Models and Software Engineering, FORGE 2025

Conference

Conference2nd IEEE/ACM International Conference on AI Foundation Models and Software Engineering, FORGE 2025
Country/TerritoryCanada
CityOttawa
Period27/04/2528/04/25

Keywords

  • Data Generation
  • Instruction Fine-tuning
  • Large Language Models
  • Software Development Assistant

Fingerprint

Dive into the research topics of 'SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model'. Together they form a unique fingerprint.

Cite this