跳到主要导航 跳到搜索 跳到主要内容

SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model

  • Ensheng Shi
  • , Yanlin Wang
  • , Fengji Zhang
  • , Bei Chen
  • , Hongyu Zhang
  • , Yanli Wang
  • , Daya Guo
  • , Lun Du
  • , Shi Han
  • , Dongmei Zhang
  • , Hongbin Sun
  • Xi'an Jiaotong University
  • Sun Yat-Sen University
  • Microsoft USA
  • Chongqing University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Software development plays a crucial role in driving innovation and efficiency in modern societies. To meet the demands of this dynamic field, there is a growing need for an effective software development assistant. However, existing large language models represented by ChatGPT suffer from limited accessibility, including training data and model weights. Although other large open-source models like LLaMA have shown promise, they still struggle with understanding human intent. In this paper, we present SoTaNa, an open-source software engineering instruction-tuned model. SoTaNa utilizes ChatGPT to generate high-quality instruction-based data for the domain of software engineering and employs a parameter-efficient fine-tuning approach to enhance the open-source foundation model, LLaMA. We evaluate the effectiveness of SoTaNa in answering Stack Overflow questions and demonstrate its capabilities. Additionally, we discuss its capabilities in code summarization and generation, as well as the impact of varying the volume of generated data on model performance. Notably, SoTaNa can run on a single GPU, making it accessible to a broader range of researchers. Our code, model weights, and data are publicly available at https://github.com/DeepSoftwareAnalytics/SoTaNa.

源语言英语
主期刊名Proceedings - 2025 IEEE/ACM 2nd International Conference on AI Foundation Models and Software Engineering, FORGE 2025
出版商Institute of Electrical and Electronics Engineers Inc.
26-37
页数12
ISBN(电子版)9798331502119
DOI
出版状态已出版 - 2025
活动2nd IEEE/ACM International Conference on AI Foundation Models and Software Engineering, FORGE 2025 - Ottawa, 加拿大
期限: 27 4月 202528 4月 2025

出版系列

姓名Proceedings - 2025 IEEE/ACM 2nd International Conference on AI Foundation Models and Software Engineering, FORGE 2025

会议

会议2nd IEEE/ACM International Conference on AI Foundation Models and Software Engineering, FORGE 2025
国家/地区加拿大
Ottawa
时期27/04/2528/04/25

学术指纹

探究 'SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model' 的科研主题。它们共同构成独一无二的指纹。

引用此