
Research Article
Enrich Code Search Query Semantics with Raw Descriptions
@INPROCEEDINGS{10.1007/978-3-031-54521-4_16, author={Xiangzheng Liu and Jianxun Liu and Haize Hu and Yi Liu}, title={Enrich Code Search Query Semantics with Raw Descriptions}, proceedings={Collaborative Computing: Networking, Applications and Worksharing. 19th EAI International Conference, CollaborateCom 2023, Corfu Island, Greece, October 4-6, 2023, Proceedings, Part I}, proceedings_a={COLLABORATECOM}, year={2024}, month={2}, keywords={Code search Query expansion Software engineering Deep learning}, doi={10.1007/978-3-031-54521-4_16} }
- Xiangzheng Liu
Jianxun Liu
Haize Hu
Yi Liu
Year: 2024
Enrich Code Search Query Semantics with Raw Descriptions
COLLABORATECOM
Springer
DOI: 10.1007/978-3-031-54521-4_16
Abstract
Code search can recommend relevant source code according to the development intention (query statement) of the demander, thereby improving the efficiency of software development. In the research of deep code search model, code description is used to replace query sentences for training. However, the heterogeneity existing between the query statement and the code description will seriously affect the accuracy of the code search model. In order to make up for the shortcomings of code search, this paper proposes a sentence-integrated query expansion method—SIQE. Unlike previous query expansion methods that focus on word-level expansion, SIQE uses the entire code description fragment as the source of query expansion. And by learning the mapping relationship between the query statement and the code description, the heterogeneity problem between them is compensated. In order to verify the effect of the proposed model in code search tasks, the article conducts code search experiments and analyzes on two languages: python and java. Experimental results show that, compared with the baseline model, SIQE has higher code search results. Therefore, the SIQE model can effectively improve the search effect of query statements, improve the accuracy of code search, and further improve the development of software engineering.