Software code search oriented query statement regenerating method

A query statement and software code technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as complex keyword relationships, low precision, and interfering vocabulary, so as to improve understanding and efficiency, and apply The effect of broadness and accuracy improvement

Active Publication Date: 2016-08-31
YANGZHOU UNIV
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the relationship between keywords in the code base is very complicated. For example, code class names and method names are mostly composed of compound words. Each developer’s naming habits are also different, so it is only based on the position between keywords. The relationship cannot effectively find the relevant words of the query statement, and even interferes with the vocabulary and affects the accuracy of the query statement regeneration
In addition, Bunyamin et al. used machines to automatically select the way to form new query statements. Compared with manual selection, its accuracy is low, and even due to the selection bias of the machine, the query results run counter to the original query statement.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software code search oriented query statement regenerating method
  • Software code search oriented query statement regenerating method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Technical thinking of the present invention is:

[0023] The method of the present invention is to regenerate the query sentence, and effectively expand the relevant vocabulary for the query sentence based on the complex relationship of keywords in the software code base, and further screen the expanded words in combination with a manual method, so as to effectively regenerate the query sentence.

[0024] Attached below figure 1 , 2 The technical scheme of the present invention is described in detail:

[0025] Step (1) Divide the software code base into a pure code document base and a pure comment document base, and perform preprocessing with the query sentence input by the user, specifically including entry division, root processing, gerund synonyms, synonyms query, vocabulary expansion, etc. etc. to generate software code segment lexicon and query sentence lexicon;

[0026] For example: the result of preprocessing the description "Cookies set for www.foo.com.are se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a software code search oriented query statement regenerating method. The method includes the following steps: performing pretreatment on codes and comments in a software code library and query statements input by a user; extracting compound words in the software code library, and defining two or more keywords divided by the compound words as a heterogeneous relationship; defining synonyms as a homogeneous relationship; searching homogeneous keywords and heterogeneous keywords of all the keywords in the query statements, and visualizing the homogeneous keywords and the heterogeneous keywords so as to allow the user to select more suitable keywords which are the final result. The method overcomes the defects of low accuracy, selection deviation and complex relationships of the conventional methods; the method can effectively search relevant information in the software code library, expand keywords included in original query statements, provide code snippets of the keywords, achieve information query and expansion, and effectively improve software understanding level of a software maintainer and the efficiency.

Description

technical field [0001] The invention belongs to the field of software understanding, in particular to a method for regenerating query sentences oriented to software code retrieval. Background technique [0002] In software engineering, it takes a lot of time for software maintainers to understand large sections of software code, and each software developer has their own naming style, including abbreviations and links of phrases, which greatly increases the difficulty of software maintenance. Therefore, in order to facilitate the understanding of software code, we query the software code base to more accurately retrieve relevant information for reference by software developers, so that software maintenance can proceed smoothly. Therefore, the quality of query statements determines whether we can accurately retrieve the key information that maintainers want. However, the constant replacement of software maintenance personnel and the increasingly large software source code mak...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/24534
Inventor 孙小兵邱馨贤杨辉孙伟松
Owner YANGZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products