Chinese crowdsourcing test report clustering method based on semantic similarity

This semantic-similarity-based clustering method for Chinese crowdsourcing test reports addresses false positives and duplicate reports, along with the heavy human resource consumption and low efficiency of manual review. It advances the automation of test report review and improves review efficiency.

Pending Publication Date: 2021-06-15
ARMY ENG UNIV OF PLA

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a Chinese crowdsourcing test report clustering method based on semantic similarity. It addresses the technical problems that existing Chinese crowdsourcing test reports contain false positives and duplicate reports, and that manual review consumes substantial human resources and is inefficient.




Detailed Description of the Embodiments

[0046] The present invention will be further described below in conjunction with specific embodiments. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0047] The present invention provides a Chinese crowdsourcing test report clustering method based on semantic similarity. As shown in Figure 1 and Figure 2, the method takes as input the Chinese test report dataset DataSet, the training set TrainSet, and the similarity matrix weight μ. The steps are as follows:

[0048] Step 1: State idealized assumptions about the crowdsourcing test process, so that the clustering method meets the practical needs of crowdsourcing test report analysis:

[0049] (1) It is assumed that, after a large number of testers have taken part in multiple rounds of crowdsourcing testing, essentially all defects present at this stage have been detected;

[0050] (2) After a round of crowdsour...


Abstract

The invention discloses a Chinese crowdsourcing test report clustering method based on semantic similarity. The method comprises the following steps: inputting a set of Chinese test reports, removing invalid test reports, and segmenting the valid test reports into sentences to obtain a test report sentence-pair data set; constructing a test report sentence-pair training set and training a semantic similarity model on it to obtain a semantic similarity calculation model; feeding the sentence-pair data set into the semantic similarity calculation model to obtain a test report similarity matrix; setting the expected number of bugs for the test project and performing spectral clustering on the test report similarity matrix to obtain test report clusters; and decomposing the test report similarity matrix by cluster to obtain a similarity matrix for each cluster, then selecting in each cluster the test reports with the Top-5 accumulated similarity scores as the final output. The method advances the automation of test report review on crowdsourcing test platforms and effectively improves review efficiency.
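The final selection step in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the report-level similarity matrix and the cluster labels (from the spectral clustering step) are already available, and it assumes that a report's accumulated similarity score is the sum of its similarities to the other reports in the same cluster, with self-similarity excluded. The function name `top_k_per_cluster` and the sample data are hypothetical.

```python
from collections import defaultdict


def top_k_per_cluster(similarity, labels, k=5):
    """Pick the k reports with the highest accumulated in-cluster similarity.

    similarity: square list-of-lists; similarity[i][j] is the score between
                report i and report j (assumed precomputed).
    labels:     cluster label for each report (assumed to come from the
                spectral clustering step).
    Returns a dict mapping each cluster label to a ranked list of report indices.
    """
    # Group report indices by cluster label.
    clusters = defaultdict(list)
    for idx, label in enumerate(labels):
        clusters[label].append(idx)

    result = {}
    for label, members in clusters.items():
        # Restrict the matrix to this cluster and accumulate each report's
        # similarity to the other members (assumption: self-similarity excluded).
        scores = {
            i: sum(similarity[i][j] for j in members if j != i)
            for i in members
        }
        # Highest score first; ties broken by report index for determinism.
        ranked = sorted(members, key=lambda i: (-scores[i], i))
        result[label] = ranked[:k]
    return result


if __name__ == "__main__":
    # Hypothetical 4-report example: reports 0-2 describe one bug, report 3 another.
    sim = [
        [1.0, 0.9, 0.8, 0.1],
        [0.9, 1.0, 0.7, 0.2],
        [0.8, 0.7, 1.0, 0.1],
        [0.1, 0.2, 0.1, 1.0],
    ]
    labels = [0, 0, 0, 1]
    print(top_k_per_cluster(sim, labels, k=2))
```

With `k=5`, as in the abstract, a cluster holding fewer than five reports simply returns all of its members in ranked order.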

Description

Technical field

[0001] The invention relates to the field of communication technology, in particular to a Chinese crowdsourcing test report clustering method based on semantic similarity.

Background

[0002] In crowdsourced software testing, crowdsourcing workers discover problems that arise while using the software, write them up as test reports, and submit the reports to the party whose software is under test in exchange for remuneration. A crowdsourcing testing project usually receives hundreds or thousands of test reports. According to research, fewer than 50% of submitted software problem reports reveal real defects in the software, and on average 82% of crowdtest reports are duplicates. Manually checking a report set that contains so many duplicates and false positives costs the tested party a great deal of time and labor. Therefore, it is very necessary to remove duplicate reports and false positive reports efficiently and automatica...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/62, G06F11/36, G06F40/30
CPC: G06F11/3692, G06F40/30, G06F18/23213
Inventor: 黄松, 陈浩, 史涯晴, 郑长友, 王梅娟, 吴开舜, 刘语婵, 骆润
Owner ARMY ENG UNIV OF PLA