Unlock instant, AI-driven research and patent intelligence for your innovation.

A Method for Duplicative Detection of Public Contribution Merge Requests Based on Hybrid Similarity

A technology that mixes similarity and detection methods, applied in the field of software collaborative development, can solve problems that cannot be identified in time, achieve efficient organization, improve aggregation efficiency, and improve detection effects

Active Publication Date: 2021-03-26
NAT UNIV OF DEFENSE TECH
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Under the current mechanism, only when a developer happens to be aware of the existence of two duplicate contribution merge requests, their duplication is discovered, which causes most of the duplicate contribution merge requests to not be identified in time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Duplicative Detection of Public Contribution Merge Requests Based on Hybrid Similarity
  • A Method for Duplicative Detection of Public Contribution Merge Requests Based on Hybrid Similarity
  • A Method for Duplicative Detection of Public Contribution Merge Requests Based on Hybrid Similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0028] Such as figure 1 Shown is the schematic flow sheet of the inventive method; Concrete steps are as follows:

[0029] S1. Calculate the text similarity between the newly submitted public contribution merge request and the historical public contribution merge request, the text similarity includes title text similarity and description text similarity.

[0030] For text extracted from crowd-contributed merge request titles and descriptions, standard prepro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of software coordinative development, and discloses a public contribution combination request repeatability detecting method based on hybrid similarity. The method includes the steps of calculating the text similarity between a newly-submitted public contribution combination request and a historical public contribution combination request, calculating the variation similarity between the newly-submitted public contribution combination request and the historical public contribution, searching a public coordinative development platform for a historical repeatedcontribution data set, combining the text similarity and the variation similarity by means of a weight calculating method based on a greedy search strategy under the training of the data set to calculate the hybrid similarity of public contributions, and finally obtaining a list of historical public contribution combination requests the most probably repeated with a given public contribution combination request according to the value of the hybrid similarity. The public contribution repeatability can be detected in time, repeated artificial code inspection work is avoided, and the public contribution inspection efficiency is improved.

Description

technical field [0001] The invention belongs to the field of software collaborative development, and relates to a method for detecting repetition of public contribution merging requests based on mixed similarity. Background technique [0002] In open source communities (such as GitHub), the software development model based on large-scale group collaboration has greatly improved the efficiency of software innovation and inspired more and more developers to devote themselves to the creation process of open source software. However, this development model is a parallel and uncoordinated process. When multiple developers spontaneously contribute code to the same open source software project, if they want to achieve the same purpose, they may submit duplicate Contributed merge requests (Pull-requests), especially those popular projects that attract a large number of peripheral developers and receive a steady stream of community contributions, are more prone to this problem. Such...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/30G06F40/289G06F40/194
CPCG06F40/194G06F40/289G06F40/30
Inventor 余跃李志星尹刚王涛王怀民范强於杰张迅晖胡东阳
Owner NAT UNIV OF DEFENSE TECH