Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

OSS (Open Source software) project developer prediction method based on Email networks

An open source software and prediction method technology, which is applied in the field of network science and machine learning, can solve the problem that it is impossible to accurately predict whether a programmer will be promoted to the actual developer of the project.

Inactive Publication Date: 2017-03-22
ZHEJIANG UNIV OF TECH
View PDF10 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to overcome the inability of existing machine learning methods to accurately predict whether a certain programmer will be promoted to the actual developer of the project, the purpose of the present invention is to predict the currently promoted developer of the project based on the Email communication data in the OSS project (from a mere discussant turned into an actual code contributor)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • OSS (Open Source software) project developer prediction method based on Email networks
  • OSS (Open Source software) project developer prediction method based on Email networks
  • OSS (Open Source software) project developer prediction method based on Email networks

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be further described below in conjunction with the accompanying drawings of the description.

[0064] refer to Figure 1 ~ Figure 3 , a method for predicting developers of an open source software project based on an Email network, the present invention uses OSS project-related data on Apache to predict project developers. Table 1 shows the main data information of the OSS project, where T o is the time when the first email of the project is sent, T f is the end time of the project, N u is the total number of programmers in the project, N d Number of developers, N e is the total number of emails. attached figure 2 The ROC curve and the area under the curve AUC are used as the prediction results of various algorithms in the evaluation index, where the X-axis is the "False Positive Rate" (False Positive Rate, referred to as FPR), and the Y-axis is the "True Positive Rate" (TruePositive Rate). Rate, referred to as TPR), the algorithm with ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An OSS project developer prediction method based on Email networks comprises the following steps that 1) different types of Email networks are constructed; 2) different network node ordering algorithms are used to calculate the different nodes and obtain corresponding characteristic values, and a network topology property is used to obtain the characteristic vector centrality and clustering coefficient of each node; 3) the characteristic values obtained via different algorithms and ranks of topology property parameters are normalized and serve as sample features; 4) part of the nodes serves as a sample and input to a machine learning classifier, and a Bayesian algorithm is used for learning; and 5) residual node samples are predicted. The method is aimed at the characteristic that an OSS project includes a lot of participants but a few core developers, developers in different OSS projects can be predicted effectively, and compared with a network node ordering algorithm, the accuracy is improved substantially.

Description

technical field [0001] The invention relates to the fields of network science and machine learning, in particular to a method for predicting developers of open source software projects based on Email networks. Background technique [0002] With the development of science and technology, many advantages of open source software (Open Source Software, OSS) projects are gradually recognized by people, reference 1 (Q.Xuan, M.Gharehyazie, P.T.Devanbu, and V.Filkov, "Measuring the Effect of Social Communication on Individual Working Rhythms: A Case Study of Open Source Software," in International Conference on Social Informatics, 2012, pp.78-85.1, that is, Q.Xuan, M.Gharehyazie, P.T.Devanbu, and V.Filkov, social situation on the working rhythm Impact Research: A Case Study of Open Source Software Projects, in International Conference on Social Informatics, 2012, pp.78-85.1). In order to effectively maintain the success of an OSS project, it is particularly important to use its pub...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06Q10/04
CPCG06Q10/04G06F18/23G06F18/24155G06F18/214
Inventor 宣琦李甫宪周鸣鸣陈风雷李嘉南吴哲夫傅晨波翔云俞立
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products