Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Github open source project recommendation method based on sparse autoencoder

A sparse autoencoder and item recommendation technology, applied in the field of software engineering recommendation systems and data mining, can solve the problems of determining user preferences, high cost, inappropriateness, etc., to reduce sparsity, improve accuracy, and improve performance.

Active Publication Date: 2018-03-16
HOHAI UNIV
View PDF4 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the social connection between developers has not been further explored, and the consideration of the characteristic attributes associated with developers and open source projects is relatively simple, and the valuable characteristics in user historical behavior data have not been fully utilized.
[0004] At the same time, it is not appropriate to use traditional recommendation algorithms to recommend suitable open source projects for developers.
First of all, the data in Github is extremely sparse. As far as open source projects are concerned, most open source projects only involve individual developers, and many open source projects are difficult to be discovered by developers; as far as developers are concerned, due to time and energy constraints , the open source projects that developers can participate in are very limited, and developers need to spend a lot of time and energy to find the projects they are interested in from a large number of open source projects
Secondly, semantic information such as comments and descriptions in Github has no obvious emotional tendency, and it is difficult to determine user preferences through semantic information. Therefore, it is not advisable to recommend open source projects for developers purely through semantic information.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Github open source project recommendation method based on sparse autoencoder
  • Github open source project recommendation method based on sparse autoencoder
  • Github open source project recommendation method based on sparse autoencoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a Github open source project recommendation method based on a sparse autoencoder. The method comprises: preprocessing data from three dimensions of a project, a user, and a project-user, and obtaining a user association degree matrix, a project association degree matrix, and a user-project association degree matrix; extracting textual information and clustering projects by analyzing text similarity; combining a collaborative filtering model and a sparse autoencoder to help developers find suitable open source projects; and taking the three matrices obtained by data preprocessing as input, obtaining two potential factor vectors through iterative neural network learning, predicting missing items in the user-project association degree matrix through the inner product of the potential factor vectors, and recommending the top N items with higher scores in the same category according to the clustering information of the open source project. According to the method disclosed by the present invention, suitable projects are recommended to the developers, the time taken by the developers to find interestied projects in massive open source projects is saved, and the performance of the developers participating in the open source projects is improved.

Description

technical field [0001] The invention relates to a Github open source project recommendation method based on a sparse autoencoder, and belongs to the technical fields of software engineering recommendation systems and data mining. Background technique [0002] Github is today's largest hosting platform for open source and proprietary software projects, and developers use it to realize social programming. The openness and flexibility of Githhub have enabled more and more software development enthusiasts to join this community, forming a huge software productivity. In the Github open source community, developers can follow other developers, bookmark or follow open source projects they are interested in, and clone projects locally for modification and updates. This enables code collaboration to be realized in different regions at different times. With the continuous growth of open source resources, a lot of reusable software has been brought to software development. But at th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06Q10/10G06F17/30
CPCG06Q10/103G06F16/35G06F16/9535
Inventor 张鹏程熊芳张雷程坤周学武金惠颖贾旸旸赵齐
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products