Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Group discovery method based on data enhancement and non-negative matrix sparse decomposition

A sparse decomposition and non-negative matrix technology, applied in the field of big data, can solve problems such as difficult to distinguish nodes, difficult to judge node membership group relationship, etc., to achieve the effect of improving accuracy and interpretability

Inactive Publication Date: 2019-08-16
FUDAN UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Different from the non-sparse decomposition method, the probability that some nodes may belong to multiple groups is similar, which makes it difficult to judge the relationship between the nodes belonging to the group; on the overlapping group discovery task, the sparse decomposition method can make each node belong to different groups. The probability of the subordinate group is close to 0, instead of the sparse decomposition method, each group may get a non-zero probability, making it difficult to distinguish which groups the node belongs to

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Group discovery method based on data enhancement and non-negative matrix sparse decomposition
  • Group discovery method based on data enhancement and non-negative matrix sparse decomposition
  • Group discovery method based on data enhancement and non-negative matrix sparse decomposition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The specific implementation of the group discovery method based on data enhancement and non-negative matrix factorization will be given below.

[0042] (1) Data preparation: Select a data set Cornell (https: / / linqs-data.soe.ucsc.edu / public / lbc / WebKB.tgz) from the public website of the University of California, Santa Cruz. The data set contains an adjacency matrix A and node attribute matrix X. The data set gives the groups corresponding to the nodes (courses, educational affairs, students, engineering and staff groups), and the adjacency matrix A represents the link relationship between these five groups. The node attribute matrix X represents the attribute of the website, which is represented by a word vector with a value of 0-1. For example, the dictionary has a total of 1703 words, sorted as: "homework", "student", "submit",..., if there is a The words in the dictionary are represented by 1 in the corresponding position, and vice versa, are represented by 0. For ex...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of big data, and particularly relates to a group discovery method based on data enhancement and non-negative matrix sparse decomposition. According to themethod, neighbor distribution of nodes with different attributes in a network with attributes of the nodes is learned through the recurrent neural network; a neighbor mode formed by similar neighbor distribution of a plurality of nodes is extracted to enhance data, a non-negative matrix is subjected to sparse decomposition through alternating least squares and Glynov regularization to find a group, and the group discovery accuracy is improved. According to the method, the edges of the network topology structure are filled through data enhancement, so that all-zero lines are not iterated as much as possible during matrix decomposition, and iteration stability is ensured; moreover, sparse group representation is obtained through a non-negative matrix sparse decomposition method, a large number of non-zero probabilities cannot be obtained, the interpretability of the group discovery method is enhanced, and the problem that a large number of non-zero probabilities are difficult to interpret group membership relations is solved.

Description

technical field [0001] The invention belongs to the technical field of big data, and in particular relates to a network data enhancement based on deep learning and a group discovery method of non-negative matrix sparse decomposition. Background technique [0002] In reality, there are connections and interactions between various objects, and these objects and the connections between them can be abstracted into a network structure, or called a graph structure. Connections or interactions are represented by edges in the network, while these objects and their properties are represented as nodes and their node properties in the network. Analyzing these network data to find out the set of similar points among them is called group discovery task. Group discovery is an important task in big data mining. For example, in a social network, network nodes represent everyone, while edges in the network represent the social relationships that exist between them, such as classmates, relat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q50/00G06F16/901G06N3/04
CPCG06Q50/01G06F16/9024G06N3/045
Inventor 熊贇陈惠迪朱扬勇
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products