Heterogeneous network embedding method for reserving label information based on node signature

A technology of label information and heterogeneous network, which is applied in special data processing applications, other database retrieval, other database indexing, etc., and can solve the problems of professional domain knowledge relying on label information independence and insufficient consideration

Pending Publication Date: 2019-12-10
NANKAI UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to solve the existing heterogeneous network embedding method's dependence on professional domain knowledge of network data, dependence on random walk strategy selection, and insufficient consideration of label information independence, and meet the requirements of improving the accuracy of downstream machine learning tasks Requirements, propose a heterogeneous network embedding method based on node signatures to preserve label information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Heterogeneous network embedding method for reserving label information based on node signature
  • Heterogeneous network embedding method for reserving label information based on node signature
  • Heterogeneous network embedding method for reserving label information based on node signature

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0074] Example 1: A heterogeneous network embedding method based on node-directly-connected neighborhoods that retain label information

[0075] 1. Perform prime-digit dictionary mapping for all label types in the network

[0076] we are like figure 2 For the heterogeneous network shown, the prime number dictionary mapping is performed according to the method of the present invention. figure 2 Selected from a portion of a telephone communication network diagram. The node labels in the original graph include core user and edge user labels, and the edge labels include short-duration calls and long-duration calls. Core users start with L c Indicates that edge users start with L p Indicates that long-term calls are preceded by L l Indicates that short-term calls are preceded by L s express.

[0077] Therefore, the label types are mapped to 4 prime numbers from small to large, namely {2, 3, 5, 7}. Use the label type as the index to store the corresponding relationship in ...

Embodiment 2

[0095] Example 2: Heterogeneous Network Embedding Method Based on Node Egocentric Network Preserving Label Information

[0096] 1. Perform prime-digit dictionary mapping for all label types in the network

[0097] we are like figure 2 For the heterogeneous network shown, the prime number dictionary mapping is performed according to the method of the present invention. figure 2 Selected from a portion of a telephone communication network diagram. In the original picture, the four labels of core users, marginal users, long-duration calls, and short-duration calls are represented by L c , L p , L l , L s express. Map the label types one by one to the four prime numbers {2,3,5,7} from small to large, assuming that Figure 4 Shown, L c , L p , L l , L s The four labels are respectively mapped to {2,3,5,7} in turn. The corresponding relationship is indexed by the tag type and stored in the form of a dictionary.

[0098] 2. Extract the egocentric network label set of n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a heterogeneous network embedding method for reserving label information based on node signature. During network embedding, high-dimensional graph data is mapped to low-dimensional vectors so that that a machine learning algorithm is adopted to effectively the high-dimensional graph data. The method comprises the following steps: carrying out prime number dictionary mappingon all label types in a network; extracting a neighborhood label set of network nodes; constructing a node signature vector; and building network node representations. The network structure, semantics and label information of heterogeneous graphs are comprehensively utilized. The idea of digital signature and the characteristics of prime numbers are used. A network representation learning framework of the heterogeneous graphs is constructed, reservation of network node and edge label information on the heterogeneous graph is realized, subsequent node clustering, classification, link prediction and other machine learning tasks are performed according to learned heterogeneous network node representations, and the existing homogeneous and heterogeneous network embedding method can be universally expanded and improved.

Description

technical field [0001] The invention belongs to the technical field of graph data processing. Background technique [0002] In today's information age, data is often represented by a network graph model in various applications. Effectively analyzing the structural information and label information in graph data helps to discover the internal relationship of complex network data, which in turn helps to make effective use of the hidden information in graph data, including but not limited to scientific search, personalized recommendation, etc. With the development of artificial intelligence technology, machine learning algorithms provide a general and effective means for data analysis and prediction. [0003] Due to the nature of graph data, it is difficult to perform data analysis directly on raw graphs. On the one hand, the traditional way of storing graph data in an adjacency matrix is ​​difficult to directly use as the input of machine learning algorithms for data analysi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/901
CPCG06F16/9024
Inventor 宋春瑶郭佳雯袁晓洁
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products