Subject-based community search method on heterogeneous information network

A technology of heterogeneous information network and search method, applied in digital data information retrieval, unstructured text data retrieval, instruments, etc., can solve the problem of high computing cost and achieve the effect of improving efficiency

Pending Publication Date: 2022-07-08
NANKAI UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the computational cost of community search through meta-path is very h

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Subject-based community search method on heterogeneous information network
  • Subject-based community search method on heterogeneous information network
  • Subject-based community search method on heterogeneous information network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] This embodiment provides a topic-based community search method on a heterogeneous information network. The heterogeneous information network is a complex graph data, including nodes and edges. The edges and nodes have different types. In the heterogeneous information network The types of edges and nodes are specified. The heterogeneous information network contains rich semantic information, and the nodes under a specific type contain textual description information, such as figure 2 Examples of heterogeneous information networks are given. The invention proposes to extract topics in heterogeneous information network text description information and apply a community search algorithm to search for topic relevance. Specifically, it includes the following steps:

[0046] S1. Extract the topic of the node carrying the text information from the text description information of the target heterogeneous information network, and then perform topic aggregation on the nodes of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a topic-based community search method on a heterogeneous information network, which comprises the following steps of: firstly, extracting topics of nodes carrying text information from text description information of a target heterogeneous information network, and then carrying out topic aggregation on nodes with the same type as a target query node; reconstructing the target heterogeneous information network according to a given meta-structure, returning a new heterogeneous information network with a smaller scale reconstructed according to the mode of the meta-structure, and returning a meta-path equivalent to the meta-structure; and finally, on the obtained new heterogeneous information network, according to the new meta-path and the input target query node, performing community search by adopting an existing method for performing community search according to the meta-path, and searching communities which are closely associated with the target query node and have similar themes.

Description

technical field [0001] The invention belongs to the technical field of graph data processing under big data, and in particular relates to a topic-based community search method on a heterogeneous information network. Background technique [0002] With the development of information technology, the data generated in daily applications has ushered in an explosive growth, and many real-world scenarios can be modeled as graphs. The nodes in the graph represent an entity, the edges between the nodes represent the relationship between the entities, and the entities may be associated with many attributes, labels and content. Graph has become a very effective model for describing real-world data, especially the heterogeneous information network is closest to the real scene, bibliographic information network, knowledge network, shopping network, etc., all belong to this category. In the context of today's big data era, community search is widely used in applications, such as temporar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/9536G06F16/33G06Q50/00G06K9/62
CPCG06F16/9536G06Q50/01G06F16/3346G06F16/3344G06F18/23
Inventor 宋春瑶李玉奇袁晓洁
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products