Check patentability & draft patents in minutes with Patsnap Eureka AI!

Method and device for mining frequent sub-graphs of single graph

A frequent subgraph and individual technology, which is applied in the fields of instruments, computing, and electrical digital data processing, etc., can solve problems affecting the mining efficiency of frequent subgraph algorithms, and achieve the effect of avoiding subgraph growth and repeated searches

Pending Publication Date: 2022-05-13
电科云(北京)科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, the existing massive data frequent subgraph mining algorithm engine is not a dedicated engine for graph computing optimization, which greatly affects the mining efficiency of frequent subgraph algorithms

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for mining frequent sub-graphs of single graph
  • Method and device for mining frequent sub-graphs of single graph
  • Method and device for mining frequent sub-graphs of single graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0074] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings. Here, the exemplary embodiments and descriptions of the present invention are used to explain the present invention, but not to limit the present invention.

[0075] First, the terms that may be involved in this description are explained as follows:

[0076] Canonical Adjacency Matrix (CAM): Given the adjacency matrix M of a single graph G, it is generated by splicing the lower triangular elements and diagonal elements of the adjacency matrix M in order from top to bottom and from left to right The string is called the coding sequence of the matrix M, denoted as Code(M). Due to the different sorting of the nodes (graph nodes) of the diagonal elements, the same single graph G will generate multiple coding sequences, and fur...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a frequent subgraph mining method and device for a single graph, and the method comprises the steps: generating a standard adjacency matrix according to a dictionary sorting result of node labels of the single graph, and numbering the nodes of each graph in sequence; an initial suboptimal normative adjacency matrix tree is generated through the normative adjacency matrix, leaf nodes comprise a first number of edges, and a CSP search space is a dictionary sorting sequence combination of numbers of graph nodes corresponding to node labels contained in the leaf nodes; performing FFSM-Join operation or FFSM-Extension operation on the leaf nodes according to the standard adjacency matrix, and increasing a sub-graph to obtain child nodes of which one edge is expanded; taking the child nodes as candidate sub-graphs, and constructing a CSP search space of the child nodes according to a sub-graph growth mode; if the valid number of the search spaces is smaller than a set support degree threshold value, marking the candidate sub-graphs as invalid sub-graphs; and if the increase is not completed, continuing to perform sub-graph increase, and if the sub-graph increase is completed, outputting a frequent sub-graph. Through the scheme, the frequent subgraph mining efficiency can be improved.

Description

technical field [0001] The invention relates to the technical field of data mining, in particular to a method and device for mining frequent subgraphs of a single graph. Background technique [0002] With the rapid development of big data technology, the use of graph structure to describe data is gradually applied to massive data. Traditional big data analysis technologies usually have a relatively general analysis engine based on SQL or SQL-like tabular analysis tools, but due to the complexity and particularity of relational storage, massive graph data often requires a dedicated calculation and analysis engine to achieve . [0003] A graph is a high-level abstraction of a structure. Frequent subgraph mining is one of the key technologies of graph mining. It has a wide range of applications in social networks, intelligence mining, bioengineering, communication network optimization, text mining, and knowledge reasoning, such as protein structure analysis, link prediction, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/2458
CPCG06F16/2465
Inventor 田群戴永恒李荣华李艳斌潘敏佳刘学谦
Owner 电科云(北京)科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More