Large-scale graphic constraint linkage path searching algorithm in high-dimensional vector space

A vector space and connection path technology, applied in the field of big data, can solve problems such as high computing costs, achieve the effects of reducing computational complexity, reducing text copy costs and path calculation costs, and quickly answering and responding

Inactive Publication Date: 2018-03-23
LIAONING UNIVERSITY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problem that the analysis operation of a large-scale graph data set in a high-dimensional space requires high calculation costs, the present invention provides an algorithm with low computational complexity and high execution capability in a large-scale graph constrained connection path in a high-dimensional vector space query algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Large-scale graphic constraint linkage path searching algorithm in high-dimensional vector space
  • Large-scale graphic constraint linkage path searching algorithm in high-dimensional vector space
  • Large-scale graphic constraint linkage path searching algorithm in high-dimensional vector space

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The steps of the present invention are as follows:

[0044] Step 1 projection deletion.

[0045] Step 1-1 Select the dimensions of the projection

[0046] The determination of node projection adopts probability selection, divides the vector space into several cells, randomly selects n nodes in the entire vector space, and randomly selects a candidate node in the adjacent cell where the node is located, and calculates the n The projected distance from n nodes to candidate nodes is determined by the dimension with the largest distance value, and the dimension with the largest distance value from the n nodes is naturally summarized, and finally the dimension with the most occurrences is selected as the selected dimension.

[0047] Step 1-2 Delete and select nodes

[0048] In the high-dimensional data space, randomly select k nodes to project onto the dimension selected in step 1-1, such as figure 2 As shown, the projection distance of the point on the D2 dimension has ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a large-scale graphic constraint linkage path searching algorithm in a high-dimensional vector space. The algorithm mainly includes the following steps of projection screening; dimensionality reduction deletion and selection; node deletion and selection; node subset high-dimensional spatial integration, wherein constraint linkage paths are output. By means of the algorithm, a node similarity linkage algorithm based on distance in the vector space is improved, a gridding technique and constraint linkage characters are introduced, a great quantity of invalid nodes are screened out, and the calculation complexity is reduced; a Map-Reduce framework is introduced into the algorithm, the path of each node is calculated by means of a four-stage deletion and selection strategy, an optional node set meeting the constraint conditions is found, a result set is called circularly to return to a reachable constraint path, unnecessary text copying costs and unnecessary path calculation costs are reduced, and the algorithm has high efficiency and a low error rate. Meanwhile, certain improvement and innovation are conducted on the algorithm during processing of the high-dimensional data space, unnecessary calculation processing is decreased through various dimension reduction means, the processing execution capability of the algorithm is improved, and therefore the algorithm can give answers and make responses to the questions of users more quickly.

Description

technical field [0001] The invention belongs to the field of big data, and in particular relates to the design of a processing method for a large-scale graph data set in a vector space, in particular to the design of a query algorithm for a large-scale graph constraint connection path in a high-dimensional vector space. Background technique [0002] In recent years, with the continuous development of new services such as big data and cloud computing, applications related to spatial location have gradually increased, and the scale of spatial map data is growing and accumulating at an unprecedented rate. How to find satisfying users in high-dimensional spatial map datasets? The optimal path result set for demand is a research hotspot in the field of large-scale graphs. The Map-Reduce framework provides effective means for batch processing of large-scale data. In the Map stage, according to the self-similarity of the high-dimensional vector space data set, the size of the rela...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/29
Inventor 丁琳琳宋宝燕王俊陆单晓欢陈博强张师文
Owner LIAONING UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products