
Search word optimizing method based on graph data structure

An optimization method for graph data, applicable to network data retrieval, other database retrieval, network data indexing, and related areas. It addresses the problems of large data volume, reduced retrieval effectiveness, and increased retrieval workload, and has the effect of improving production efficiency and reducing the volume of collection tasks.

Status: Inactive. Publication date: 2016-05-11
HYLANDA INFORMATION TECH
2 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0002] With the development of Internet technology, data of all kinds has become vast: news, Weibo, forums, e-commerce, and so on. Some customers follow the developments of a certain event, some follow the word of mouth of a certain brand, and some are concerned about the reputation of a certain company. Accurately and quickly obtaining the data that customers really care about from the Internet requires search engines to filter the data. However, whether the search terms are appropriate directly affects the search results.
If the search terms contain too many useless words, search effectiveness drops, and there will be little relevant data or even zero results. If the search terms are too few to constrain the query, the amount of retrieved data will be too large and further data screening is required, which increases the retrieval workload.


Examples


Detailed Description of the Embodiments

[0014] The present invention will be described in detail below through specific embodiments.

[0015] The search term optimization method of the invention, based on a graph data structure, includes the following steps:

[0016] A. Extract multiple word sets and the relationships between these word sets from the rules of the graph. These original search terms are abstracted into N lines of AND/OR expressions;

[0017] B. Organize the word sets and the relationships between them: name each word set according to its line number and its position in the line, and merge the sets containing the same words;

[0018] C. Analyze each line of expression: count the number of occurrences of each word set and the number of words in each word set, look for word sets that contain few words yet cover a large number of expression lines, and assign each word set a weight according to these two dimensions; in the weight calculation formula, the coverage rate accounts for the main part. The high...
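
Steps A to C above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the rule syntax (parenthesised OR-groups joined by `AND`), the helper names `parse_line`, `organize`, and `weigh`, the reading of "merge" as merging identical word sets, and the weight formula (0.8 × coverage + 0.2 × brevity) are all hypothetical. The source says only that coverage dominates the weight and is truncated before giving the actual formula.

```python
from collections import defaultdict

# Hypothetical input: each rule line is an AND of OR-groups.
# This syntax is an assumption made for the sketch.
RULES = [
    "(phone|mobile) AND (price|cost|quote)",
    "(phone|mobile) AND (review)",
    "(laptop) AND (price|cost|quote)",
]

def parse_line(line):
    """Step A: split one AND/OR expression into a list of word sets."""
    groups = [g.strip().strip("()") for g in line.split(" AND ")]
    return [frozenset(w.strip() for w in g.split("|")) for g in groups]

def organize(rules):
    """Step B: name each set by line and position, merging identical sets."""
    merged = defaultdict(list)  # word set -> names such as "L1P2"
    for i, line in enumerate(rules, start=1):
        for j, words in enumerate(parse_line(line), start=1):
            merged[words].append(f"L{i}P{j}")
    return merged

def weigh(merged, n_lines, alpha=0.8):
    """Step C: weight each word set by coverage (dominant) and brevity.

    The formula alpha * coverage + (1 - alpha) * brevity is assumed.
    """
    weights = {}
    for words, names in merged.items():
        lines = {name.split("P")[0] for name in names}  # distinct lines covered
        coverage = len(lines) / n_lines
        brevity = 1 / len(words)  # fewer words gives a higher score
        weights[words] = alpha * coverage + (1 - alpha) * brevity
    return weights

merged = organize(RULES)
weights = weigh(merged, len(RULES))
best = max(weights, key=weights.get)
```

With these sample rules, the set `{phone, mobile}` appears in two of the three lines and has only two words, so it receives the highest weight under the assumed formula.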


Abstract

The invention discloses a search term optimization method based on a graph data structure. Multiple word sets and the relationships among them are extracted from the rules of the graph; the word sets and relationships are organized; every word set is named; the word sets and relationships are simplified into multi-line AND/OR expressions; every expression is analyzed; a weight is assigned to every word set; parts of speech are identified through word segmentation and inverse document frequency; the degree of association between the word sets and the topics is accurately analyzed; and the minimal search word sets with the highest relevance to the requirements can be rapidly extracted from tens of thousands of rules. With this optimization method, a relatively high recall rate is obtained in the indexing process; the logic expressions are covered as completely as possible; the word sets are kept to a minimum; the volume of collection tasks finally generated is reduced; and the production efficiency of the enterprise is improved.
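
The goal of covering the logic expressions as completely as possible with a minimal number of word sets resembles a set cover problem. The sketch below uses a greedy set cover heuristic, which is a swapped-in technique chosen for illustration; the source does not say which selection procedure the invention uses. It also assumes that an expression line counts as "covered" once at least one of its AND-ed word sets is selected for retrieval. The function name `greedy_cover` and the sample data are hypothetical.

```python
def greedy_cover(candidates, n_lines):
    """Pick word sets until every expression line is covered.

    candidates maps a word set (frozenset of words) to the set of
    line numbers whose expression contains that word set.
    Returns the chosen word sets and any lines left uncovered.
    """
    uncovered = set(range(1, n_lines + 1))
    chosen = []
    while uncovered:
        # Prefer the set covering the most uncovered lines; break ties
        # by fewer words, then alphabetically so the result is stable.
        best = min(
            candidates,
            key=lambda ws: (-len(candidates[ws] & uncovered), len(ws), sorted(ws)),
        )
        gained = candidates[best] & uncovered
        if not gained:
            break  # no candidate covers the remaining lines
        chosen.append(best)
        uncovered -= gained
    return chosen, uncovered

# Hypothetical sample: three expression lines and four word sets.
CANDIDATES = {
    frozenset({"phone", "mobile"}): {1, 2},
    frozenset({"price", "cost", "quote"}): {1, 3},
    frozenset({"review"}): {2},
    frozenset({"laptop"}): {3},
}

chosen, uncovered = greedy_cover(CANDIDATES, 3)
```

On this sample the heuristic selects `{phone, mobile}` first (two lines, two words) and then `{laptop}` for the remaining line, so two word sets cover all three expressions. A greedy choice is not guaranteed to be globally minimal, which is one reason to treat it as an approximation only.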

Description

Technical field

[0001] The invention relates to the technical field of Internet information collection, in particular to a search term optimization method based on a graph data structure.

Background

[0002] With the development of Internet technology, data of all kinds has become vast: news, microblogs, forums, e-commerce, and so on. Some customers follow the developments of a certain event, some follow the word of mouth of a certain brand, and some are concerned about the reputation of a certain company. Accurately and quickly obtaining the data that customers really care about from the Internet requires filtering the data through a search engine, but whether the search terms are appropriate directly affects the search results. Too many useless words in the search terms reduce search effectiveness, and there will be little relevant data or even zero results. If the search terms are too few to constrain the query, the amount of data retrieved ...


Application Information

IPC(8): G06F17/30
CPC: G06F16/90332, G06F16/951
Inventors: 涂君兰, 杨伟锋
Owner: HYLANDA INFORMATION TECH