Unlock instant, AI-driven research and patent intelligence for your innovation.

Fuzzy query method and system based on big data

A fuzzy query and big data technology, applied in text database query, electronic digital data processing, special data processing applications, etc., can solve problems such as unsupportable performance, slow operation, and inability to query

Active Publication Date: 2016-04-06
山东合天智汇信息技术有限公司
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] If a traditional relational database is used to process data, the performance cannot be supported, and the operation is extremely slow, especially for fuzzy queries, which often take a long time to return the query results
[0006] Use luncen-like technology for fuzzy query. Since luncen uses word segmentation algorithm technology, it can only separate words, and can only query according to the words it separates. Sometimes, the fuzzy query is not a word, but just two words that are close together. character, at this time, it cannot be queried

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fuzzy query method and system based on big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0035] In the existing method, because of the big data environment, the performance of the traditional relational database cannot be handled. However, fuzzy queries on platforms that specialize in processing big data generally rely on word segmentation algorithms. These word segmentation algorithms can only separate some commonly used words, place names, and personal names, but not all words. words, fuzzy queries cannot be performed. And utilize all possible words to be separated in advance, just can avoid the situation that can't find out. For example, "Shandong Hetian Zhihui Information Co., Ltd.", the word segmentation algorithm may only be able to separate out the words "Shandong", "Information", "Company", and "Limited", which leads you to query "Donghe", "He When you search for words such as "Tianzhi", "Information", and "Information Co., Ltd.",...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a fuzzy query method and system based on big data. An upper threshold and a lower threshold of the length of query data are set; the length of data needing fuzzy query is determined; if the length of the data is smaller than the upper threshold, the upper threshold is set to be the data length, and the data needing fuzzy query is segmented according to the set lower threshold of the data length from each character to form a segmented phrase set; typed-in data continues to be segmented according to the length formed by adding 1 to the lower threshold of the data length each time till the length is equal to the upper threshold, and all segmented phrases are put into the segmented phrase set; whether nodes corresponding to the phrases in the segmented phrase set exist or not is queried from a graph database, if yes, the nodes are obtained, and if not, nodes corresponding to the phrases are newly built in the graph database, and connection lines between the nodes in the graph database and attribute nodes are created. By means of the method, precise fuzzy query of data can be achieved, and the situation that the data can not be query does not occur.

Description

technical field [0001] The invention relates to a fuzzy query method and system based on big data. Background technique [0002] With the rapid development of the Internet in recent years, the Internet has become more and more popular, the content on the Internet has also exploded, and the threshold for people to obtain the content they need from the Internet has become lower and lower, which has also spawned many "Gold diggers" analyze potential and valuable data, intelligence, laws and other content from the massive content of the Internet. [0003] Whether it is in the traditional IT era or in the Internet era, to develop various management and analysis systems, fuzzy query is generally required, that is, to query the data containing the entry according to a certain word. In the traditional IT era, due to the small amount of data, we generally use relational databases to store data. To perform fuzzy queries, we can directly use the relational database to provide the "lik...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/316G06F16/33
Inventor 高军田立娜王可鑫段文良
Owner 山东合天智汇信息技术有限公司