Unlock instant, AI-driven research and patent intelligence for your innovation.

A fuzzy query method and system based on big data

A technology of fuzzy query and big data, applied in the direction of text database query, electronic digital data processing, special data processing application, etc.

Active Publication Date: 2018-11-02
山东合天智汇信息技术有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] If a traditional relational database is used to process data, the performance cannot be supported, and the operation is extremely slow, especially for fuzzy queries, which often take a long time to return the query results
[0006] Use luncen-like technology for fuzzy query. Since luncen uses word segmentation algorithm technology, it can only separate words, and can only query according to the words it separates. Sometimes, the fuzzy query is not a word, but just two words that are close together. character, at this time, it cannot be queried

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A fuzzy query method and system based on big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0035] In the existing method, because of the big data environment, the performance of the traditional relational database cannot be handled. However, fuzzy queries on platforms that specialize in processing big data generally rely on word segmentation algorithms. These word segmentation algorithms can only separate some commonly used words, place names, and personal names, but not all words. words, fuzzy queries cannot be performed. And utilize all possible words to be separated in advance, just can avoid the situation that can't find out. For example, "Shandong Hetian Zhihui Information Co., Ltd.", the word segmentation algorithm may only be able to separate out the words "Shandong", "Information", "Company", and "Limited", which leads you to query "Donghe", "He When you search for words such as "Tianzhi", "Information", and "Information Co., Ltd.",...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a fuzzy query method and system based on big data. The upper and lower thresholds of the query data length are set to determine the data length requiring fuzzy query. If the data length is less than the upper limit threshold, the upper limit threshold is set to the data length, from At the beginning of each character, the data that needs fuzzy query is segmented with the length of the set data length lower limit threshold to form a set of segmented phrases; the entered data is continued to be segmented according to the length of each data length lower limit threshold plus 1, Until the length is equal to the upper threshold, put all the segmented phrases into the segmented phrase set; for the phrases in the segmented phrase set, query whether the node corresponding to the word exists from the graph database, and if it exists, get the node. If it does not exist, then create a new node corresponding to the word in the graph database, and create a connection line from the node to the attribute node in the graph database; the present invention can realize "precise" fuzzy query on the data, and the situation that the query cannot be found will not occur.

Description

technical field [0001] The invention relates to a fuzzy query method and system based on big data. Background technique [0002] With the rapid development of the Internet in recent years, the Internet has become more and more popular, the content on the Internet has also exploded, and the threshold for people to obtain the content they need from the Internet has become lower and lower, which has also spawned many "Gold diggers" analyze potential and valuable data, intelligence, laws and other content from the massive content of the Internet. [0003] Whether it is in the traditional IT era or in the Internet era, to develop various management and analysis systems, fuzzy query is generally required, that is, to query the data containing the entry according to a certain word. In the traditional IT era, due to the small amount of data, we generally use relational databases to store data. To perform fuzzy queries, we can directly use the relational database to provide the "lik...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/316G06F16/33
Inventor 高军田立娜王可鑫段文良
Owner 山东合天智汇信息技术有限公司