Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Semantic Fuzzy Search Method Based on Sentence-Level Deep Learning Language Model

A language model and deep learning technology, applied in semantic analysis, unstructured text data retrieval, text database query, etc., can solve the problem that texts with different language structures cannot be matched, semantic similarity cannot be accurately calculated, and inefficient Measure semantic correlation and other issues to achieve the effect of improving recall rate, fast operation speed, and convenient invocation

Active Publication Date: 2022-04-01
深圳前海黑顿科技有限公司
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Most of the current text fuzzy search cannot solve the problem of semantic understanding of the text, which makes the search recall rate low, because if the problem of semantic understanding is not considered, many texts with the same meaning but different language structures cannot be matched;
[0005] 2. Most of the current fuzzy text searches are relatively inefficient. When searching for keywords or key sentences in relatively long texts, the efficiency is relatively low due to the use of violent methods to process the text;
[0006] 3. The current text fuzzy search cannot solve the problem of semantic deviation of keywords or key sentences caused by the context in the text when analyzing the semantic understanding of the text, which will reduce the recall rate of the search, and in the When analyzing semantic similarity, a relatively single similarity measurement index is used, which cannot accurately calculate the similarity between semantics, that is, it cannot efficiently measure the correlation between semantics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Semantic Fuzzy Search Method Based on Sentence-Level Deep Learning Language Model
  • A Semantic Fuzzy Search Method Based on Sentence-Level Deep Learning Language Model
  • A Semantic Fuzzy Search Method Based on Sentence-Level Deep Learning Language Model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0064] In the present invention, the sentence-level deep learning language model is used to solve the problem of text semantic understanding in fuzzy search scenarios, and it is extended to text information query and other scenarios, so that it can perform fuzzy query combined with semantics. Using the custom terminator mode to divide long text into tasks helps to vectorize the deep learning operation of long text, replacing the conventional loop traversal processing method, so as to ensure that each semantic matching task unit can be parallelized processing to improve the computing speed. It effectively solves the problem that fragment characters are interfered by surrounding characters in fuzzy search scenarios. If the target text Q is queried in the long text S, and Q itself is not a sentence with a complete structure; suppose that the two sentences Si and Sj in S are highly similar to Q, and Sj is slightly higher than Si. It is known that Sj contains a string segment Sg w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semantic fuzzy search method based on a sentence-level deep learning language model. In the present invention, the present invention has a high degree of ambiguity, and the present invention introduces a deep learning language model, which fully considers semantic issues, and can retrieve sentences with high semantic similarity with the target sentence, and use the method of hierarchically calculating semantic similarity Efficiently judge the semantic similarity between sentences; the operation speed is fast, and the vectorized processing is used instead of the conventional loop traversal to process text, ensuring that each semantic matching task unit can be processed in parallel, which greatly improves the search speed; The search recall rate is high, and the use of implication index makes the system more robust to grammatical interference, effectively improving the search recall rate; the system is flexible, and the present invention integrates mechanisms such as semantic understanding, fuzzy query, and precise information positioning , and then encapsulate the entire algorithm module with an interface, which is convenient for users to call.

Description

technical field [0001] The present invention relates to the relevant field of deep learning language model and fuzzy search, in particular to a semantic fuzzy search method based on sentence-level deep learning language model. Background technique [0002] Language models are widely used. As early as the 1970s, statistical language models have been successfully applied. In 2003, some scholars proposed to apply the concept of word vector to neural network-based language models. However, This kind of model has many parameters, and the training is more complicated; in 2010, some scholars proposed to apply the Recurrent Neural Network (Recurrent Neural Network) to the language model, which opened the wide use of deep learning in the language model, and later A series of excellent language models have emerged, such as: ELMO (Embedding from Language Models) model, Transformer model, and BERT (Bidirectional Encoder Representations from Transformer) model. Text fuzzy search is appl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33G06F40/30
CPCG06F16/3344
Inventor 裴正奇段必超黄梓忱朱斌斌段朦丽于秋鑫
Owner 深圳前海黑顿科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products