Text matching method and system based on text similarity model

A text similarity technology, applied in natural language data processing, special data processing applications, instruments, etc., that addresses the problems of inaccurate similarity, high cost of manual collection, and low coverage of statistical methods, achieving simple collection, accurate text similarity, and rich model parameters.

Inactive Publication Date: 2019-03-12
AISPEECH CO LTD
AI Technical Summary

Problems solved by technology

[0005] The aim is to solve at least the following problems in the prior art: similarity is inaccurate when only the number of similar words between strings is considered, statistical methods have low coverage, and manual collection is costly.

Examples

Embodiment Construction

[0032] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0033] Figure 1 shows a flow chart of a text similarity model training method provided by an embodiment of the present invention. The method includes the following steps:

[0034] S11: Receive a thesaurus training set, perform word segmentation processing on each preset sentence in the thesaurus training set, and determine a text string of the p...

Abstract

The embodiment of the invention provides a text matching method based on a text similarity model. The method comprises the following steps: receiving text information and determining a feature vector of the text information, wherein the feature vector comprises at least a text string, a text phonetic alphabet, and a word vector; inputting the feature vector into the text similarity model; obtaining the feature similarity output by the text similarity model; and, according to the feature similarity, determining at least one preset statement that reaches a preset feature threshold as the matching text of the text information. The embodiment of the invention also provides a text matching system based on a text similarity model, and a training method and system for the text similarity model. By using a text similarity model that considers feature vectors of multiple dimensions, the embodiment determines the feature similarity between the user's input statement and each preset statement in the model, and thereby determines the matching text with relatively high precision.
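
The matching flow in the abstract can be sketched as follows. This is a minimal illustration, not the patented model: the `Features` class, the fixed weights, the toy word vectors, and the use of pinyin as the phonetic alphabet are all assumptions made for the example, and a hand-weighted sum stands in for the trained text similarity model.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from math import sqrt


@dataclass
class Features:
    """Hypothetical feature bundle: text string, phonetic spelling, word vector."""
    text: str
    phonetic: str
    vector: list[float]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def feature_similarity(query, preset, weights=(0.4, 0.3, 0.3)):
    """Stand-in for the trained model: a fixed weighted sum of three scores.

    The weights are illustrative assumptions; the source learns its parameters.
    """
    text_sim = SequenceMatcher(None, query.text, preset.text).ratio()
    phon_sim = SequenceMatcher(None, query.phonetic, preset.phonetic).ratio()
    vec_sim = cosine(query.vector, preset.vector)
    w_text, w_phon, w_vec = weights
    return w_text * text_sim + w_phon * phon_sim + w_vec * vec_sim


def match(query, presets, threshold=0.8):
    """Return (score, text) pairs for presets at or above the threshold, best first."""
    scored = [(feature_similarity(query, p), p.text) for p in presets]
    return sorted((pair for pair in scored if pair[0] >= threshold), reverse=True)


# Illustrative data: the query contains a homophone typo of the first preset.
presets = [
    Features("长江大桥", "chang jiang da qiao", [1.0, 0.0, 0.2]),
    Features("黄河大桥", "huang he da qiao", [0.0, 1.0, 0.2]),
]
query = Features("长江大乔", "chang jiang da qiao", [0.9, 0.1, 0.2])

for score, text in match(query, presets):
    print(f"{text}: {score:.3f}")
```

In this sketch the phonetic feature compensates for the character-level mismatch, so the mistyped query still clears the threshold for the intended preset while the unrelated preset does not.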

Description

Technical field

[0001] The invention relates to the field of natural language processing, in particular to a text matching method and system based on a text similarity model.

Background

[0002] Text similarity calculation is a basic problem in natural language processing, and many fields rely on text similarity algorithms. In daily life, because of colloquial phrasing, input method behavior, or typing slips, a user's description is not as standard as formal text, yet it still contains the information the user wants. Capturing this weak information accurately requires a text similarity algorithm. For example, the user inputs "Where is the Yangtze River Bridge?" In fact, the user really wants to ask "Where is the Yangtze River Bridge". How to search for "Yangtze River Bridge" in the preset corpus according to "Where is the Yangtze River Bridge?" is an important application scenario of the text simil...

Application Information

IPC(8): G06F16/332, G06F17/27, G10L15/26
CPC: G10L15/26, G06F40/284
Inventor 朱钦佩
Owner AISPEECH CO LTD