A text matching method and device

The text matching technology, applied in the computer field, addresses problems such as inaccurate matching results when the same words appear at different positions in the two texts.

Active Publication Date: 2020-12-01
NEUSOFT CORP

AI Technical Summary

Problems solved by technology

[0004] For example, consider two texts, "the black cat is sitting on the yellow chair" and "the yellow cat is sitting on the black chair". If the above horizontal matching method is used to calculate their similarity, the two texts are completely consistent in the co-occurrence of word segments, so their similarity is 1, which means that they match completely. However, the key information "black" and "yellow" appears at different positions in the two texts, so the matching result obtained by horizontal matching is inaccurate.
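The failure mode in [0004] can be reproduced with a minimal sketch. The excerpt does not define "horizontal matching" precisely, so the sketch assumes it amounts to comparing the bags of word segments, here through cosine similarity over word counts, with whitespace splitting standing in for word segmentation. The helper name `bag_of_words_cosine` is hypothetical and does not come from the patent.

```python
from collections import Counter
import math


def bag_of_words_cosine(text_a: str, text_b: str) -> float:
    """Cosine similarity over word counts; word positions are ignored."""
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    # Counter returns 0 for missing keys, so words unique to one text add nothing.
    dot = sum(counts_a[word] * counts_b[word] for word in counts_a)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


s1 = "the black cat is sitting on the yellow chair"
s2 = "the yellow cat is sitting on the black chair"

# Both texts contain exactly the same words with the same counts,
# so the position-blind score is 1 up to floating-point rounding.
score = bag_of_words_cosine(s1, s2)

# Positions (0-based) at which the two texts actually differ.
differing_positions = [
    i for i, (a, b) in enumerate(zip(s1.split(), s2.split())) if a != b
]

print(round(score, 6), differing_positions)
```

The score cannot distinguish the two texts even though the words at positions 1 and 7 are swapped, which is the inaccuracy the paragraph describes.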



Examples


Embodiment 1

[0080] Figure 1A is a schematic flowchart of the text matching method provided in this embodiment. The text matching method includes the following steps:

[0081] S101: Obtain the first text and the second text to be matched.

[0082] For ease of distinction, in this embodiment the two texts to be matched are defined as the first text and the second text. For example, in a search scenario, the text entered by the user can be used as the first text and each text in the corpus as the second text; alternatively, each text in the corpus can be used as the first text and the text entered by the user as the second text. As another example, in an FAQ system, the question raised by the user is used as the first text and each question in the FAQ library as the second text; alternatively, each question in the FAQ library is used as the first text and the question entered by the user as the second text.

[0083] In order to facilitate th...

Embodiment 2

[0096] Figure 1B is a schematic flowchart of the text matching method provided in this embodiment. This embodiment mainly describes S104 of the first embodiment in detail, and specifically includes the following steps:

[0097] S1041: Construct a feature matrix whose number of rows is the total number of first word segments and whose number of columns is the total number of second word segments.

[0098] Determine the total number Row of first word segments in the first text and the total number Col of second word segments in the second text; then construct the feature matrix M, whose numbers of rows and columns are Row and Col, respectively.
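As a rough illustration of S1041, the sketch below builds a matrix with Row rows and Col columns. The excerpt is truncated before it states how each feature value is computed, so the exact-match indicator `exact_match` is only a placeholder assumption, and `build_feature_matrix` is a hypothetical name. Whitespace splitting stands in for word segmentation, and the shorter second text is invented for illustration so that Row and Col differ.

```python
from typing import Callable, List


def exact_match(first_segment: str, second_segment: str) -> float:
    """Placeholder feature value (assumption): 1.0 for identical segments, else 0.0."""
    return 1.0 if first_segment == second_segment else 0.0


def build_feature_matrix(
    first_segments: List[str],
    second_segments: List[str],
    similarity: Callable[[str, str], float] = exact_match,
) -> List[List[float]]:
    """Return a matrix with one row per first segment and one column per second segment."""
    return [
        [similarity(first, second) for second in second_segments]
        for first in first_segments
    ]


first_segments = "the black cat is sitting on the yellow chair".split()
second_segments = "the yellow cat".split()  # illustrative text, not from the patent

M = build_feature_matrix(first_segments, second_segments)
row_count = len(first_segments)   # Row
col_count = len(second_segments)  # Col

for segment, row in zip(first_segments, M):
    print(f"{segment:>8}", row)
```

Keeping the similarity function as a parameter lets a different per-pair measure replace the placeholder without changing the matrix layout.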

[0099] As shown in the schematic diagram of the feature matrix in Figure 2, continuing the above example, taking the number of word segments of the text S1 as the number of rows, and taking t...

Embodiment 3

[0121] Figure 4 is a schematic flowchart of the text matching method provided in this embodiment. This embodiment mainly describes S1045 of the first embodiment in detail, and specifically includes the following steps:

[0122] S401: If the magnitude of the feature value is positively correlated with the degree of similarity, then keep the feature value when it is greater than a first feature threshold.

[0123] In this embodiment, among the feature values in the feature matrix M, those corresponding to word pairs with higher similarity are retained. Specifically, when the value range of the feature value is [a1, b1] and the magnitude of the feature value is positively correlated with the degree of similarity, the first feature threshold may be δ1 = (b1 - a1) * θ1%, where θ1% is a value greater than 50%, for example 85%, and the feature values greater than δ1 are retained in the feature matrix M.
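The threshold rule of S401 can be sketched as follows. The code follows the formula δ1 = (b1 - a1) * θ1% exactly as written in [0123]. The excerpt does not say what happens to feature values that are not retained, so replacing them with `None` is an assumption made here, and the function names and sample matrix are hypothetical.

```python
from typing import List, Optional


def first_feature_threshold(a1: float, b1: float, theta1_percent: float) -> float:
    """delta1 = (b1 - a1) * theta1%, following the formula as written in [0123]."""
    return (b1 - a1) * theta1_percent / 100.0


def retain_high_values(
    matrix: List[List[float]], threshold: float
) -> List[List[Optional[float]]]:
    """Keep values strictly greater than the threshold.

    Values that are not retained become None (an assumption of this sketch).
    """
    return [
        [value if value > threshold else None for value in row]
        for row in matrix
    ]


# Feature values in [0, 1] with theta1% = 85%, so delta1 is 0.85.
delta1 = first_feature_threshold(a1=0.0, b1=1.0, theta1_percent=85.0)

M = [
    [1.0, 0.2, 0.9],
    [0.8, 0.5, 0.86],
]
filtered = retain_high_values(M, delta1)
print(delta1, filtered)
```

Only the entries 1.0, 0.9 and 0.86 exceed the threshold in this sample, so every other position is marked as not retained.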

[0124] For example, for i...


Abstract

The invention discloses a text matching method and device. The method comprises: performing word segmentation on a first text to obtain the first word segments, and performing word segmentation on a second text to obtain the second word segments; and determining the matching degree of the first text and the second text according to the order of the first word segments in the first text and the order of the second word segments in the second text. Because the order of the words in the texts is taken into account, the matching result is more accurate.

Description

Technical field

[0001] The present application relates to the field of computer technology, and in particular to a text matching method and device.

Background

[0002] In the field of text analysis, text matching plays an important role in many practical scenarios. For example, in a search scenario, when a user inputs a text to be matched, the system needs to search the corpus for content as similar as possible to that text and return the matching result to the user. As another example, in a Frequently Asked Questions (FAQ) system, the user asks a question, and the system needs to find the most similar question in the FAQ database and return the answer corresponding to that question. In these scenarios, the accuracy of text matching directly affects the user experience, so text matching is very important in text analysis.

[0003] The text matching process is gen...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F16/31
CPC: G06F16/3344, G06F40/284
Inventor: 董超, 崔朝辉, 赵立军, 张霞
Owner NEUSOFT CORP