Unlock instant, AI-driven research and patent intelligence for your innovation.

Text retrieval matching method and system

A matching method and text technology, applied in the field of text retrieval matching methods and systems, can solve the problems of not taking into account the limitation of encoder length, model length limitation, lack of topic similarity between sentences, etc., to enhance understanding and improve effect Effect

Active Publication Date: 2022-05-03
ZHEJIANG LAB
View PDF8 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although such a method simply and effectively learns the semantic relationship between sentences, there are two problems: 1) lack of similarity of topics between sentences, 2) the model has a length limit
[0004] In short text matching, the current technology often only considers the semantic similarity between sentences and ignores the correlation between the topics, and when dealing with long text matching, the past method is to stitch all sentences together, not only Does not take into account the limit of the length of the encoder, but also misses the semantic interaction information between sentences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text retrieval matching method and system
  • Text retrieval matching method and system
  • Text retrieval matching method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the object, technical solution and technical effect of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0052] Such as figure 1 and figure 2 As shown, a text retrieval matching method, based on contrastive learning, graph convolutional neural network and layered coding, includes the following steps:

[0053] Step 1: collect the existing Chinese natural language inference text corpus in different fields, and use it as the training corpus for the sentence representation model.

[0054] The existing text corpus of Chinese natural language reasoning in different fields is collected through the network. The text corpus includes the AFQMC ant financial semantic similarity corpus, the CMNLI language reasoning task corpus, and the CNSD Chinese natural language translated from the English data set through translation. Language reasoning data set, LCQ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of artificial intelligence, and relates to a text retrieval matching method and system.The method comprises the steps that firstly, Chinese natural language reasoning text corpora in different fields are collected to serve as training corpora of a sentence representation model; step 2, training a sentence representation model in combination with a contrast learning method, and testing and screening out an optimal sentence representation model by using a sentence semantic matching reference data set; 3, performing similarity calculation on sentences in the long and short texts to be matched by using the screened optimal sentence representation mode; and 4, according to a similarity calculation result, obtaining matching scores of the sentences by adopting a Sigmod function to judge whether the sentences are similar sentences or not, and completing text retrieval. According to the length of the text and the characteristics of various models, the most suitable model architecture is used for carrying out the task of text retrieval, the problems that similarity of themes between sentences is lacked to be considered in text matching, the model length is limited and the like are solved, and the matching effect is improved.

Description

technical field [0001] The invention belongs to the field of artificial intelligence, and in particular relates to a text retrieval matching method and system. Background technique [0002] Text matching is a core task in natural language processing. Many natural language processing tasks can be abstracted into text matching problems. Whether in dialogue systems, recommendation systems, or search engines, text matching is essential. [0003] In the retrieval model, the traditional text matching method is to directly calculate the relevance ranking based on keyword retrieval or BM25 and other algorithms, but the disadvantage of this method is that it needs to maintain a large number of thesaurus databases and matching rules. Later, latent semantic analysis technologies such as LSA and LDA gradually appeared, trying to use the latent semantics hidden in the document to match the text, and can achieve better results than direct keyword matching. With the rise of deep learning,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/335G06F16/33G06F40/284G06N3/04G06N3/08
CPCG06F16/335G06F16/3344G06N3/08G06F40/284G06N3/048G06N3/045
Inventor 李太豪黄剑韬阮玉平张晓宁郑书凯
Owner ZHEJIANG LAB