Mixed attention mechanism text title matching method based on multi-task learning

A multi-task learning and matching method technology, applied in the field of text title matching based on multi-task learning mixed attention mechanism, can solve the problems that the detection mechanism cannot be effectively detected, cannot be applied to application scenarios, and the number of words contained in the sensitive word dictionary is limited. , to avoid text noise, improve detection accuracy and accuracy, and reduce manual operations

Active Publication Date: 2021-06-15
CHENGDU UNIV OF INFORMATION TECH
View PDF13 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Loose detection mechanism cannot effectively detect "headline party" text
Most of the "headline party" texts use legal words, and the number of words contained in the dictionary of sensitive words is limited. This method cannot be applied to this application scenario in practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mixed attention mechanism text title matching method based on multi-task learning
  • Mixed attention mechanism text title matching method based on multi-task learning
  • Mixed attention mechanism text title matching method based on multi-task learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings. It should be understood that these descriptions are exemplary only, and are not intended to limit the scope of the present invention. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present invention.

[0042] A detailed description will be given below in conjunction with the accompanying drawings.

[0043]For longer texts and shorter titles, most of the words in the text are irrelevant to the title, and the calculation of text similarity through text embedding vectors will be disturbed by a lot of noise. This program proposes a mixed attention strategy based on multi-task learning to extract key information from the te...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a mixed attention strategy text title matching method based on multi-task learning, and the multi-task learning of a model is embodied in that the model carries out a classification task 1 of an original text category and a classification task 2 of whether the text is a'title type 'article on an input text at the same time. The model is jointly trained through the multi-task learning model, and one task assists the other task in learning better parameters. According to the scheme, model parameters are adjusted by using back propagation of the classification task 1, so that the classification task 2 obtains better performance. According to the invention, key information is extracted from the text and is matched with the title, so that detection of the title type article is realized, and the detection precision and accuracy of the title type are obviously improved. According to the attention mechanism provided by the method, the correlation degree between each element and other elements can be calculated in one step, the calculation amount is small, and the efficiency is high.

Description

technical field [0001] The invention relates to the field of text processing, in particular to a multi-task learning-based mixed attention mechanism text title matching method. Background technique [0002] In the Internet age, based on the actual benefits brought by traffic hoarding and traffic incentives, the Internet text "headline party" has taken advantage of the trend, which has degraded the browsing experience of Internet users. And the platform where the "headline party" is active is experiencing the loss of users, which has an impact on the sustainable development of the platform. [0003] "Headline Party" refers to making eye-catching headlines on forums or media represented by the Internet to attract the audience's attention. The general term for purpose website editors, reporters, managers and netizens. The main behavior of "title party" is that the title of the post is seriously exaggerated, and the content of the post is usually completely irrelevant or has l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/284G06N3/04G06N3/08
CPCG06F16/35G06F40/284G06N3/084G06N3/047G06N3/044
Inventor 王维宽冯翱宋馨宇张学磊张举蔡佳志
Owner CHENGDU UNIV OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products