Text similarity calculation system and method based on distance perception self-attention mechanism and multi-angle modeling

A text similarity and distance perception technology, applied in the field of computer text information processing, can solve the problems of ignoring the single semantic importance of text and affecting the accuracy of the final text, so as to improve the performance of similarity calculation, improve the accuracy of similarity calculation, performance-efficient effects

Pending Publication Date: 2022-06-07
NORTHWEST UNIV(CN)
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, although this type of method captures the alignment relationship between two texts, as the model continues to deepen, when extracting the interaction information between the two texts, the importance of the single semantics of the text is ignored to a certain extent.
[0007] In summary, although the above method can capture the semantic vector of the text, and then calculate the similarity between the texts, it is difficult to extract the deeper semantic features while extracting the interaction information between the two texts. In addition, from various perspectives The similarity measurement of the text vector after feature extraction will also affect the accuracy of the final text to similarity calculation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text similarity calculation system and method based on distance perception self-attention mechanism and multi-angle modeling
  • Text similarity calculation system and method based on distance perception self-attention mechanism and multi-angle modeling
  • Text similarity calculation system and method based on distance perception self-attention mechanism and multi-angle modeling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] see figure 1 , this embodiment provides a text similarity calculation system based on distance-aware self-attention mechanism and multi-angle modeling, at least including a word representation module, a context representation module, an enhanced semantic modeling module, a soft alignment module, Multi-angle similarity modeling module and similarity result output module, where:

[0041] Word representation module: used to input texts P and Q, to obtain the preprocessed text pair , and then use the pre-trained word embedding to obtain the text pair through the word vector of each word in it, And further fuse character embedding and exact matching tag features;

[0042] Context representation module: It is used to perform context encoding on the word vector of each word in the text pair , and obtain the encoded context representation vector of ;

[0043] Enhanced semantic modeling module: used to model text P and Q respectively, and extract deeper semantic features of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text similarity calculation system and method based on a distance perception self-attention mechanism and multi-angle modeling, for a text P and a text Q, text similarity calculation is completed by means of Word2vec, BiLSTM, the distance perception self-attention mechanism and multi-angle modeling, firstly Word2ve is used to obtain word representations of the text P and the text Q, and context representations are captured through the BiLSTM; the method comprises the following steps: firstly, extracting a text P and a text Q, then further extracting text features by utilizing a distance-aware self-attention mechanism, then carrying out interactive modeling on the text P and the text Q by utilizing an interactive attention mechanism, and finally, carrying out similarity modeling on the text P and the text Q from different levels by proposing a multi-angle similarity modeling method so as to obtain a final similarity. According to the method, through a self-attention mechanism based on distance perception and a multi-angle similarity modeling architecture, the text semantic modeling capability is enhanced, the accuracy of text similarity calculation is improved, and the model is simple in structure, efficient in performance, high in robustness and easy to popularize and use in practice.

Description

technical field [0001] The invention relates to the technical field of computer text information processing, relates to a method for calculating similarity between texts, and in particular relates to a text similarity calculating system and method based on distance-aware self-attention mechanism and multi-angle modeling. Background technique [0002] In recent years, with the vigorous development of the mobile Internet and the popularization of smart devices, people are more dependent on the Internet in their work and life. As a result, a large number of Internet-based service platforms have emerged, such as search engines, blogs, and intelligent customer service. These services bring convenience to people, but also generate massive data, especially text data, such as query and response data generated when people use search engines, question and answer data between customer service and users, and a large number of similarities in blogs Article data, etc. However, simply rel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F40/289G06F40/30G06K9/62G06N3/04G06N3/08
CPCG06F16/3335G06F16/3344G06F16/35G06F40/289G06F40/30G06N3/08G06N3/048G06N3/045G06F18/22G06F18/241G06F18/25
Inventor 孙霞邓瑶黄立张梦延陈嘉诚
Owner NORTHWEST UNIV(CN)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products