Chinese zero-anaphora resolution method and system based on Mask mechanism and twin network

A twin (Siamese) network technique for anaphora resolution, applied in the field of information processing, that addresses information redundancy and noise.

Pending Publication Date: 2020-08-04
SUZHOU UNIV

AI Technical Summary

Problems solved by technology

[0004] For this reason, the technical problem to be solved by the present invention is to overcome the problem of information redundancy and noise in the prior a

Method used

Examples

Embodiment 1

[0031] As shown in Figure 1, this embodiment provides a Chinese zero-anaphora resolution method based on the Mask mechanism and the twin network, including: Step S1: add a "[MASK]" mark at the position of the zero pronoun to obtain the completed sentence containing the zero pronoun; if the antecedent and [MASK] are in the same sentence, no splicing is performed; if the antecedent and [MASK] are not in the same sentence, the sentence containing the antecedent and the completed sentence containing the zero pronoun are spliced. Step S2: input the preprocessed sentence into the pre-trained BERT model to extract the first antecedent and the first zero pronoun. Step S3: integrate the attention mechanism into the BERT model; the first antecedent is processed by the first linear function to obtain the second antecedent; the first zero pronoun is processed by the second linear function, and the second zero pronoun is obtained after the third linear function in c...
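Step S1 can be illustrated with a minimal sketch. It assumes sentences are already tokenized into lists and that the antecedent sentence is placed before the completed zero-pronoun sentence when splicing; the source does not state the splicing order or any separator token. The function names `insert_mask` and `build_input` are hypothetical and are not taken from the patent.

```python
MASK = "[MASK]"

def insert_mask(tokens, zp_index):
    """Return a copy of `tokens` with [MASK] inserted at the zero-pronoun position."""
    return tokens[:zp_index] + [MASK] + tokens[zp_index:]

def build_input(sentences, zp_sent, zp_index, ante_sent):
    """Build the input for one (zero pronoun, candidate antecedent) pair.

    sentences: list of token lists
    zp_sent:   index of the sentence containing the zero pronoun
    zp_index:  token position of the zero pronoun inside that sentence
    ante_sent: index of the sentence containing the candidate antecedent
    """
    completed = insert_mask(sentences[zp_sent], zp_index)
    if ante_sent == zp_sent:
        # Antecedent and [MASK] share a sentence: no splicing.
        return completed
    # Different sentences: splice the antecedent sentence with the completed one.
    # Assumption: antecedent sentence first, no separator token.
    return sentences[ante_sent] + completed

# Illustrative data only: the subject of the second sentence is omitted.
sentences = [["小明", "买", "了", "书包"], ["很", "喜欢", "它"]]
spliced = build_input(sentences, zp_sent=1, zp_index=0, ante_sent=0)
same_sentence = build_input(sentences, zp_sent=1, zp_index=0, ante_sent=1)
```

Feeding only the one or two relevant sentences to the encoder, rather than the whole preceding context, is how the method described here limits redundant input.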

Embodiment 2

[0054] Based on the same inventive concept, this embodiment provides a Chinese zero-anaphora resolution system based on the Mask mechanism and the twin network. Its problem-solving principle is the same as that of the Chinese zero-anaphora resolution method based on the Mask mechanism and the twin network, so the details are not repeated here.

[0055] The Chinese zero-anaphora resolution system based on the Mask mechanism and the twin network described in this embodiment includes:

[0056] The Mask marking module is used to add the "[MASK]" mark at the position of the zero pronoun to obtain the completed sentence containing the zero pronoun, wherein if the antecedent and [MASK] are in the same sentence, no splicing is performed, and if the antecedent and [MASK] are not in the same sentence, the sentence containing the antecedent and the completed sentence containing the zero pronoun are spliced;

[0057] The input module is used to input the above-mentioned pre...

Abstract

The invention relates to a Chinese zero-anaphora resolution method and system based on a Mask mechanism and a twin network. The method comprises the following steps: adding a [MASK] mark at the position of a zero pronoun, wherein if the antecedent and the [MASK] are in the same sentence, no splicing is performed, and if the antecedent and the [MASK] are not in the same sentence, the sentence containing the antecedent and the completed sentence containing the zero pronoun are spliced; inputting the preprocessed sentence into a pre-trained BERT model to extract a first antecedent and a first zero pronoun; integrating an attention mechanism into the BERT model, and processing the first antecedent through a first linear function to obtain a second antecedent; for the first zero pronoun, in combination with preselected manual features, obtaining a second zero pronoun through the respective linear functions; and calculating the similarity between the second antecedent and the second zero pronoun, and outputting the antecedent with the highest similarity. The invention thereby avoids information redundancy and noise.
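The last two steps of the abstract (linear projections in two branches, then similarity-based selection) can be sketched as follows. This is a simplified illustration under stated assumptions: cosine similarity stands in for the unspecified similarity measure, the manual features are concatenated to the zero-pronoun vector before the third linear function, and plain Python lists stand in for BERT outputs. The names `linear`, `cosine`, and `pick_antecedent` are hypothetical.

```python
import math

def linear(layer, x):
    """Apply y = W x + b, where layer = (W, b) and W is a list of rows."""
    weights, bias = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def cosine(u, v):
    """Cosine similarity (an assumed choice of similarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def pick_antecedent(zp_vec, feats, candidates, first, second, third):
    """Return (name, score) of the candidate most similar to the zero pronoun.

    first:  layer for the antecedent branch
    second: layer applied to the zero-pronoun vector
    third:  layer applied after concatenating the manual features (assumption)
    """
    hidden = linear(second, zp_vec)
    zp_repr = linear(third, hidden + feats)  # list concatenation
    scores = {name: cosine(linear(first, vec), zp_repr)
              for name, vec in candidates.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]

# Toy weights and vectors, chosen only to make the example easy to follow.
identity = ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
ignore_feature = ([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]], [0.0, 0.0])
candidates = {"小明": [0.9, 0.1], "书包": [0.0, 1.0]}
best, score = pick_antecedent([1.0, 0.0], [1.0], candidates,
                              identity, identity, ignore_feature)
```

In a trained model the three layers would be learned and the input vectors would come from the BERT encoder; the toy values above only show how the highest-similarity candidate is selected.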

Description

Technical field

[0001] The invention relates to the technical field of information processing, in particular to a Chinese zero-anaphora resolution method and system based on a Mask mechanism and a twin network.

Background

[0002] Anaphora refers to the use of a referring expression in a text to refer back to a previously mentioned linguistic unit. In linguistics, the referring expression is called the anaphor, and the object or content referred to is called the antecedent. Anaphora is also a rhetorical term for the phenomenon of referring to the same word, person, or thing repeatedly within a passage or discourse. Anaphora resolution is the process of determining the relationship between anaphors and antecedents, and it is one of the key issues in natural language processing. Example 1: Xiao Ming likes his schoolbag very much. The first step is to detect that the pronoun "his" is an anaphor, and the second step is to determine that the antecedent is the entity "Xiao Ming", tha...

Claims

Application Information

IPC(8): G06F40/211, G06F40/221, G06F40/253, G06N3/04, G06N3/08
CPC: G06F40/211, G06F40/221, G06F40/253, G06N3/084, G06N3/044
Inventor: 孔芳, 葛海柱, 周国栋
Owner SUZHOU UNIV