A Chinese zero pronoun resolution method and system
A technology of pronouns and Chinese, applied in the field of Chinese zero pronoun resolution method and system, can solve the problems of low accuracy rate of automatic syntactic analysis, zero pronoun recognition and resolution accuracy difficult to meet application standards, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0061] The present embodiment provides a method for dissolving Chinese zero pronouns, wherein the zero pronoun resolution actually includes two processes of zero pronoun identification and zero pronoun resolution; figure 1 shown, including:
[0062] S101. Obtain candidate zero pronoun markers by preprocessing the target corpus;
[0063] Further, said preprocessing the target corpus to obtain the candidate zero pronoun mark includes:
[0064] Divide the target data set according to the data set division method, and obtain the marks of zero pronouns on the training set, test set and verification set.
[0065] Specifically, the target data set is the OntoNotes5.0 data set, and the OntoNote5.0 is divided according to the data set division method of the CoNLL-2012Share Task coreference resolution evaluation task; wherein, the OntoNotes5.0 data set itself contains zero pronoun marks information, and CoNLL-2012 provides the training, verification, and testing three-part data set di...
Embodiment 2
[0113] The present embodiment provides a Chinese zero pronoun resolution system, such as Figure 5 shown, including:
[0114] The preprocessing module 110 is used to obtain the candidate zero pronoun mark by preprocessing the target corpus;
[0115] Further, the preprocessing module 110 includes:
[0116] The zero pronoun marking unit 111 is configured to divide the target data set according to the data set division method to obtain the marking of the zero pronouns on the training set, test set and verification set.
[0117] The zero pronoun identification module 120 is used to identify the position of the candidate zero pronoun; the result of the position identification is combined with the preset optimization rule to obtain the target zero pronoun;
[0118] Further, the zero pronoun recognition module 120 includes:
[0119] The context semantic feature acquisition unit 121 is used to use the word vector of the candidate zero pronoun context as input, and utilize the bidir...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com