Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese homodigital event recognition method and system

An event, Chinese technology, applied in the field of Chinese homonymous event recognition methods and systems, can solve the problems of insufficient language pertinence, inconsistent homonymous event chains, and insufficient versatility.

Active Publication Date: 2016-02-03
SUZHOU UNIV
View PDF3 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] At present, in the field of Chinese homologous event recognition, most methods use classifier-based machine learning methods and rule-based methods. These methods have the following problems: 1) Most Chinese homologous event recognition methods that use machine learning still use English homologous event recognition. method, the language is not specific enough
These characteristics make the method of identifying the same event in English lack of performance; 2) The machine learning method assumes that the event pairs are independent of each other, which may easily cause conflicts in classification results and inconsistent event chains; 3) The disadvantage of the rule method is that The construction cost of the rules is high, and the versatility is not enough to be used across domains

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese homodigital event recognition method and system
  • Chinese homodigital event recognition method and system
  • Chinese homodigital event recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0080] Example 1: At 7 am on December 14, 2012, more than 10 monkeys used monkey paws to create a wounding case in the corn field of Chenpeng Village. Four villagers were injured when they were scratched by the monkey's paw. Subsequently, the monkey who caused the wounding case was driven away by the police. So far, two villagers have been seriously injured. ...the group of monkeys once broke into the residence of an elderly man who lived alone. When the monkey attacked the old man, the old man resisted. After the old man was slightly injured, the monkey rushed into the cornfield of Chenpeng Village.

[0081] Event annotation information can be generated by event extraction tools or manually, as shown in Example 2:

example 2

[0082] Example 2: E1:Tri=SenID=1Type=AttackArgs={December 14th, 2012 at 7 am / TIME / Time; more than 10 monkeys / PER / Attacker; monkey paw / WEA / Instrument; Chenpeng village corn Ground / LOC / Place}Polarity=TrueTense=Past

[0083] E2: Tri=scratch SenID=2Type=AttackArgs={village / PER / Target; monkey paw / WEA / Instrument}Polarity=TrueTense=Past

[0084] E3: Tri=Injured SenID=2Type=InjureArgs={Villager / PER / Victim; Monkey Paw / WEA / Instrument}Polarity=TrueTense=Past

[0085] E4: Tri=Assault SenID=3Type=AttackArgs={Monkey / PER / Attacker}Polarity=TrueTense=Past

[0086] E5: Tri=Drive SenID=3Type=ArrestArgs={Civil Police / PER / Agent; Monkey / PER / Person}Polarity=TrueTense=Past

[0087] E6: Tri=Seriously Injured SenID=4Type=InjureArgs={Current / TIME / Time; Villager / PER / Victim}Polarity=TrueTense=Past

[0088] E7: Tri=Intrusion SenID=9 Type=TransportArgs={Monkey / PER / Artifact; Residence / LOC / Place} Polarity=TrueTense=Past

[0089] E8: Tri=Attack SenID=10Type=AttackArgs={monkey / PER / Attacker; old man / PER / Targ...

example 3

[0095]

[0096] Indicates that E1 and E2, E1 and E4, E2 and E4, E3 and E6 are the same events.

[0097] figure 2 It is the decomposed flowchart of step S1 of the method for identifying Chinese homonymous events provided by a preferred embodiment of the present invention. Such as figure 2 As shown, step S1 of the method for identifying Chinese homonymous events provided by a preferred embodiment of the present invention further includes the following steps.

[0098] S101. Invoke a word segmentation tool to segment words for each event sentence in the annotation text of the same index and the test text, and obtain a word segmentation annotation set and a word segmentation test set separated by spaces.

[0099] For example: the event sentence "at 7 o'clock in the morning on December 14, 2012, more than 10 monkeys used monkey paws to create a wounding case in the cornfield of Chenpeng Village." After word segmentation, it becomes:

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese homodigital event recognition method and system. The method comprises: carrying out word segmentation, entity recognition and syntactic analysis on each sentence containing an event in a homodigital labelled text and a test text to obtain a preprocessing labelled text set and a preprocessing test text set, and extracting events of the same event type and feature information thereof in the preprocessing labelled text set and the preprocessing test text set with one document as a unit to obtain a labelled text feature set and a test text feature set; training a homodigital event recognition model according to features of each event pair in the labelled text feature set; using the homodigital event recognition model to determine whether there is a homodigital relation among the event pair corresponding to each feature in the test text feature set, and obtaining a first event homodigital set; and carrying out global optimization on homodigital event results initially recognized in the first event homodigital set with one document as a unit. Therefore homodigital event recognition performance is improved.

Description

technical field [0001] The invention belongs to the field of natural language processing, and in particular relates to a method and system for recognizing Chinese homonyms among events. Background technique [0002] Event (Event) is a main form of information representation. It is an objective fact (also called "natural event") of specific people, things, and things interacting at a specific time and a specific place, such as human injury and death events. and food additive incidents, etc. An article often contains many events, and there are various relationships between these events. When two events point to the same event ontology, it is considered that the two events have a co-reference (or coreference) relationship. For example: [0003] Example 1: The heads of state of the two countries held talks in Paris today. ... The two sides discussed the issue of peace in the Middle East during the talks. [0004] Example 2: The financial crisis broke out in the United State...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27
Inventor 李培峰朱巧明周国栋朱晓旭
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products