A method and system for identifying Chinese homonymous events
A technology for identifying homonymous events in Chinese text, applied in the fields of instrumentation, computing, and electrical digital data processing. It addresses problems such as insufficient versatility, conflicting classification results, and poor performance.
Examples
Example 1
[0080] Example 1: At 7 a.m. on December 14, 2012, more than 10 monkeys used monkey paws to cause an injury incident in the cornfield of Chenpeng Village. Four villagers were injured when they were scratched by the monkeys' paws. The monkeys that caused the incident were subsequently driven away by the police. So far, two villagers have been seriously injured. ... The group of monkeys had earlier broken into the residence of an elderly man who lived alone. When the monkeys attacked the old man, he resisted. After the old man was slightly injured, the monkeys rushed into the cornfield of Chenpeng Village.
[0081] Event annotation information can be generated by event extraction tools or manually, as shown in Example 2:
Example 2
[0082] Example 2: E1: Tri= SenID=1 Type=Attack Args={December 14th, 2012 at 7 am / TIME / Time; more than 10 monkeys / PER / Attacker; monkey paw / WEA / Instrument; Chenpeng Village cornfield / LOC / Place} Polarity=True Tense=Past
[0083] E2: Tri=Scratch SenID=2 Type=Attack Args={Villager / PER / Target; Monkey Paw / WEA / Instrument} Polarity=True Tense=Past
[0084] E3: Tri=Injured SenID=2 Type=Injure Args={Villager / PER / Victim; Monkey Paw / WEA / Instrument} Polarity=True Tense=Past
[0085] E4: Tri=Assault SenID=3 Type=Attack Args={Monkey / PER / Attacker} Polarity=True Tense=Past
[0086] E5: Tri=Drive SenID=3 Type=Arrest Args={Civil Police / PER / Agent; Monkey / PER / Person} Polarity=True Tense=Past
[0087] E6: Tri=Serious Injury SenID=4 Type=Injure Args={Current / TIME / Time; Villager / PER / Victim} Polarity=True Tense=Past
[0088] E7: Tri=Intrusion SenID=9 Type=Transport Args={Monkey / PER / Artifact; Residence / LOC / Place} Polarity=True Tense=Past
[0089] E8: Tri=Attack SenID=10 Type=Attack Args={Monkey / PER / Attacker; O...
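The annotation lines above follow a regular `Key=Value` layout, so they can be loaded into a small data structure. The following is a minimal sketch under that assumption; the names `Argument`, `Event`, and `parse_event` are hypothetical and do not come from the patent, and the pattern only covers the fields visible in Example 2.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Argument:
    text: str         # argument mention, e.g. "Villager"
    entity_type: str  # entity type, e.g. "PER"
    role: str         # role in the event, e.g. "Target"


@dataclass
class Event:
    event_id: str
    trigger: str      # may be empty if the annotation omits it
    sentence_id: int
    event_type: str
    arguments: List[Argument]
    polarity: bool
    tense: str


# Hypothetical pattern inferred from the layout of the lines in Example 2.
_EVENT_RE = re.compile(
    r"(?P<id>E\d+):\s*Tri=(?P<tri>.*?)\s*SenID=(?P<sen>\d+)\s+"
    r"Type=(?P<type>\w+)\s+Args=\{(?P<args>.*?)\}\s*"
    r"Polarity=(?P<pol>\w+)\s+Tense=(?P<tense>\w+)"
)


def parse_event(line: str) -> Event:
    """Parse one annotation line; a leading paragraph number is ignored."""
    m = _EVENT_RE.search(line)
    if m is None:
        raise ValueError(f"not an event annotation: {line!r}")
    arguments = []
    for item in m["args"].split(";"):
        # Each argument is "mention / entity type / role".
        text, entity_type, role = (p.strip() for p in item.rsplit("/", 2))
        arguments.append(Argument(text, entity_type, role))
    return Event(
        event_id=m["id"],
        trigger=m["tri"].strip(),
        sentence_id=int(m["sen"]),
        event_type=m["type"],
        arguments=arguments,
        polarity=(m["pol"] == "True"),
        tense=m["tense"],
    )
```

For instance, `parse_event` applied to the E2 line yields an `Event` with trigger `Scratch`, sentence ID 2, and two arguments. A truncated line such as E8 does not match the pattern and raises `ValueError`.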
Example 3
[0095]
[0096] This indicates that E1 and E2, E1 and E4, E2 and E4, and E3 and E6 are pairs of events that refer to the same event.
[0097] Figure 2 is a detailed flowchart of step S1 of the method for identifying Chinese homonymous events provided by a preferred embodiment of the present invention. As shown in Figure 2, step S1 of the method further includes the following steps.
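The pairwise relations listed in [0096] can be collected into groups of events that all refer to the same occurrence. The sketch below assumes the relation is transitive and uses a union-find structure for the grouping; this is an illustrative choice, not a step stated in the patent, and the function name `group_same_events` is hypothetical.

```python
def group_same_events(pairs):
    """Merge same-event pairs into sorted groups using union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for event_id in list(parent):
        groups.setdefault(find(event_id), []).append(event_id)
    # Lexicographic sort; adequate for the single-digit IDs used here.
    return sorted(sorted(group) for group in groups.values())


pairs = [("E1", "E2"), ("E1", "E4"), ("E2", "E4"), ("E3", "E6")]
print(group_same_events(pairs))  # [['E1', 'E2', 'E4'], ['E3', 'E6']]
```

Under the transitivity assumption, the four pairs reduce to two groups: E1, E2, and E4 describe the attack, and E3 and E6 describe the injury.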
[0098] S101. Invoke a word segmentation tool to segment each event sentence in the homonymous-event annotation text and in the test text, obtaining a segmented annotation set and a segmented test set in which words are separated by spaces.
[0099] For example, the event sentence "At 7 o'clock in the morning on December 14, 2012, more than 10 monkeys used monkey paws to create a wounding case in the cornfield of Chenpeng Village." becomes, after word segmentation:
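The patent does not name the word segmentation tool. As a stand-in, the sketch below uses forward maximum matching against a small lexicon to show the kind of space-separated output that S101 describes. The function `segment`, the toy lexicon, and the sample sentence are all hypothetical; a production system would call a real segmenter instead.

```python
def segment(sentence, lexicon, max_len=4):
    """Forward maximum matching: take the longest lexicon word at each position.

    Characters not covered by the lexicon are emitted as single-character tokens.
    Returns the tokens joined by spaces.
    """
    tokens = []
    i = 0
    while i < len(sentence):
        longest = min(max_len, len(sentence) - i)
        for size in range(longest, 0, -1):
            candidate = sentence[i:i + size]
            if size == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += size
                break
    return " ".join(tokens)


# Toy lexicon and sentence, for illustration only.
lexicon = {"猴子", "闯入", "民宅"}
print(segment("猴子闯入民宅", lexicon))  # 猴子 闯入 民宅
```

Applying the same function to every event sentence in the annotation text and in the test text would give the two space-separated sets that S101 refers to.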