
System and method for improving event extraction annotation efficiency, event extraction method and system

An event extraction technology, applied in special data processing applications, instruments, unstructured text data retrieval, etc. It addresses problems such as the sparse distribution of events, interference with annotation progress, and wasted annotation manpower, with the effect of reducing the amount of data and saving computation.

Active Publication Date: 2019-11-22
成都数联铭品科技有限公司

AI Technical Summary

Problems solved by technology

In practice, events are very sparsely distributed in the corpus to be annotated: annotators often have to read many texts that contain no events before reaching one that does. Reading large amounts of irrelevant text seriously slows annotation progress and wastes annotation manpower.



Examples


Embodiment 1

[0043] As shown in Figure 1, this embodiment provides a method for improving the efficiency of event extraction annotation, comprising the following steps:

[0044] Step 1: split several manually annotated texts (for example, 20-30 texts) into sentences at sentence-end punctuation marks such as the full stop, question mark, and exclamation mark; extract the annotated sentences from them; and replace the person, company, and institution names in the annotated sentences with PER, COM, and ORG respectively. The annotated texts may be a part of the texts to be annotated, or texts outside that set. If they are a part of the texts to be annotated, then "the texts to be annotated" in Step 3 below refers to the remaining part.
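Step 1 can be illustrated with a minimal Python sketch. It is not the patent's implementation: the function names, the way annotations are represented (a list of annotated snippets), and the hand-written entity dictionary standing in for a name-recognition tool are all assumptions made for illustration.

```python
import re

# Sentence-end symbols named in Step 1 (ASCII and full-width forms).
# Assumption: every full stop ends a sentence, so decimals would be split too.
SENTENCE = re.compile(r"[^.?!。？！]+[.?!。？！]?")


def split_sentences(text):
    """Split a text into sentences at sentence-end punctuation."""
    return [s.strip() for s in SENTENCE.findall(text) if s.strip()]


def annotated_only(sentences, annotated_snippets):
    """Keep sentences containing at least one annotated snippet.

    Hypothetical representation: annotations are given as plain substrings.
    """
    return [s for s in sentences if any(a in s for a in annotated_snippets)]


def mask_entities(sentence, entities):
    """Replace each entity mention with its placeholder (PER, COM, ORG).

    `entities` maps mention -> placeholder and stands in for the output of
    a name-recognition tool. Longer mentions are replaced first so that a
    short name nested inside a longer one does not break it.
    """
    for mention in sorted(entities, key=len, reverse=True):
        sentence = sentence.replace(mention, entities[mention])
    return sentence


text = ("Alice Chen joined Acme Corp. The weather was mild. "
        "Did Acme Corp acquire Beta Institute?")
entities = {"Alice Chen": "PER", "Acme Corp": "COM", "Beta Institute": "ORG"}

sentences = split_sentences(text)
kept = annotated_only(sentences, ["joined", "acquire"])
masked = [mask_entities(s, entities) for s in kept]
print(masked)
```

Replacing concrete names with placeholders means that two sentences describing the same kind of event with different participants end up with the same surface form.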

[0045] In this embodiment, an NER tool is used to replace the person names, company names, and organization names in the ...

Embodiment 2

[0072] Referring to Figure 5, this embodiment provides an event extraction method comprising the following steps:

[0073] Step 21: sort the texts to be extracted in descending order of the likelihood that they contain an event.

[0074] Step 22: perform event extraction only on a set number of the top-ranked texts to be extracted.
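Steps 21 and 22 amount to a rank-then-truncate operation, sketched below in Python. The sketch assumes a likelihood score has already been computed for each text; how that score is obtained is not covered here, and the function name `select_top_texts` and the sample scores are hypothetical.

```python
def select_top_texts(texts, scores, limit):
    """Return the `limit` texts most likely to contain an event.

    `scores[i]` is an assumed, precomputed likelihood that `texts[i]`
    contains an event.
    """
    if len(texts) != len(scores):
        raise ValueError("texts and scores must have the same length")
    # Step 21: sort in descending order of likelihood.
    ranked = sorted(zip(texts, scores), key=lambda pair: pair[1], reverse=True)
    # Step 22: keep only the set number of top-ranked texts.
    return [text for text, _ in ranked[:limit]]


texts = ["t1", "t2", "t3", "t4"]
scores = [0.1, 0.9, 0.4, 0.7]
top = select_top_texts(texts, scores, limit=2)
print(top)
```

Only the texts returned by this function would be passed on to event extraction; the rest are skipped, which is where the saving in computation comes from.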

[0075] This method is based on the same idea as Embodiment 1, so its execution can refer to the description of the method in Embodiment 1. For example, Step 21 is executed as follows:

[0076] Convert several annotated texts into a reference matrix composed of multidimensional vectors. Specifically, first split the annotated texts into sentences (a text contains one or more sentences), extract the annotated sentences from them, and let the number of annotated sentences be n; then replace the entity name existin...
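The shape of such a reference matrix can be illustrated with a short Python sketch: n annotated sentences become n row vectors of equal dimension. The excerpt is cut off before it says how sentences are turned into vectors, so the bag-of-words encoding below is purely an assumption for illustration, as is the function name `build_reference_matrix`.

```python
def build_reference_matrix(sentences):
    """Stack one vector per sentence into an n x d matrix (list of rows).

    Assumption: each sentence is encoded as whitespace-token counts over
    the vocabulary of all given sentences. The source does not specify
    the encoding in this excerpt.
    """
    vocab = sorted({tok for s in sentences for tok in s.split()})
    index = {tok: i for i, tok in enumerate(vocab)}
    matrix = []
    for s in sentences:
        row = [0] * len(vocab)
        for tok in s.split():
            row[index[tok]] += 1
        matrix.append(row)
    return vocab, matrix


# Annotated sentences after entity names were replaced by placeholders.
masked = ["PER joined COM", "COM acquired ORG"]
vocab, reference = build_reference_matrix(masked)
print(vocab)
print(reference)
```

Here n is 2 and each row has one entry per vocabulary token.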


Abstract

The invention relates to a method and a system for improving event extraction annotation efficiency. The method comprises sorting the texts to be annotated in descending order of the likelihood that they contain an event, so that during annotation only a set number of the top-ranked texts need to be annotated. Because the likelihood that each text contains an event is estimated in advance and the texts are ranked accordingly, the efficiency of event extraction annotation can be greatly improved.

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, in particular to a system and method for improving the efficiency of event extraction annotation, and an event extraction method and system.

Background

[0002] In the field of knowledge graphs, an event is an occurrence or state change that happens at a specific point in time or within a specific geographical range, and consists of one or more actions involving one or more roles. Event extraction refers to extracting event information of interest to users from natural language text and presenting it in a structured form, such as who (a person or organization) did what, when, and where. Event extraction annotation refers to manually marking, in the data, the event content that needs to be extracted. With this manually annotated data, an algorithmic model can learn how to automatically extract event elements and other cont...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/31, G06F16/335
CPC: G06F16/313, G06F16/335, Y02D10/00
Inventor: 罗镇权, 练睿, 唐远洋, 刘世林, 张发展, 李焕
Owner: 成都数联铭品科技有限公司