Event key information extraction method combining domain synonym dictionary and pattern matching

A technique combining key information extraction and pattern matching, applied in the field of data processing, that addresses problems such as low efficiency, unclear extraction targets, and a complicated model training process, and improves extraction accuracy.

Pending Publication Date: 2021-12-07
COMMUNICATION UNIVERSITY OF CHINA
0 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

The main shortcomings of this method are a complex model training process and low efficiency.
[0010] At present, most information extraction methods take text publicly available on the Internet, such as Wikipedia and Baidu Encyclopedia data, as their main research object. The extracted information has no clear target, so this work remains at the stage of general-method research.

Examples

Embodiment Construction

[0038] The present invention is further described below. Note that this embodiment is based on the technical solution and provides a detailed implementation and specific operating procedure, but the protection scope of the present invention is not limited to this embodiment.

[0039] This embodiment provides a method for extracting key event information by combining a domain synonym dictionary with pattern matching, including the following steps:

[0040] S1. Starting from the real application scenario, analyze the text features of the specific field, determine the key information of an event, and define the slots to be filled accordingly.
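
As a rough illustration of step S1, the Python sketch below shows one way a slot template could be represented. The class name `EventSlots`, its field names, and the sample values are assumptions made for illustration; the source does not specify the actual slots.

```python
from dataclasses import dataclass, field, fields
from typing import List, Optional


@dataclass
class EventSlots:
    """Hypothetical slot template for one event; every slot starts empty."""
    person_name: Optional[str] = None
    id_number: Optional[str] = None
    phone_numbers: List[str] = field(default_factory=list)
    license_plate: Optional[str] = None
    address: Optional[str] = None

    def unfilled(self) -> List[str]:
        """Return the names of slots that have not been filled yet."""
        return [f.name for f in fields(self) if not getattr(self, f.name)]


# Fill two slots with invented values and list what is still missing.
slots = EventSlots()
slots.person_name = "Zhang San"
slots.phone_numbers.append("13800138000")
print(slots.unfilled())
```

Keeping the template as a plain record makes it easy to see which key information an extraction run has not yet found.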

[0041] Taking case text as an example, its characteristics mainly include: ① The text is short, with a relatively simple structure. ② It contains a lot of personal identity information, such as names, ID card numbers, telephone numbers, mobile phone numbers, license plate numbers, and addresses. ③The text has a cert...
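
Identity-related fields such as those listed above tend to have regular surface forms, which is what makes pattern matching a natural fit. The sketch below is a minimal illustration, not the patent's rule system: the rule table `RULES`, the function `extract`, and the sample sentence are all hypothetical, and the two patterns assume 11-digit mainland mobile numbers starting with 1 and 18-character resident ID numbers (17 digits plus a digit or X).

```python
import re
from typing import Dict, List

# Illustrative rules kept as data, separate from the engine below.
# Assumed formats: 11-digit mobile numbers starting with 1, and
# 18-character ID numbers (17 digits followed by a digit or X).
RULES: Dict[str, str] = {
    "mobile_number": r"(?<!\d)1[3-9]\d{9}(?!\d)",
    "id_number": r"(?<!\d)\d{17}[\dXx](?!\d)",
}


def extract(text: str, rules: Dict[str, str]) -> Dict[str, List[str]]:
    """Apply every rule to the text and collect the matches per slot name."""
    return {slot: re.findall(pattern, text) for slot, pattern in rules.items()}


# Invented sample sentence; the numbers are placeholders, not real data.
sample = "Caller 13800138000 reported the case; ID 11010119900101123X."
result = extract(sample, RULES)
print(result)
```

The lookarounds `(?<!\d)` and `(?!\d)` stop a rule from matching inside a longer digit run, so the mobile pattern does not fire on part of an ID number. Because the rules live in a table rather than in the engine, new patterns can be added without changing `extract`.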

Abstract

The invention discloses an event key information extraction method combining a domain synonym dictionary and pattern matching. The method comprises the following steps: starting from real application scenarios and targeting a specific field, clarifying the extraction target and designing the slots to be filled; constructing a dedicated information dictionary based on the distinctive features of information in the specific field, and expanding the knowledge base from seed words using a deep-learning-based word vector method; building, on the basis of the knowledge base, a rule system that performs shallow syntactic analysis through pattern matching; and inviting linguists to analyze the features of relevant texts in the specific field and write rules, finally completing key information extraction in the specific field. The method separates the rules from the system that executes them and extracts the relevant information, and combines linguistics and computer science in rule writing, so that the joint efforts of linguists and computer scientists improve the accuracy of information extraction.
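
The abstract does not say how seed words are used to expand the knowledge base. One common approach, shown here purely as an assumption, is to add any word whose vector is close to a seed word under cosine similarity. The toy vectors, the `expand` function, and the threshold value are all invented for this sketch; a real system would load vectors from a trained embedding model.

```python
import math
from typing import Dict, List, Set

# Toy 3-dimensional vectors invented for this sketch.
VECTORS: Dict[str, List[float]] = {
    "theft": [0.9, 0.1, 0.0],
    "burglary": [0.8, 0.2, 0.1],
    "robbery": [0.7, 0.3, 0.0],
    "weather": [0.0, 0.1, 0.9],
}


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def expand(seeds: Set[str], vectors: Dict[str, List[float]],
           threshold: float) -> Set[str]:
    """Return the seeds plus every word similar enough to at least one seed."""
    expanded = set(seeds)
    for word, vec in vectors.items():
        if any(cosine(vec, vectors[s]) >= threshold for s in seeds):
            expanded.add(word)
    return expanded


lexicon = expand({"theft"}, VECTORS, threshold=0.9)
print(sorted(lexicon))
```

In practice the candidates produced this way would still need human review before entering the dictionary, which fits the abstract's emphasis on linguists working alongside engineers.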

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to an event key information extraction method based on the combination of a domain synonym dictionary and pattern matching.

Background art

[0002] With the widespread adoption of computers in various fields and the rapid development of the Internet, the total amount of information in society is increasing exponentially. The total amount of information has grown from MB (10^6) in the early 1990s to GB (10^9) and then to the current TB (10^12). Since the start of the 21st century, the total amount of information in the world has been doubling every three years. According to statistics, 60% to 70% of this massive amount of information exists in the form of electronic documents. To cope with the challenges brought by the information explosion, there is an urgent need for automated technologies that help people quickly find the information they really need i...

Application Information

IPC(8): G06F16/33, G06F40/242, G06F40/211, G06F40/247, G06F40/289, G06F40/295, G06F40/30, G06N3/08
CPC: G06F16/3344, G06F40/242, G06F40/211, G06F40/295, G06F40/289, G06F40/247, G06F40/30, G06N3/08
Inventor: 程南昌, 李正涵, 李姣, 杨柳
Owner: COMMUNICATION UNIVERSITY OF CHINA