Entity extraction method based on grammar templates

An entity extraction technique based on grammar templates, applied in fields such as special data processing applications, instruments, and electrical digital data processing, that achieves the effect of real-time modification.

Active Publication Date: 2017-01-11
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
3 Cites, 6 Cited by

AI Technical Summary

Problems solved by technology

[0011] To address current problems in entity recognition, especially the shortcomings in recognizing special entities in specific domains, a method for extracting entities based on grammar templates is proposed.


Examples

Embodiment Construction

[0022] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0023] Figure 1 is a schematic flowchart of a grammar-template-based entity extraction method according to an embodiment of the present invention. The technical solution of the present invention focuses on the definition and matching of grammar templates in order to extract the relevant entities. Therefore, the definition of grammar templates, grammar template matching and entit...


Abstract

The invention relates to an entity extraction method based on grammar templates. The method comprises the following steps: defining grammar templates with contexts, such that the templates can reference one another and support regular expressions, ordinary characters, and combinations thereof; converting each grammar defined in the grammar templates into a grammar tree, attempting a match on each of the branch nodes of every node of the grammar tree, and selecting the branch node that consumes the most characters as the optimal match; and performing category filtering on the matching results of the grammar templates to extract the required entities.
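
The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the template representation (a dictionary of named templates, each holding a list of branches), the template names, and the functions `match_item`, `match_template` and `extract` are all hypothetical, and the "contexts" mentioned in the abstract are not modeled. The sketch only shows templates that reference one another, mix regular expressions with ordinary characters, pick the branch that consumes the most characters, and filter results by category.

```python
import re

# Hypothetical template set: each name maps to a list of branches, and each
# branch is a sequence of ("lit", text), ("re", pattern) or ("ref", name) items.
TEMPLATES = {
    "number": [[("re", r"\d+(?:\.\d+)?")]],
    "unit": [[("lit", "G")], [("lit", "GB")], [("lit", "inch")]],
    "capacity": [[("ref", "number"), ("ref", "unit")]],
    "model": [
        [("lit", "iPhone "), ("ref", "number")],
        [("lit", "iPhone "), ("ref", "number"), ("lit", " Plus")],
    ],
}


def match_item(item, text, pos):
    """Match one item at pos; return the end offset, or None on failure."""
    kind, value = item
    if kind == "lit":  # ordinary characters
        return pos + len(value) if text.startswith(value, pos) else None
    if kind == "re":  # regular expression anchored at pos
        m = re.compile(value).match(text, pos)
        return m.end() if m else None
    return match_template(value, text, pos)  # "ref": another template


def match_template(name, text, pos):
    """Try every branch of a template node and keep the longest match."""
    best = None
    for branch in TEMPLATES[name]:
        end = pos
        for item in branch:
            end = match_item(item, text, end)
            if end is None:
                break
        if end is not None and (best is None or end > best):
            best = end  # this branch consumes more characters
    return best


def extract(text, categories):
    """Scan the text and return (category, entity) pairs for the wanted categories."""
    found, pos = [], 0
    while pos < len(text):
        best_cat, best_end = None, pos
        for cat in categories:  # category filtering
            end = match_template(cat, text, pos)
            if end is not None and end > best_end:
                best_cat, best_end = cat, end
        if best_cat is None:
            pos += 1  # nothing matched here; move on one character
        else:
            found.append((best_cat, text[pos:best_end]))
            pos = best_end
    return found


print(extract("iPhone 6 Plus 64GB", ["model", "capacity"]))
```

In this sketch the nested dictionary plays the role of the grammar tree: each template is a node and each alternative is a branch. Because every node simply keeps its longest branch without backtracking, a left-recursive template would recurse forever; a real implementation would need to guard against that.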

Description

Technical field

[0001] The invention relates to an entity extraction method based on a grammar template.

Background technique

[0002] Entity recognition is an important basic tool in natural language processing. Its main task is to identify entities with specific meanings in texts, such as names of people, places, institutions, proper nouns, etc. It is an integral part of many natural language processing technologies such as question answering systems.

[0003] The process of entity recognition mainly includes two parts: entity boundary identification and determination of entity category (person name, place name or others). Entities in English have obvious form marks (that is, the first letter of each word in the entity should be capitalized), so entity boundary recognition is relatively easy, and the focus of the task is to determine the category of the entity. Compared with English, the Chinese named entity recognition task is more complex, and compared with the entity ...


Application Information

IPC(8): G06F17/27, G06F17/30
Inventor: 唐培忠
Owner: BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD