Evaluative text-oriented kernel entity identification method and apparatus

An entity recognition and entity technology, applied in the information field, can solve problems such as accurate identification of core entities, complex and changeable core entity names, and lack of information, and achieve the effect of reducing the size of training samples and improving training efficiency and effectiveness

Active Publication Date: 2017-04-19
INST OF INFORMATION ENG CAS
View PDF4 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these massive evaluative texts are rich in variety, language fragmentation is serious, and the names of core entities are complex and changeable. It is difficult for rule matching to accurately identify core entities from evaluative texts.
[0004] Although manual annotation has a high accuracy rate, it is ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Evaluative text-oriented kernel entity identification method and apparatus
  • Evaluative text-oriented kernel entity identification method and apparatus
  • Evaluative text-oriented kernel entity identification method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

example

[0041] Example: A Core Entity Recognition Method for Evaluative Text

[0042] For different types of evaluative texts, find out the core entities among them. Taking tourism reviews as an example, "In spring, the scenery of the Summer Palace is beautiful." This sentence mainly focuses on the Summer Palace, so the core entity is "Summer Palace".

[0043] 1) First, analyze whether the entity category of a certain type of review has a relatively standardized naming, such as scenic spot names, car brands, etc., and there are limited and unified names on the whole. A specific industry nomenclature dictionary can be constructed through network collection. Since the entities discussed in the evaluative text appear in the first half of the sentence, the first word in the industry naming dictionary that appears in the first half of the text is taken as the core entity of the sentence.

[0044] For the text that is not successfully matched, it is output to the subsequent model recogniti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an evaluative text-oriented kernel entity identification method and apparatus. The method comprises the following steps of 1) inputting an evaluative text, and identifying a kernel entity in the evaluative text according to expert rules and an industry specialized dictionary, 2) identifying a kernel entity via a well-trained bidirectional LSTM model for evaluation texts yet to be identified, 3) generating a candidate entity for a kernel entity according to existing entity set statistics and a combination of text segmentation and part-of-speed tagging for evaluative texts yet to be identified. The apparatus comprises a rule matching module, a model identifying module and a candidate generation module. For various types mixed evaluative texts, the kernel entity in the text can be accurately and effectively extracted; and powerful foundation can be laid for user decision judgment.

Description

technical field [0001] The invention belongs to the field of information technology, and in particular relates to an evaluation text-oriented core entity recognition method and device. Background technique [0002] Evaluative text refers to the comment sentences on various commodities and services in user consumption behavior. The common ones include user comments on various shopping, catering, and travel websites, such as food reviews, after-viewing movies, travel notes, etc. Core entity recognition, which is to identify the most important entities discussed in the text from the evaluative text in combination with the context. This kind of evaluative text is an important factor affecting the consumption of potential users, and extracting the core entities in the text can provide a strong basis for user decision-making and judgment. [0003] With the development of network technology and the popularization of mobile terminals, online consumption by users is becoming more an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/3346G06F40/284
Inventor 李全刚柳厅文王玉斌李柢颖时金桥亚静郭莉
Owner INST OF INFORMATION ENG CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products