Method for automatically extracting character relations from text set

A technology of character relationship and automatic extraction, applied in special data processing applications, instruments, electrical digital data processing, etc.

Inactive Publication Date: 2013-08-07
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF4 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0019] The present invention proposes a character relationship extraction method based on the features of the sentence semant

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically extracting character relations from text set
  • Method for automatically extracting character relations from text set
  • Method for automatically extracting character relations from text set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] In order to better illustrate the purpose and advantages of the present invention, the implementation of the method of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples.

[0068] The data source used in the experiment is obtained from the Internet through manual retrieval. In the search, the popular figures "Yao Ming", "Liu Xiang", "Jay Chou", "Zhou Xingchi", "Jackie Chan" and "Kobe" counted by Google are used as the key words of the search. 1540 texts were fetched on the webpage. The description of the data source is shown in Table 1, in which the number of person objects is obtained through manual statistics.

[0069] Table 1 Data source of character relationship extraction experiment

[0070]

[0071] In order to verify the person relationship extraction method, two experiments were conducted:

[0072] (1) Character relationship extraction experiment: used to test the accuracy and comprehe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for automatically extracting character relations from a Chinese text or a text set, and belongs to the technical field of computer science and information extraction. In the method, by means of sentence meaning model characteristics, the relation attribute affiliation is determined; the character relations scattered in the text or the text set are automatically extracted by combining methods such as relation attribute disambiguation, character relation strength calculation and the like; and the character relations are organized by a character relation network, and the character relations comprising character relation attributes and the relation strength are displayed in a character relation graph manner. According to the method, sentence meaning model characteristics are introduced, so that the accuracy of the method for extracting entity relations is improved, and the method for extracting the character relations is enriched. Besides, as the number of texts about a central character in the text set is increased, the method can extract character relations of the central character more accurately and comprehensively, and the application range is wider.

Description

technical field [0001] The invention relates to a method for automatically extracting character relationships from Chinese texts or Chinese text sets, and belongs to the technical field of computer science and information extraction. Background technique [0002] Character relationship extraction is the accurate and rapid automatic extraction of character entities scattered in the text and the relationship between characters, which belongs to the research content of the field of information extraction. [0003] Information extraction technology (IE, Information Extraction) has to complete two major research tasks: entity recognition (EDR, Entity Detection and Recognition) and relational recognition (RDR, Relation Detection and Recognition). Among them, relationship recognition (also known as "relationship extraction") is to extract existing relationships between entities from text, and the types of these relationships are predefined. Character relationship is a kind of enti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
Inventor 罗森林魏超潘丽敏韩磊
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products