Entity information atlas generation method and device

A technology of entities and graphs, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as no semantics, no knowledge, and difficult keywords to express user retrieval intentions clearly

Active Publication Date: 2016-04-06
刘玉静
View PDF7 Cites 94 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] 1. There are too many relevant information recalled by search engines, and it is difficult for users to locate the required information;
[0007] 2. Keyword-based search, although the matching algorithm is simple and easy to implement, it stays on the surface of the language without touching the semantics, and it is difficult to express the user's search intention clearly with the logical combination of several keywords;
[0008] 3. Even if the correct result is obtained, it is only a link to each independent article, which requires users to browse one by one;
[0009] 4. Unable to provide the physical and temporal correlation between articles, and reveal the internal connections and relationships of things
[0010] The development of the Internet has become a huge knowledge base, but because most of the information exists in unstructured data, people cannot organize and use this knowledge achievement, so there is no information and no knowledge.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity information atlas generation method and device
  • Entity information atlas generation method and device
  • Entity information atlas generation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described below are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0056] figure 1 is the principle diagram of the entity information map generation method provided by the embodiment of the present invention, such as figure 1 As shown, the steps include:

[0057] Step S101: Collect text files from local and / or network.

[0058] Specifically, there are three main ways to collect text files:

[0059] 1. Use a web crawler (predefined URL) to obtain text files in the network;

[0060] 2. Obtain text files through existing search engines;

[0061] 3. Get the text file locally.

[0062] Step S102: According to the predefined category names and relational words, extract the named entities related to each category name and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an entity information atlas generation method and device. The method comprises the following steps: collecting a text file; according to predefined class names and relation words, independently extracting a naming entity associated with each class name from each text file, and the attribute of the naming entity associated with each relation word; according to the attribute of the naming entity, independently carrying out association processing on the naming entities in each collected text file to obtain the entity relationship among all naming entities; according to a predefined event name, looking up the naming entity associated with the predefined event name, binding the predefined event name with the found naming entity; and taking the predefined event name as a clue, establishing mapping for associated information dispersed in each text file according to the extracted naming entity and the entity relation, and carrying out aggregation on the associated information to form an entity information atlas. The method can convert unstructured text data into structured data to realize multidimensional and complex knowledge atlas.

Description

technical field [0001] The present invention relates to natural language processing technology, in particular to a method for generating an entity information map and a related device. Background technique [0002] With the rapid development of the Internet, people are faced with an information explosion. Massive information is scattered on the Internet, which is fragmented, multilingual and international. The Internet is actually like a huge library. Every computer connected to the network is like a bookcase. This library has no catalog and is dynamic and rapidly increasing. At present, the work of the search engine is only to provide the position of the relevant books containing the keywords inquired by the user according to the keywords of the user, and find out the position of the books in the library. People are often submerged in the ocean of information. [0003] Due to the rapid development of the network, the dissemination speed of Internet information has increase...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 李晓戈李宗海高剑凌
Owner 刘玉静
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products