Unlock instant, AI-driven research and patent intelligence for your innovation.

Unstructured natural language information extraction method based on 6W semantic annotation

An unstructured, natural language technology used in the information domain

Active Publication Date: 2015-02-25
KARAMAY HONGYOU SOFTWARE
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The object of the present invention is to provide a kind of unstructured natural language information extraction method based on 6W semantic mark, thus solve the aforementioned problems existing in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unstructured natural language information extraction method based on 6W semantic annotation
  • Unstructured natural language information extraction method based on 6W semantic annotation
  • Unstructured natural language information extraction method based on 6W semantic annotation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] refer to figure 1 , a kind of unstructured natural language information extraction method based on 6W semantic mark, this extraction method, comprises the following steps:

[0053] S1, copy the metadata stored in the complete data metadata model in the database to the cache module to obtain the metadata copy text;

[0054] S2, performing text analysis on the unstructured natural language to obtain a file File with data elements in the unstructured language;

[0055] S3, manually process the data element, then create an index file, and finally go through metadata registration, record and save the path of the file, and complete the extraction of the unstructured natural language information based on the 6W semantic mark;

[0056] The 6W refers to six scenes, specifically including: time scene, activity scene, object scene, location scene, participant scene and result scene, and data elements related to each scene are stored in the six scenes.

[0057] refer to figure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an unstructured natural language information extraction method based on 6W semantic annotation and relates to the technical field of information. The unstructured natural language information extraction method based on 6W semantic annotation includes the following steps that firstly, metadata stored in an integral data element data model in a database are copied to a cache, so that copying test of the metadata is obtained; secondly, test analysis is carried out on an unstructured natural language to obtain a file 8 with data elements of the unstructured language; thirdly, the data elements are processed manually, then an index file is built, and finally unstructured natural language information extraction based on 6W semantic annotation is completed by registering metadata and recording and saving file paths, wherein 6W is scene data of six dimensions. The unstructured natural language information extraction method based on 6W semantic annotation solves the problems that an existing information extraction method has high requirements for engineers compiling rules, time and labor are wasted, and needed information aggregate maximization can not be met.

Description

technical field [0001] The present invention relates to the field of information technology, in particular to a method for extracting unstructured natural language information based on 6W semantic tags. Background technique [0002] Information extraction technology is to structurally process the information contained in the text into a table-like organizational form. It originated from natural language processing and was the first tool to process free text. However, with the rise of the Internet, the amount of structured text and semi-structured text has continued to surge, causing scientists to widely apply information extraction technology to these two types of text. Therefore, the existing information extraction technology is responsible for how to describe text and how to learn features. Responsibility: Among them, how to describe the text is to use features to describe the text; the basis of how to learn features is the knowledge engineering method and automatic train...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/313
Inventor 贾磊
Owner KARAMAY HONGYOU SOFTWARE