Unlock instant, AI-driven research and patent intelligence for your innovation.

Semantic labeling method based on resource description framework (RDF) knowledge base

A technology of semantic annotation and knowledge base, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve problems such as insufficient processing efficiency, and achieve the effect of improving efficiency and accuracy

Active Publication Date: 2012-11-14
杜小勇
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, the methods for labeling unstructured data in the prior art have insufficient processing efficiency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic labeling method based on resource description framework (RDF) knowledge base
  • Semantic labeling method based on resource description framework (RDF) knowledge base
  • Semantic labeling method based on resource description framework (RDF) knowledge base

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] In each embodiment of the present invention, the object to be semantically tagged is unstructured data of text type, and the data to be semantically tagged is extracted from the unstructured data by using information extraction technology. The to-be-labeled data described in the following embodiments Data, that is, the data extracted from unstructured data; the data to be labeled extracted from unstructured data can be words, phrases or sentences with a system preset length; and then use the method in each embodiment of the present invention Semantic annotation is performed on the extracted data to be annotated.

[0016] Various embodiments of the present invention implement semantic annotation of unstructured data based on a cloud platform. In specific applications, 2-3 or more ordinary computers that can build a cloud platform can be used to build a cloud platform, and a server with a higher configuration can be used to virtualize multiple computers to build a cloud p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a semantic labeling method based on a resource description framework (RDF) knowledge base. The method includes utilizing data to be labeled as key words to look through the RDF knowledge base to acquire attribute information of one or a plurality of matching objects in fuzzy match with the data to be labeled, utilizing acquired entity names respectively corresponding the attributed information of the matching objects as first labeling information, respectively distributing preset first weight to the entity names in the first labeling information, utilizing one or a plurality of entity names acquired in an entity neighborhood list in the knowledge base and has neighborhood relation with the entity names in the first labeling information as second labeling information, distributing preset second weight to the entity names in the second labeling information in the way that the second weight is smaller than the first weight, conducting statistics on the weight of the acquired entity names, utilizing entity names with highest weights as semantic labeling information of the data to be labeled and outputting the semantic labeling information. The semantic labeling method effectively improves semantic labeling accuracy and efficiency of the unstructured data.

Description

technical field [0001] The invention relates to computer technology, in particular to a semantic tagging method based on RDF knowledge base. Background technique [0002] Unstructured data refers to data without an explicit data structure, including text data, web page information, emails, graphic images, audio and video, etc. Since the data sources of these data are diverse, and there are many redundant, erroneous, and semantically unclear information in the data, it is necessary to semantically annotate the unstructured data before utilizing it. [0003] At present, using traditional natural language processing methods combined with data mining tools, through lexical and grammatical analysis of unstructured data, the information such as speech and semantics of unstructured data is marked. [0004] However, labeling unstructured data by analyzing lexical and grammatical methods requires pre-defining complex natural language models, or using supervised or semi-supervised me...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 杜小勇陈跃国陈晋川杜方
Owner 杜小勇