Method and apparatus for marking network contents semantic structure

A semantic structure and network content technology, applied in the field of computer science, can solve the problems of resource consumption, unobvious semantic structure of network content, unacceptable cost, etc.

Inactive Publication Date: 2009-07-08
华天清 +1
View PDF3 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the development investment of a single individual is very small, the total development investment of the entire industry is huge, so the current decentralized state consumes considerable resources
[0006] 2. Information extraction development results cannot be inherited and accumulated
[0008] 3. The semantic structure of web content is not obvious
However, this type of technology requires content publishers to use standard content annotation tags or element attributes to write HTML pages, but to become a standard, it needs to go through a long negotiation process, and content publishers cannot wait for the standard to be promulgated and give up content publishing. ; On the other hand, the current stock content of the Internet is already very huge, and the cost of rewriting using standard content annotations is unacceptable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for marking network contents semantic structure
  • Method and apparatus for marking network contents semantic structure
  • Method and apparatus for marking network contents semantic structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] Preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0032] A Method for Semantic Structure Annotation of Web Content

[0033] figure 1 It is a schematic diagram of a web page content DOM tree and a semantic structure tree. The left figure is a web page content DOM tree, and dark nodes represent DOM nodes carrying semantic symbols. Using the method of the present invention, semantic nodes can be identified and a semantic structure tree can be constructed , as shown in the right figure, the user can use the semantic structure editing means provided by the preferred embodiment of the present invention to modify the semantic tree. Many calculations in the present invention use the layer numbers of the DOM tree and the semantic structure tree, and the layer numbers are numbered incrementally from the root node, as shown by the dotted line. In order to better understand the present invention, so...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and equipment for labeling a semantic structure of network contents. The semantic structure includes the relation of semantic comment of network contents, attributes of commented information and semantic comment. The method comprises the following steps: recognizing the semantic structure of network contents; editing the semantic structure; issuing and sharing description documents of the semantic structure, and establishing a knowledge library of the semantic structure; and utilizing format command files in web pages generated by the semantic structure. The invention belongs to the technical field of computer science and the branch of semantic network, and is suitable for the application fields such as network information extraction, network knowledge base establishment, semantic network service and the like.

Description

technical field [0001] The invention belongs to the semantic network branch of the field of computer science and technology, and specifically relates to a method and device for labeling the semantic structure of network content, which is suitable for network information extraction, self-service promotion of enterprise websites, full-text search and semantic search, business intelligence mining, information aggregation, network Knowledge classification, establishment of network knowledge base, network service and content mashup and other application fields. Background technique [0002] The World Wide Web (Web) has brought great changes to people's information and knowledge management. Now, people are gradually accustomed to searching for knowledge and answers to questions on the World Wide Web. Previously, it took hours or even more time to read books and materials. Now it may only be It takes a few minutes. However, the current information processing method on the World Wi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 华天清齐勇挺
Owner 华天清
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products