Internet-oriented meaningful string mining method and system

A technology for mining meaningful strings on the Internet, applied in the fields of information retrieval and operating systems. It addresses the difficulty Web users have in effectively obtaining useful information and in identifying or grasping key information, with the effect of reducing time complexity and improving accuracy and recall.

Inactive Publication Date: 2008-03-26
INST OF COMPUTING TECH CHINESE ACAD OF SCI
0 Cites, 23 Cited by

AI Technical Summary

Problems solved by technology

[0002] The Internet holds a vast amount of information, but its sheer volume makes it difficult for Web users to obtain useful information effectively. Users often feel overwhelmed by an ocean of information that is updated day and night, and do not know how to find what they need in the massive amount of information...

Examples

Embodiment Construction

[0043] To make the purpose, technical solution and advantages of the present invention clearer, the Internet-oriented method and system for mining meaningful strings are described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here only explain the present invention and do not limit it.

[0044] The present invention defines a meaningful string as a character string that carries useful information on the Internet and is used in a variety of contexts. The most important feature of a meaningful string is semantic integrity. Drawing on analysis of statistics, structure, pragmatics and semantics, the present invention proposes a general mining method and system for meaningful strings.

[0045] The present invention divides the meaningful string mining process into repeated string discovery, context adjacency analysis,...
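The first two stages named above can be illustrated with a minimal Python sketch. The source text is truncated and does not give the actual algorithms, so everything here is an assumption made for illustration: repeated strings are found by brute-force substring counting, and context adjacency is approximated by counting the distinct characters seen immediately to the left and right of a candidate. The function names and the parameters `min_len`, `max_len`, `min_count` and `min_variety` are hypothetical.

```python
from collections import Counter


def find_repeated_strings(text, min_len=2, max_len=4, min_count=2):
    """Count every substring of length min_len..max_len and keep those
    that occur at least min_count times (assumed stand-in for the
    repeated string discovery stage)."""
    counts = Counter()
    for n in range(min_len, max_len + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return {s: c for s, c in counts.items() if c >= min_count}


def adjacency_variety(text, candidate):
    """Return (distinct left neighbors, distinct right neighbors) over
    all occurrences of candidate, including overlapping ones."""
    left, right = set(), set()
    start = text.find(candidate)
    while start != -1:
        end = start + len(candidate)
        left.add(text[start - 1] if start > 0 else "^")    # "^" marks start of text
        right.add(text[end] if end < len(text) else "$")   # "$" marks end of text
        start = text.find(candidate, start + 1)
    return len(left), len(right)


def filter_by_context(text, candidates, min_variety=2):
    """Keep candidates whose neighbors vary on both sides. A string that
    is always followed (or preceded) by the same character is probably a
    fragment of a longer string."""
    kept = {}
    for s, count in candidates.items():
        n_left, n_right = adjacency_variety(text, s)
        if n_left >= min_variety and n_right >= min_variety:
            kept[s] = count
    return kept
```

For the toy input `"xabcy zabcw qabcp"` with `max_len=3`, the repeated strings are `ab`, `bc` and `abc`. Only `abc` survives the context filter, because `ab` is always followed by `c` and `bc` is always preceded by `a`.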

Abstract

The invention discloses an Internet-oriented meaningful string mining method and system. The method includes the following steps: step A, discover repeated character strings; step B, filter the character strings through context analysis; step C, analyze and filter the character strings using a language model. It can effectively extract meaningful strings from web pages or large-scale text data.
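The abstract does not say which language model step C uses. As a sketch only, the following Python example assumes a character-bigram model with add-one smoothing, trained on a background corpus, and keeps candidates whose average transition log-probability reaches a threshold. The class `CharBigramModel`, the function `filter_by_language_model` and the `threshold` parameter are hypothetical names introduced for illustration, not the patent's method.

```python
import math
from collections import Counter


class CharBigramModel:
    """Character-bigram model with add-one smoothing (an assumed
    stand-in for the unspecified language model in step C)."""

    def __init__(self, corpus):
        # How often each character appears as the left side of a bigram.
        self.context_counts = Counter(corpus[:-1])
        # How often each adjacent character pair appears.
        self.bigram_counts = Counter(zip(corpus, corpus[1:]))
        self.vocab_size = len(set(corpus))

    def avg_log_prob(self, s):
        """Mean smoothed log P(next char | previous char) over s."""
        pairs = list(zip(s, s[1:]))
        if not pairs:
            return 0.0  # a single character has no transitions to score
        total = 0.0
        for prev, nxt in pairs:
            numerator = self.bigram_counts[(prev, nxt)] + 1
            denominator = self.context_counts[prev] + self.vocab_size
            total += math.log(numerator / denominator)
        return total / len(pairs)


def filter_by_language_model(model, candidates, threshold):
    """Keep candidates the model scores at or above threshold."""
    return [s for s in candidates if model.avg_log_prob(s) >= threshold]
```

Averaging over transitions keeps scores comparable across candidates of different lengths. A production system would need a much larger background corpus and a tuned threshold.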

Description

Technical field

[0001] The invention relates to the fields of information retrieval and operating systems, and in particular to an Internet-oriented meaningful string mining method and system.

Background technique

[0002] The Internet holds a vast amount of information, but its sheer volume makes it difficult for Web users to obtain useful information effectively. Users often feel overwhelmed by an ocean of information that is updated day and night. They do not know how to find the information they really want in this mass of data, let alone how to obtain or grasp the key information in it and keep up with what is currently important. At the same time, with new information emerging constantly, no one can "see six directions and listen to all directions". People therefore urgently need the strong support of natural language processing technology to deal with the increasingly serious...

Application Information

IPC(8): G06F17/30
Inventor: 张华平, 贺敏, 黄玉兰, 龚才春
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI