Method for extracting and processing network information and its system

A network information and web page technology, applied in the field of data processing, can solve problems such as error rate reduction, ambiguity and powerlessness

Inactive Publication Date: 2004-10-13
陈文中
View PDF0 Cites 84 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method improves the efficiency of word segmentation, but it can do nothing for ambiguity, and the error rate does not decrease

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for extracting and processing network information and its system
  • Method for extracting and processing network information and its system
  • Method for extracting and processing network information and its system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0081] Below in conjunction with accompanying drawing and embodiment the present invention is described in further detail:

[0082] We only considered the process of automatic download and content analysis, and did not construct a corresponding matching model for each website. We implemented a general algorithm for news websites, which is based on the frequency of Chinese content and the intimacy of the content. The frequency and position of the html tag to determine which part is the news content. The implementation method will be described in detail later.

[0083] Since we need to obtain content with relatively high accuracy, and extract information from it and pass it on to end users, we do not need robots to perform deep recursive access. The specific method of realizing automatic download will be introduced in detail later.

[0084] Due to the consideration of generality, we do not consider the web page features of the text, but the automatic summarization of the pure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a network information extracting and processing method, adopting artificial intelligence and natural language processing technique, able to automatically download daily up-to-date news and information from named websites, making content extraction, classification, automatic abstracting and retrenching full text, then storing the full text, and then indexing the full text for making high-efficiency full text retrieval in future.

Description

technical field [0001] The present invention relates to a data processing method and system, more specifically, to a method and system for extracting and processing various information on a computer network, especially online news. Background technique [0002] Today is an era of information explosion. With the rapid development of the Internet, more and more people obtain the latest consulting information through the Internet. [0003] Now, almost everyone has the habit of reading newspapers, especially some individuals and enterprises that have urgent needs for consulting information, and they need to obtain the information they need from many newspapers. We can see almost all the news from the Internet, and many people have obtained the latest news information through the Internet. However, just reading news on the Internet does not reduce the time we need. We still need to read through a large piece of news to know what the news describes, or to check many web pages bef...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/445G06F17/00G06F17/27G06F17/30
Inventor 陈文中
Owner 陈文中
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products