Method for automatically finding network content quotation

A technology for automatically discovering citations of network content, applied in special data processing applications, instruments, calculations, etc. It addresses the high cost of manual discovery and achieves fast speed, an accelerated automatic discovery process, and low hardware requirements.

Status: Inactive
Publication Date: 2008-08-20
NEW FOUNDER HLDG DEV LLC +2

AI Technical Summary

Problems solved by technology

[0006] There is currently no method for automatically discovering network content citations, and manual discovery requires so much manpower and material resources that its cost is too high. As a result, unauthorized citation and reprinting of network content is widespread, and the homogeneity of network content is a very serious problem.



Examples


Embodiment Construction

[0027] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0028] The experiment for the present invention was carried out on an ordinary PC with a P4 2.0 GHz CPU, 512 MB of memory, and the Windows 2000 operating system. As shown in Figure 1, the method for automatically discovering network content references includes the following steps:

[0029] 1) Content reading: read the content of the specified website that is to be checked for being referenced;

[0030] 2) Feature analysis: first, word segmentation is carried out; then keyword extraction is used to calculate a weight score for each word from its frequency, position, part of speech, word length, whether it is a common word, and other information about the word in the document; finally, the 10 words with the highest weights are selected as feature words;

[0031] 3) Search condition: the content features are formed into a search condition according to the retrieval format required by the search engine website, in ...
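Steps 2) and 3) above can be illustrated with a minimal sketch. It is not the patented implementation: the stop-word list, the title boost used as a stand-in for "position", the length factor, and the function names `score_words`, `feature_words`, and `build_query` are all assumptions made for illustration, and part of speech is left out entirely. The query format is a generic quoted-term string, since the actual retrieval format depends on the search engine used.

```python
import re
from collections import Counter

# Hypothetical list of "common words"; a real system would use a full list.
COMMON_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on"}


def tokenize(text):
    """Simple lowercase word segmentation (stand-in for a real segmenter)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score_words(title, body):
    """Return {word: weight} using assumed, illustrative weighting rules."""
    title_words = set(tokenize(title))
    freq = Counter(tokenize(body))
    scores = {}
    for word, count in freq.items():
        if word in COMMON_WORDS:
            continue  # common words never become feature words
        weight = float(count)  # frequency in the document body
        if word in title_words:
            weight *= 2.0  # position: assumed boost for words in the title
        weight *= 1.0 + min(len(word), 10) / 10.0  # assumed length factor
        scores[word] = weight
    return scores


def feature_words(title, body, k=10):
    """Select the k highest-weighted words (ties broken alphabetically)."""
    scores = score_words(title, body)
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:k]


def build_query(words):
    """Join feature words into a generic quoted-term query string."""
    return " ".join(f'"{w}"' for w in words)


if __name__ == "__main__":
    title = "Automatic citation discovery"
    body = "Citation discovery finds citation reuse on the web. The web is large."
    words = feature_words(title, body, k=3)
    print(words)
    print(build_query(words))
```

The default `k=10` mirrors the 10 feature words named in step 2); the resulting query string would then be submitted to a search engine in whatever format that engine requires.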



Abstract

The invention relates to a method for automatically discovering that network content has been quoted. The method introduces a pre-search process to accelerate automatic discovery, and uses the indexing service provided by search engine websites so that web pages need not be crawled and no content index needs to be built. The invention has low hardware requirements and can help protect the intellectual property of network content.

Description

Technical field

[0001] The invention belongs to intelligent information processing technology, and in particular relates to a method for automatically discovering references to network content.

Background technique

[0002] At present, mutual citation between network contents on the Internet is a prominent problem. Most of it is unauthorized citation or plagiarism, which seriously infringes the intellectual property rights of the copyright owners. However, there is no automatic method for discovering citations of web content, and people have to resort to manual methods. The main manual methods for discovering that specific website content has been cited are as follows:

[0003] 1. Website browsing method. Go to the relevant websites and browse them to see whether they reference the content. Because websites are numerous, rich in content, and frequently updated, this method not only requires a lot of manpower but also inevitably misses some citations. ...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 杨建武, 陈晓鸥, 吴於茜
Owner: NEW FOUNDER HLDG DEV LLC