Universal internet data acquisition method

A data collection and Internet technology, applied in the direction of network data retrieval, network data indexing, electrical digital data processing, etc., can solve problems such as multi-dimensional object data mining, and achieve the effect of efficient data collection

Inactive Publication Date: 2017-10-10
成都布林特信息技术有限公司
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing search tools do not consider the complex environment formed by the above multi-dimensional objects for data mining

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Universal internet data acquisition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] The following and accompanying appendices illustrating the principles of the invention Figure 1 A detailed description of one or more embodiments of the invention is provided together. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details.

[0016] One aspect of the present invention provides a general Internet data collection method. figure 1 It is a flowchart of a general Internet data collection method according to an embodiment of the present invention.

[0017] The s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a universal internet data acquisition method. The method comprises the steps of executing transaction scheduling, judging the type of an acquisition transaction, and if the acquisition transaction is a media or a file link, executing corresponding document acquisition processing; if the access address of the webpage acquisition transaction is not in a history grasping library, conducting acquisition according to a newly found webpage; obtaining the last acquisition information of the webpage address from the history grasping library if the acquisition transaction is in the history grasping library; comparing the amount of page content of a current webpage address to the amount of the last webpage content if an internal time exceeds a renewing frequency, if the amount of the webpage content of the current webpage address is not equal to that of the last webpage content, obtaining a webpage source code of the webpage link, renewing acquisition information of the webpage address in a history access library, and executing webpage washing and extraction. The invention provides a universal internet data acquisition method. According to the universal internet data acquisition method, by utilizing a transaction control strategy to conduct efficient data acquisition, data mining is conducted aiming at a coupling relation among multi-dimensional objects.

Description

technical field [0001] The invention relates to data retrieval, in particular to a general Internet data collection method. Background technique [0002] With the continuous development of Web technology, network information resources are growing geometrically. How to quickly retrieve useful data related to users from the massive information on the Internet has become an urgent problem to be solved. Search engines are developed on the basis of information retrieval technology. The search engine helps the invention better express and store essential information in the real world, and by analyzing the connection information in the search engine, it can be used as a useful tool for mining hidden information. Existing search engines simply rely on limited search words to express user needs, which has the problem of incomplete expression. Even for the same search term, different users may expect different results. For example, the Weibo system, if considering the relationship...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/951
Inventor 张鹏
Owner 成都布林特信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products