Webpage data capture method

A webpage data capture technology, applicable to electrical digital data processing, special data processing applications, instruments, and related fields. It addresses the difficulty of logging in to many websites and querying their data one by one, and achieves fast and effective data capture.

Status: Inactive. Publication date: 2013-08-14
INSPUR COMMON SOFTWARE
3 Cites, 8 Cited by
AI Technical Summary

Problems solved by technology

[0002] With the continuous development of information technology, the number of systems owned by enterprises keeps increasing. The relatively independent data storage mechanisms of these systems make later data integration and analysis difficult. In particular, some dealers with strong technical capabilities open data queries to enterprises through their own websites, but one enterprise corresponds to many dealers, so logging in and querying data dealer by dealer is difficult. The present invention mainly solves this problem.

Examples

Embodiment Construction

[0017] In order to make the objectives, technical solutions, and advantages of the present invention clearer, the following further describes the embodiments of the present invention in detail.

[0018] A method for capturing webpage data includes the following steps:

[0019] A. Establish a configuration file for the webpage data that describes the login information, page structure, and data acquisition area required to obtain the data;

[0020] B. Implement a processing program for the configuration file. The program first connects to the designated webpage using the login information, parses the webpage (either an ordinary webpage or one using AJAX technology), and extracts its text. It then intercepts the character string of the webpage text according to the page structure described in the configuration file to obtain two-dimensional table data. Based on the table data, the program creates a data table with the same structure in the database, and ...
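Steps A and B can be illustrated with a minimal sketch. It is not the patented implementation: the configuration field names (`area_start`, `area_end`, `row_sep`, `col_sep`, `table_name`), the marker-based interception, and the use of SQLite are all assumptions made for illustration. Login and page download are left out, so the sketch starts from page text that has already been extracted.

```python
import sqlite3

# Hypothetical configuration: every field name here is an assumption,
# not the configuration format defined by the patent.
CONFIG = {
    "login": {"url": "https://dealer.example.com/login", "user": "demo"},
    "table_name": "dealer_stock",
    "area_start": "BEGIN_DATA",  # marker that opens the data acquisition area
    "area_end": "END_DATA",      # marker that closes it
    "row_sep": "\n",
    "col_sep": "|",
}


def extract_table(page_text, config):
    """Cut the configured area out of the page text and split it into rows."""
    start = page_text.index(config["area_start"]) + len(config["area_start"])
    end = page_text.index(config["area_end"], start)
    area = page_text[start:end].strip()
    rows = [
        [cell.strip() for cell in line.split(config["col_sep"])]
        for line in area.split(config["row_sep"])
        if line.strip()
    ]
    return rows[0], rows[1:]  # header row, data rows


def store_table(conn, config, header, rows):
    """Create a table whose columns mirror the captured header, then insert rows."""
    table = config["table_name"]
    columns = ", ".join(f'"{name}" TEXT' for name in header)
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({columns})')
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
    conn.commit()


def capture(page_text, config, conn):
    """Run the extract-then-store pipeline for one page."""
    header, rows = extract_table(page_text, config)
    store_table(conn, config, header, rows)
    return header, rows
```

Calling `capture(page_text, CONFIG, sqlite3.connect(":memory:"))` on text containing a delimited block between the two markers yields the header and rows and writes them to a table named after `table_name`. A real page would more likely need HTML parsing or a rendered DOM for AJAX content; plain string markers are used here only because the text describes intercepting the character string of the page.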

Abstract

The invention relates to the technical field of data analysis and acquisition, in particular to a webpage data capture method. By establishing a data channel for concurrent execution and defining the data capture process for each website, data from websites to which the user has access rights is captured quickly and effectively. For ERP (enterprise resource planning) software developers, the method offers a quick and convenient way to define data capture for the relevant websites, and timed automatic capture in the background removes the need to visit websites manually and download information.
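The concurrent data channel mentioned in the abstract could be sketched as follows. The abstract does not say how concurrency is implemented, so the thread pool, the function name `capture_all`, and the `fetch` callable (standing in for "log in to one dealer site and capture its data") are all assumptions.

```python
from concurrent.futures import ThreadPoolExecutor


def capture_all(sites, fetch, max_workers=4):
    """Run fetch(site) for every site in parallel; collect results and errors per site."""
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit one capture job per site so slow sites do not block the others.
        futures = {site: pool.submit(fetch, site) for site in sites}
        for site, future in futures.items():
            try:
                results[site] = future.result()
            except Exception as exc:
                # One failing dealer site should not stop the rest.
                errors[site] = exc
    return results, errors
```

For the timed background capture, a scheduler (for example a cron job or a loop with a sleep interval) could call `capture_all` periodically; which mechanism the invention uses is not stated in the text above.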

Description

Technical field

[0001] The invention relates to the technical field of data analysis and collection, in particular to a method for webpage data capture.

Background art

[0002] With the continuous development of information technology, the number of systems owned by enterprises keeps increasing. The relatively independent data storage mechanisms of these systems make later data integration and analysis difficult. In particular, some dealers with strong technical capabilities open data queries to enterprises through their own websites, but one enterprise corresponds to a large number of dealers, so logging in and querying data dealer by dealer is difficult. The present invention mainly solves this problem.

Summary of the invention

[0003] To solve the problems of the prior art, the present invention provides a method for webpage data capture that can quickly and effectively capture data from websites with access rights during data collection.

[0004] The technical scheme adopted by...

Application Information

IPC(8): G06F17/30
Inventors: 李海啸, 付传伟, 肖祝川, 刘清华
Owner: INSPUR COMMON SOFTWARE