The invention provides a universal internet
data acquisition method. The method comprises the steps of executing
transaction scheduling, judging the type of an acquisition transaction, and if the acquisition transaction is a media or a file link, executing corresponding document acquisition
processing; if the access address of the webpage acquisition transaction is not in a history grasping
library, conducting acquisition according to a newly found webpage; obtaining the last acquisition information of the webpage address from the history grasping
library if the acquisition transaction is in the history grasping
library; comparing the amount of page content of a current webpage address to the amount of the last webpage content if an
internal time exceeds a renewing frequency, if the amount of the webpage content of the current webpage address is not equal to that of the last webpage content, obtaining a webpage
source code of the webpage link, renewing acquisition information of the webpage address in a history access library, and executing webpage washing and extraction. The invention provides a universal internet
data acquisition method. According to the universal internet
data acquisition method, by utilizing a transaction control strategy to conduct efficient data acquisition,
data mining is conducted aiming at a
coupling relation among multi-dimensional objects.