The invention relates to the technical field of
network data, and discloses an automatic
network data acquisition method, which comprises the following steps: S1, acquiring
network data to obtain an original webpage; S2, performing
data extraction on the original webpage to obtain a parsed webpage; S3, performing null-value removal, error removal, deduplication, normalization and missing-value completion
on the parsed webpage to obtain processed data; S4, storing the processed data; and S5,
processing the stored data. The method can collect data published by a
third-party platform around the clock without interruption, supports minute-level
retrieval and synchronization of third-party platform data, and can update the incremental data of multiple sites at second-level latency without manual supervision. In addition, keyword-based retrieval configuration improves the efficiency of data updating. The method filters out irrelevant content while retrieving automatically, which improves accuracy and achieves unsupervised, omission-free and rapidly iterated
data acquisition.
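
As a non-limiting illustration, the cleaning of step S3 together with the keyword-based filtering might be sketched as follows. The record fields (`url`, `title`, `summary`), the default values, the rule that a valid record must carry an HTTP(S) URL, and the function names are all assumptions made for this example; the abstract does not specify a schema or concrete validity rules.

```python
from typing import Iterable, Optional

# Hypothetical schema: the abstract does not name any fields.
REQUIRED = ("url", "title")      # a record lacking these is treated as erroneous
DEFAULTS = {"summary": ""}       # values used to complete missing fields


def normalize(record: dict) -> dict:
    """Trim and collapse whitespace in every string field."""
    return {
        key: " ".join(value.split()) if isinstance(value, str) else value
        for key, value in record.items()
    }


def clean(records: Iterable[Optional[dict]], keywords: Iterable[str] = ()) -> list:
    """Apply the S3-style steps in order and return the surviving records."""
    wanted = [k.lower() for k in keywords]
    seen = set()
    out = []
    for raw in records:
        if not raw:                                   # null-value removal
            continue
        rec = normalize(raw)                          # normalization
        if any(not rec.get(field) for field in REQUIRED):
            continue                                  # error removal: required field empty
        url = rec["url"]
        if not isinstance(url, str) or not url.startswith(("http://", "https://")):
            continue                                  # error removal: assumed URL rule
        if wanted and not any(k in str(rec["title"]).lower() for k in wanted):
            continue                                  # keyword filter: drop irrelevant content
        if url in seen:                               # deduplication, keyed on URL
            continue
        seen.add(url)
        for key, default in DEFAULTS.items():         # missing-value completion
            if rec.get(key) is None:
                rec[key] = default
        out.append(rec)
    return out
```

Calling `clean(parsed_records, keywords=["tender"])` on the output of step S2 would yield the records to be stored in step S4. Keying deduplication on the URL is one possible choice; a content hash would serve equally well where the same item appears under several addresses.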