The invention provides a method for automatically reconstructing a website
site map. The method specifically comprises the following steps: S1, collecting website page; S2, extracting the digital identifier from each collected
web page to obtain the unique digital identifier DOM_ID of each
web page, and storing the unique digital identifier DOM_ID: PAGEs in a key value pair mode to classify and save the unique digital identifier DOM_ID: PAGEs to obtain the
web page information set MAP of the
web site; 3, statistically analyze that MAP of the web page information set of the
web site by using the judgment rule, and determining the column object
list COLUMNs of the
web site; S4, for the column object
list COLUMNs determined in the step S3, the column tree is reconstructed through the column hierarchical relationship to obtain a complete
site map. In addition, the invention also provides a
system for automatically reconstructing a website
site map. Through the technical proposal of the invention, the site map of the website is automatically constructed, so that the crawler can collect the key column pages of the website in time and comprehensively, so as to collect more articles with fewer resources, improve the SEO friendliness of the website and bring more users to the website.