A Web Page Ranking Method Based on Random Forest Algorithm

A ranking method based on the random forest algorithm, applied in the field of web page ranking. It addresses unsatisfactory search results and a poor search experience, and aims to deliver better information, stronger targeting, and more accurate search.
CN108182186B · Active · Publication Date: 2020-10-02 · GUANGDONG KINGPOINT DATA SCI & TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
GUANGDONG KINGPOINT DATA SCI & TECH CO LTD
Publication Date
2020-10-02


Abstract

The invention provides a webpage ranking method based on a random forest algorithm. The method includes the steps of: obtaining the keywords and keyword candidate words corresponding to the search webpages; calculating the word frequencies and weights of the keywords or keyword candidate words corresponding to the search webpages; calculating the PR values of the quality-related indexes of the search webpages; calculating the pivot values and weight values of the search webpages; calculating the relevance between the recently browsed webpages and the search webpages, and the product of the TF-IDF values of the keywords and keyword candidate words of the recently browsed webpages; determining whether an output index is larger than a set threshold, where the output index is the product of the number of times the user browsed the search webpages for longer than the stipulated access time and a certain function of the staying time on the webpages meeting that condition; establishing a random forest model and recording the corresponding result; and calculating the final search webpage scores and ranking the webpages accordingly. Compared with the prior art, the random forest method improves on the traditional HITS algorithm to a certain extent, improves the user's service experience, and returns better and more accurate information.
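Two of the quantities named in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the abstract does not give the TF-IDF variant, so the common form tf × log(N / df) is assumed, and the "certain function" of staying time is not specified, so `math.log1p` of the mean qualifying stay is used as a placeholder. The function names `tf_idf`, `dwell_output_index`, and `exceeds_threshold` are hypothetical.

```python
import math
from statistics import mean


def tf_idf(term, doc_tokens, corpus):
    """TF-IDF of `term` in one tokenized page, relative to a corpus of pages.

    Assumed variant: tf = count / page length, idf = log(N / df).
    """
    if not doc_tokens:
        return 0.0
    tf = doc_tokens.count(term) / len(doc_tokens)
    # Number of pages in the corpus that contain the term.
    df = sum(1 for tokens in corpus if term in tokens)
    if df == 0:
        return 0.0
    return tf * math.log(len(corpus) / df)


def dwell_output_index(dwell_seconds, min_seconds, stay_fn=math.log1p):
    """Output index = (number of visits longer than `min_seconds`)
    multiplied by a function of the staying time of those visits.

    `stay_fn` is an assumption: the source only says "a certain function".
    Here it is applied to the mean staying time of the qualifying visits.
    """
    qualifying = [t for t in dwell_seconds if t > min_seconds]
    if not qualifying:
        return 0.0
    return len(qualifying) * stay_fn(mean(qualifying))


def exceeds_threshold(index, threshold):
    """True when the output index is larger than the set threshold."""
    return index > threshold
```

For example, with staying times of 5, 40, 90, and 12 seconds and a stipulated access time of 30 seconds, two visits qualify, so the index is twice the assumed function of their mean staying time of 65 seconds. How these values enter the random forest model as features is not described in the abstract and is left out of the sketch.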

Description

Technical field

[0001] The invention relates to the technical field of webpage sorting, in particular to a method for sorting webpages based on a random forest algorithm.

Background technique

[0002] With the rapid development of computer technology, people have more and faster ways to obtain information, but the explosive growth of information has made it harder to find the right information accurately. How to deliver the information users want faster and better has therefore become very important. Search engines such as Baidu and Google were created to make it easier for people to quickly and accurately find what they need in the vast ocean of information. An excellent search engine should provide users with the most important and valuable webpage information they need and rank it first, and its service should be simple and user-friendly, so that users can search and, in a short period of time, get satisfactory relevant searc...

Claims
