Webpage training method and system and webpage prediction method and system

A training method and webpage technology, applied in the Internet field, can solve the problems of huge category system and difficult to overcome the heterogeneity of websites, and achieve the effect of solving the problem of sparsity

Active Publication Date: 2014-07-09
ALIBABA GRP HLDG LTD
View PDF4 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] The main purpose of this application is to provide a webpage training scheme and a webpage prediction scheme to solve the problems of website heterogeneity, huge category system and data sparsity that are difficult to overcome in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Webpage training method and system and webpage prediction method and system
  • Webpage training method and system and webpage prediction method and system
  • Webpage training method and system and webpage prediction method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The main idea of ​​this application is that this application can well solve data sparsity and category system heterogeneity through unified processing of users' browsing / searching behavior on the Internet, general data interface, and classification algorithm with automatic adaptation capability. The three important problems of nature and the number of categories are too large, and provide services for many websites at the same time in a unified process.

[0048] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0049] The intentions of users’ browsing and searching behaviors on the Internet can be commercial or non-commercial, and commercial intentions can be further classified according to the specific commodity category system of a specific website.

[0050] Recognition of a user's...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a webpage training method and system and a webpage prediction method and system. The webpage training method comprises obtaining a prior probability table of classified keywords according to existing data which are associated with the classified keywords; preprocessing a webpage to be trained to obtain a webpage text to be trained; extracting features in the webpage text to be trained according to the prior probability table to obtain an association relation feature vector representation F1 between the webpage to be trained and a specified category; performing model training on the association relation feature vector representation F1 to obtain a classification result of the webpage to be trained. According to the webpage training method and system and the webpage prediction method and system, category systems which are strong in heterogeneity can be simultaneously processed, the large category systems can be processed through less training data, and the problem of data sparseness is largely solved due to the fact that the browsing and search behavior of a user on the whole Internet not just a website is collected.

Description

technical field [0001] The present application relates to the field of the Internet, and in particular to a classification and prediction of Internet access behaviors of users. Background technique [0002] With the continuous popularization of computer technology, modern society has already relied heavily on the convenience brought by information technology. As computer and network technology become more and more efficient, safe and reliable, more and more wholesalers, retailers and consumers choose to conduct commodity transactions on the Internet. Specific websites are becoming the most commercially valuable service providers on the Internet. [0003] Users can browse, search, compare prices, purchase, pay, and evaluate a series of actions on a specific website to purchase commodities that meet their business intentions. [0004] At the same time, the scale and number of specific websites are also growing. For example, Taobao, Tmall, JD.com, Amazon, Dangdang, and numer...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/9535
Inventor 陈俊波薛贵荣李玉龙严孝伟李华康韩定一
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products