Product clustering method and apparatus

A clustering method and product technology, applied in the field of product clustering methods and devices, can solve problems such as increasing the operating load of the system

Active Publication Date: 2016-02-17
ALIBABA GRP HLDG LTD
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The embodiment of the present application provides a product clustering method and device to solve the problem existing in the prior art in order to achieve accurate clustering of massive products, thereby increasing the operating load of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Product clustering method and apparatus
  • Product clustering method and apparatus
  • Product clustering method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to solve the problem existing in the prior art in order to realize the accurate clustering of massive products, thereby increasing the operating load of the system. In the embodiment of the present application, a top similarity network is constructed according to the similarity between products, and then a heuristic algorithm is used to cluster products based on the top similarity network.

[0052] Of course, the implementation of the technical solution of the present application relies on the analysis of a large amount of user behavior data, thus requiring a parallel computing platform like hadoop. On the other hand, the technical solution of the present application is not only applicable to product clustering, but also applicable to other clustering scenarios such as user clustering and store clustering, which will not be repeated here.

[0053] The following only takes the product as an example, and describes the preferred implementation modes of the present...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application relates to electronic commerce technology, and particularly relates to a product clustering method and apparatus. The method comprises: based on similarity between products, sorting out products with similarity meeting a preset condition; determining cluster center products among the sorted out products based on a preset principle; and based on each cluster center product, classifying each non-cluster center product to a same cluster as a cluster center product with highest similarity to the non-cluster center product. The above method is not limited by the number of clusters, the product similarity needs to be calculated only once to construct a similarity network and achieve product clustering progressively based on a heuristic algorithm. In this way, not only the accuracy of the clustering result can be significantly improved, but also time complexity and spatial complexity of the product clustering can be greatly reduced, thus preventing a heavy operation load from being caused to a system and controlling the implementation cost within an ideal range. The method and apparatus are particularly applicable to large-scale product clustering.

Description

technical field [0001] The present application relates to e-commerce technology, in particular to a product clustering method and device. Background technique [0002] With the development of e-commerce technology, the number of products displayed on e-commerce websites is increasing day by day, and the calculation complexity of the similarity between products is very high. Usually, e-commerce websites have hundreds of millions of users, and user behaviors are also very rich. However, due to the huge amount of product data, the user's operation behavior on the product (such as clicking, purchasing, collecting, etc.) is very sparse. Due to the sparsity of user-to-product data, when calculating user preferences and calculating user similarity and other parameters, the coverage rate is often not high, and it also affects accuracy. [0003] In view of the above problems, under the existing technology, products that are sufficiently similar are usually aggregated into a cluster...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06Q30/00
Inventor 陈海凯
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products