Massive data processing, searching and recommendation methods and devices

A technology of massive data and processing methods, applied in the field of data processing, can solve the problems of ineffective data differentiation and large sparseness of original data, and achieve the effect of speeding up processing speed and processing efficiency, good differentiation, and rapid acquisition.

Inactive Publication Date: 2013-11-13
ALIBABA GRP HLDG LTD
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] This application provides a method and device for processing massive data to solve the problem that the effect of data differentiation is not obvious due to the large sparseness of the original data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Massive data processing, searching and recommendation methods and devices
  • Massive data processing, searching and recommendation methods and devices
  • Massive data processing, searching and recommendation methods and devices

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to make the above objects, features and advantages of the present application more obvious and comprehensible, the present application will be further described in detail below in conjunction with the accompanying drawings and specific implementation methods.

[0050] This application provides a method for processing massive data, which eliminates the original data assigned a value of 0 in the original matrix A, thereby reducing the sparsity of the original matrix A, and obtains the reconstructed matrix B without sparsity. The data in the above reconstruction matrix B are processed to distinguish different data. The present application fundamentally solves the problem of relatively large sparseness of original data, so that subsequent massive data can be better differentiated during processing.

[0051] refer to figure 1 , which provides a flow chart of a massive data processing method described in the embodiment of the present application.

[0052] Step 11, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a massive data processing method and a massive data processing device, and aims to solve the problem of unremarkable data discrimination effects caused by higher sparsity of original data. The method comprises the following steps of storing the massive original data into an (m*n)th-order original matrix A, wherein m and n are positive integers, and the original data is operating data for each user; when the original matrix A is subjected to singular value decomposition, distributing the original data in the original matrix A to a plurality of processing nodes for processing; reconstructing a first unitary matrix U, a first diagonal matrix S and a second unitary matrix V, which are obtained by the singular value decomposition, to obtain a corresponding reconstructed matrix B; and clustering data in the reconstructed matrix B to discriminate the data of different types. According to the method and the device, the problem of higher sparsity of the original data is radically solved, so that higher data discrimination performance during subsequent massive data processing is ensured.

Description

technical field [0001] The present application relates to data processing technology, in particular to a massive data processing method and device, a massive data-based search method and device, and a massive data-based recommendation method and device. Background technique [0002] With the rapid development of information technology in today's society, the data processed on a network platform can reach tens of millions every day, and the processing of massive data has also attracted more and more attention. [0003] One category of massive data processing methods is to distinguish different data by processing massive data, for example, clustering massive data. However, sometimes the sparsity of massive data is relatively large, which will cause the difference between the data to be not obvious after processing the massive data, and it is impossible to distinguish the differences of each data well. [0004] For example, applying massive data processing to the field of prod...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/16G06F17/30
Inventor 陈欢
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products