Method and device for generating statistical data matrix of user behaviors

A statistical data matrix technology, applied in the computer field, that addresses the problems of inefficient generation of the statistical data matrix, slow reading of the matrix, and heavy resource consumption, thereby improving data processing efficiency, saving storage space, and enabling fast reads.

Pending Publication Date: 2020-11-27
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1

AI Technical Summary

Problems solved by technology

In practice, the number of item types is often large, so writing, storing, and executing the Hive SQL statements consumes substantial resources and seriously reduces the efficiency of generating the statistical data matrix described above.
The number of users is often also large, and the statistical data matrix is generally sparse. A matrix stored by the existing method (that is, by explicitly storing every element of the matrix) is slow to read and wastes a great deal of space.
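The storage cost behind this problem can be illustrated with a little arithmetic. The sketch below compares the number of values held by a fully stored matrix with the number held when only non-zero results are kept as (item identifier, value) pairs. The user, item, and non-zero counts are hypothetical figures chosen for illustration and do not come from the patent.

```python
def dense_cells(num_users, num_items):
    """Values stored when every element of the matrix is written out."""
    return num_users * num_items


def sparse_entries(nonzero_count):
    """Values stored when each non-zero result keeps one item id and one value."""
    return 2 * nonzero_count


# Hypothetical sizes: 1,000 users, 10,000 items, 50,000 non-zero results.
users, items, nonzero = 1_000, 10_000, 50_000

dense = dense_cells(users, items)
sparse = sparse_entries(nonzero)
ratio = dense / sparse

print(f"dense: {dense}, sparse: {sparse}, ratio: {ratio:.0f}x")
```

Under these assumed sizes, storing only the non-zero results holds about one hundredth as many values as storing every element.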

Embodiment Construction

[0030] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0031] It should be noted that, in the case of no conflict, the embodiments of the present invention and the technical features in the embodiments can be combined with each other.

[0032] Figure 1 is a schematic diagram of the main steps of a method for generating a statistical data matrix of user behavior in an embodiment of the present invention.

[0033] As shown in Figure 1, ...

Abstract

The invention discloses a method and device for generating a statistical data matrix of user behaviors, and relates to the technical field of computers. One specific embodiment of the method comprises the following steps: acquiring multiple pieces of single behavior data of multiple users for multiple items within a preset time interval; determining, from the single behavior data, each user's non-zero statistical results for the target dimension data of each item, together with key-value pair data formed from each non-zero statistical result and its item identifier; storing the key-value pair data of each user in its own set, with the item identifiers in a first subset and the non-zero statistical results in a second subset; and converting the item identifiers in each set into column numbers or row numbers of the matrix, so that each set becomes a row vector or a column vector from which the statistical data matrix is formed. According to this embodiment, statistical results of user behavior data can be obtained quickly without using a structured query language, and the resulting statistical data matrix does not waste storage space.
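The steps in the abstract can be sketched in a few lines of Python. This is a minimal single-machine illustration, not the patented implementation: the function names, the `(user_id, item_id, amount)` event format, the use of a sum as the statistic, and the choice of sorted item identifiers as column order are all assumptions made for the example.

```python
from collections import defaultdict


def aggregate(events):
    """Sum single-behavior records into per-user {item_id: statistic} maps."""
    totals = defaultdict(lambda: defaultdict(int))
    for user_id, item_id, amount in events:
        totals[user_id][item_id] += amount
    return totals


def to_user_sets(totals):
    """Keep non-zero results; store item ids and values in two parallel lists."""
    user_sets = {}
    for user_id, per_item in totals.items():
        pairs = sorted((i, v) for i, v in per_item.items() if v != 0)
        user_sets[user_id] = ([i for i, _ in pairs], [v for _, v in pairs])
    return user_sets


def to_sparse_rows(user_sets):
    """Replace item ids with column numbers so each set becomes a sparse row."""
    all_items = sorted({i for ids, _ in user_sets.values() for i in ids})
    column_of = {item_id: col for col, item_id in enumerate(all_items)}
    rows = {
        user_id: ([column_of[i] for i in ids], values)
        for user_id, (ids, values) in user_sets.items()
    }
    return rows, column_of


# Hypothetical click events: (user_id, item_id, amount).
events = [
    ("u1", "sku_b", 1),
    ("u1", "sku_a", 1),
    ("u1", "sku_b", 1),
    ("u2", "sku_c", 1),
]
rows, column_of = to_sparse_rows(to_user_sets(aggregate(events)))
print(rows)
print(column_of)
```

Each user ends up with one list of column numbers and one list of values, so zero elements are never stored. A real deployment would presumably run the aggregation on a distributed engine rather than in a single process.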

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a method and device for generating a statistical data matrix of user behavior.

Background technique

[0002] In the field of computer technology, user behavior often needs to be analyzed. Such behavior may include a user browsing, searching for, or purchasing items. To show more intuitively how the behavior of different users differs across items, a statistical data matrix of user behavior can be established. Each row of the matrix can correspond to a user, each column can correspond to an item, and each element represents the statistical result of one user's behavior data for one item within a preset time interval.

[0003] In existing distributed systems that use Hive (a data warehouse tool applied in distributed systems) as the query engine, the above statistics are generally obt...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/2458, G06F16/22
CPC: G06F16/2282, G06F16/2462, G06F16/2477
Inventor: 李慧萍
Owner: BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD