Unlock instant, AI-driven research and patent intelligence for your innovation.

Recording data processing method and device

A processing method and data technology, applied in the computer field, can solve the problem of low cluster accuracy

Pending Publication Date: 2021-09-10
AGRICULTURAL BANK OF CHINA
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the existing recording data processing technology, the clustering is only completed based on the content of the recording text, and the accuracy of the final cluster is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recording data processing method and device
  • Recording data processing method and device
  • Recording data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The technical solution in this application will be described below with reference to the accompanying drawings.

[0026] For ease of understanding, the relevant terms involved in this application are briefly described below.

[0027] 1. Cluster: a collection of multiple nodes with high similarity characteristics, that is, multiple nodes in the same cluster have high similarity characteristics.

[0028] 2. Likelihood function: In statistics, the likelihood function is a function of the parameters of the statistical model. When the output x is given, the likelihood function L(θ|x) about the parameter θ is numerically equal to the given parameter θ, and the probability of the variable X is: L(θ|x)=P(X=x|θ) .

[0029] 3. Expectation-maximization algorithm (EM): EM is an optimization algorithm that performs maximum likelihood estimation (MLE) through iteration, and is usually used as an alternative to the Newton-raphson method. Used for parameter estimation of probabilist...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Provided are a recording data processing method and device. The method comprises: based on a predefined probability generation model, obtaining model parameters and a model matrix, wherein the model parameters comprise: K, L, alpha, beta, and omega, K represents the number of topic clusters to which a plurality of records obtained in advance belong, L represents the number of transfer relationship clusters, alpha represents the prior probability of a hidden variable g, beta represents the prior probability of the hidden variable z and the probability that one record has one attribute under the condition that one record belongs to one topic cluster, omega represents the probability that one operation has a switching relationship with another operation under the condition that one operation belongs to one switching relationship cluster, the model matrix comprises an attribute matrix Y of a first recording set, a transfer relation matrix A of a second recording set, and a relation matrix R of each recording in the first recording set and the second recording set, and based on the model parameters and the model data, clustering analysis is performed on the recording to obtain a relatively accurate probability that each recording belongs to a topic cluster.

Description

technical field [0001] The present application relates to the field of computers, and more specifically, to a method and device for processing recording data. Background technique [0002] As an important means, clustering algorithm can complete the similarity node clustering of massive data. Probabilistic generative models, that is, Bayesian probability models, have very mature applications in the current algorithm field. The cluster structure is described by constructing a generative graph model, and the cluster structure is derived by defining different types of objective functions and adopting different optimization methods. In the existing recording data processing technology, the clustering is only completed based on the content of the recording text, and the accuracy of the final cluster is not high. Contents of the invention [0003] The embodiment of the present application provides a recording data processing method and device, by taking into account the text c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2321Y02D10/00
Inventor 王珍珠
Owner AGRICULTURAL BANK OF CHINA