Supercharge Your Innovation With Domain-Expert AI Agents!

GPU based data classification method of conditional random field model

A conditional random field and data technology, used in computer parts, character and pattern recognition, instruments, etc., can solve the problems of low performance of a single processor, many execution units, and low cost performance, and improve the speed of model learning and derivation process. , to ensure the effect of accuracy

Inactive Publication Date: 2014-03-05
HUAZHONG UNIV OF SCI & TECH
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method obtains a linear speedup ratio within a certain range, but compared to the consumed computing resources, the cost performance is not high; on the other hand, the image processor (Graphic Process Units, referred to as GPU) is With the characteristics of many units, strong floating-point computing ability and low cost and power consumption, it has gradually made a difference in the field of general-purpose parallel computing, but GPU also has its own limitations and special processing: a single processor has low performance and requires a large number of computing units Improve performance in parallel at the same time; memory IO consumption has hierarchical overhead characteristics; parallel read shared memory efficiency issues, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GPU based data classification method of conditional random field model
  • GPU based data classification method of conditional random field model
  • GPU based data classification method of conditional random field model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0071] Such as figure 1 Shown, the data classification method of the parallel conditional random field model based on GPU of the present invention comprises the following steps:

[0072] (1) Read the learning data, including: the length value N of the observation data sequence X, the length value M of the feature sequence Y, all observation data-feature pairs set {(x, y)} (x is any element in X, y is any element in Y), feature transition probability array F[M][M], feature appearance probability array G[M][N], and initialize feature weight array λ[], μ[], likelihood function value lh, The likelihood function cache value lhTemp is 0;

[0073] According to a given observation sequence X, the set probability of a given observation data-feature pair can be calculated by the following formula 1:

[0074] P ( y | ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a GPU based data classification method of a conditional random field model, including the following steps: a CPU reads a training data set and generates a feature set according to matching template definition rules; the CPU initializes a correlation matrix and a vector to first calculate the initial probabilities of all features corresponding to starting nodes of an observation sequence; the CPU determines whether to perform data segmentation processing; after transmitting parameters, the GPU automatically selects a proper parallel computing method according to processing data scale and adopts the parallel computing method to calculate the maximum probability value of each observation node corresponding to each feature. After calculation, with the results being returned, the CPU determines a completion situation of data processing and final output. This method in the invention is characterized by universal use, high training speed, effective processing of large-scale sequence data and self-adaptation capability.

Description

technical field [0001] The invention relates to a data classification method, in particular to a data classification method based on a GPU-based parallel conditional random field model. Background technique [0002] Conditional Random Field (CRF) is one of the important discriminative models in machine learning. It was first used in natural language processing and gradually introduced into data mining, bioinformatics, image intelligent recognition and search engines, etc. in popular fields. It solves the label bias problem of other traditional discriminative models (such as the maximum entropy Markov model, referred to as MEMMS), and has a more accurate training prediction effect. [0003] However, with the widespread application of classification, serial CRF faces some problems. In practical applications, the number of training sets is growing rapidly, the data analysis sequence is getting longer and longer (for example, a DNA contains thousands of amino acid pairs), and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
Inventor 金海郑然章勤韩丹冯晓文
Owner HUAZHONG UNIV OF SCI & TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More