Training method and device of deep nonlinear principal component analysis network and computer readable storage medium
A non-linear principal component and training method technology, applied in the field of data processing, can solve the problem of high efficiency of data extraction that cannot be complicated, and achieve the effect of strong feature extraction ability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0031] figure 1 It shows the implementation flow of the training method of the deep nonlinear principal component analysis network provided by Embodiment 1 of the present invention. see figure 1 As shown, the implementation process of this method is detailed as follows:
[0032] Step S101, obtaining a deep nonlinear principal component analysis network.
[0033]In this embodiment, the deep nonlinear principal component analysis network acquired in step S101 is an initial deep nonlinear principal component analysis network, and the parameter values in this network are all initial values, and its parameters can be adjusted through training later to obtain the final The deep nonlinear principal component analysis network, after training, the deep nonlinear principal component analysis network can extract highly effective features for complex data.
[0034] Step S102, using the deep non-linear principal component analysis network to perform forward propagation, encode the inp...
Embodiment 2
[0084] Compared with the previous embodiment, the training method of the deep nonlinear principal component analysis network provided by this embodiment uses the deep nonlinear principal component analysis network to perform forward propagation, encodes the input data layer by layer, and solves Before outputting the reconstruction error between the input value of each layer and its estimated value, it also includes:
[0085] Orthogonal constraints are imposed on the k weights in the k-layer deep nonlinear PCA network, and the objective function after the orthogonal constraints is:
[0086]
[0087] Among them, λ is a parameter that controls the strength of orthogonal constraints. If the weight matrix W is limited i Each vector in is a unit vector, then Λ i =I, the initial weight matrix W of the deep nonlinear principal component analysis network under this constraint i A set of basis vectors under the new vector space is formed.
[0088] The goal of deep nonlinear princi...
Embodiment 3
[0094] Compared with Embodiment 1, the training method of the deep nonlinear principal component analysis network provided by this embodiment uses the deep nonlinear principal component analysis network to perform forward propagation, encodes the input data layer by layer, and solves The reconstruction error between the input value of each layer and its estimated value also includes before:
[0095] Sparse constraints are added to the activation values of the hidden layers in the deep nonlinear principal component analysis network, and the objective function of the activation value sparse constraints is:
[0096]
[0097] Among them, β is the coefficient of the activation value sparse term, h(Z i ) is the sparse regularization item of the activation value of the i-th layer, Z i is the activation value of the i-th layer.
[0098] In the deep neural network, we can think that the activation value close to "1" is "active", and the activation value close to "0" is "inactive...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


