Deep learning network training method based on artificial intelligence
A deep learning network training method based on artificial intelligence technology, applied in the fields of deep learning network training methods, systems, equipment, and storage media. It addresses problems such as fitting problems, parameter redundancy, and the difficulty of guaranteeing the results of neural network models.
Examples
Embodiment 1
[0026] Referring to Figure 1, the deep learning network training method based on artificial intelligence provided by this embodiment of the present application comprises the following steps:
[0027] S101. Construct a neural network;
[0028] Constructing the neural network includes establishing its input layer, hidden layer, and output layer, and determining its activation function.
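The construction step can be illustrated with a minimal sketch. It is not the patent's implementation: the function name `build_network`, the layer sizes, the uniform weight initialization, and the choice of a sigmoid activation are all assumptions made for illustration.

```python
import math
import random


def sigmoid(z):
    """Assumed activation function; the source does not name one."""
    return 1.0 / (1.0 + math.exp(-z))


def build_network(layer_sizes, seed=0):
    """Build a fully connected network (hypothetical helper).

    layer_sizes lists the number of units per layer, e.g. [3, 4, 2] for
    3 inputs, one hidden layer of 4 units, and 2 outputs.
    Returns one (weights, biases) pair per non-input layer.
    """
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        # One row of weights per unit in the layer, one weight per input.
        weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_out)]
        biases = [0.0] * n_out
        layers.append((weights, biases))
    return layers


# Input layer of 3 units, hidden layer of 4 units, output layer of 2 units.
network = build_network([3, 4, 2])
```

Representing each layer as a (weights, biases) pair keeps the input, hidden, and output layers explicit while leaving the activation function as a separate, replaceable choice.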
[0029] An artificial neural network is a parallel distributed system that uses a mechanism entirely different from traditional artificial intelligence and information processing technology. It overcomes the shortcomings of traditional, logic-symbol-based artificial intelligence in handling intuition and unstructured information, and it is adaptive, self-organizing, and capable of real-time learning.
[0030] The concept of deep learning comes from artificial neural networks, and the multi-layer perceptron with multiple hidden layers is a deep ...
Embodiment 2
[0069] Referring to Figure 2, in some embodiments of the present invention, the training set includes question items and the answer items corresponding to them. In step S103, the neural network is trained on the training set as follows:
[0070] S201. Input the training set data into the neural network, which obtains an output value through forward propagation;
[0071] Forward propagation is the process of computing each layer's output from its weights and biases, layer by layer from the input layer through the hidden layers, to finally obtain the output value ŷ.
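A minimal sketch of forward propagation follows. The sigmoid activation, the function name `forward`, and the hand-picked weights are assumptions for illustration; the source does not specify them.

```python
import math


def sigmoid(z):
    """Assumed activation function; the source does not name one."""
    return 1.0 / (1.0 + math.exp(-z))


def forward(layers, x):
    """Propagate input x through each (weights, biases) layer in order."""
    activations = list(x)
    for weights, biases in layers:
        # Each unit: weighted sum of the previous layer's outputs plus a
        # bias, passed through the activation function.
        activations = [
            sigmoid(sum(w * a for w, a in zip(row, activations)) + b)
            for row, b in zip(weights, biases)
        ]
    return activations


# Hypothetical two-layer network with hand-picked parameters.
layers = [
    ([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0]),  # hidden layer: 2 inputs, 2 units
    ([[1.0, -1.0]], [0.0]),                  # output layer: 2 units, 1 output
]
y_hat = forward(layers, [1.0, 2.0])
```

Here `y_hat` plays the role of the output value: it is the list of activations produced by the final layer.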
[0072] S202. Compute the loss function from the output value and the answer item;
[0073] The loss is calculated from the difference between the output value ŷ and the answer item y (the true value). For the loss function, ...
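The source text is cut off before it names a loss function. As one common possibility, and purely as an assumption, the sketch below uses mean squared error to turn the difference between the output value and the answer item into a single number; the name `mse_loss` is hypothetical.

```python
def mse_loss(y_hat, y):
    """Mean squared error between predictions y_hat and true values y.

    Assumed loss for illustration; the patent text may use a different one.
    """
    if len(y_hat) != len(y):
        raise ValueError("y_hat and y must have the same length")
    return sum((p - t) ** 2 for p, t in zip(y_hat, y)) / len(y)


# Two outputs: the first is off by 0.5, the second is exact.
loss = mse_loss([0.5, 1.0], [1.0, 1.0])
```

A loss of zero would mean the output matches the answer item exactly; larger values indicate a larger discrepancy.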
Embodiment 3
[0080] In some embodiments of the present invention, step S105 further includes, before adjusting the parameters of the selected sub-neural network model according to the verification result: judging from the verification result whether the sub-neural network model is overfitting or underfitting.
[0081] Before the parameters of the sub-neural network model are adjusted, it is necessary to judge whether the model is underfitting or overfitting, so that the model can be optimized accordingly.
[0082] For example, the simplest way to judge the current model is to compare its error on the training set with its error on the cross-validation set. When the cross-validation error is close to the training set error and the training set error is large, the current model is more likely to be underfitting; when the cross-validation error is much larger than the training set error and the training set error is small, the current model...
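The comparison described above can be sketched as a small decision rule. The function name `diagnose_fit` and the thresholds `high_error` and `gap` are hypothetical; the source gives no numeric criteria. The source is also truncated before it states the second conclusion, so the sketch assumes the standard reading that a small training error with a much larger cross-validation error indicates overfitting.

```python
def diagnose_fit(train_error, val_error, high_error=0.2, gap=0.1):
    """Classify a model from its training and cross-validation errors.

    high_error: assumed threshold above which training error counts as large.
    gap: assumed threshold above which the two errors count as far apart.
    """
    errors_close = abs(val_error - train_error) < gap
    if train_error >= high_error and errors_close:
        # Large training error, similar validation error.
        return "underfitting"
    if train_error < high_error and (val_error - train_error) >= gap:
        # Small training error, much larger validation error (assumed reading).
        return "overfitting"
    return "inconclusive"


print(diagnose_fit(0.30, 0.32))
print(diagnose_fit(0.02, 0.25))
```

In practice the thresholds would depend on the task and the error metric, so they are exposed as parameters rather than fixed.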