Method and device for building neural network
The technology relates to neural networks and neural network models, and is applied in neural learning methods, biological neural network models, physical implementation, etc. It addresses the problems that neural network training is difficult and inefficient, and achieves easy implementation, a simple optimization process, and improved efficiency.
Examples
Embodiment 1
[0029] Referring to Figure 1, which is a flowchart of a method for constructing a neural network in an embodiment of the present invention, the method comprises:
[0030] Step 101, constructing an initial neural network, in which each of a plurality of preset specific structures is provided with a corresponding sparse scaling operator, wherein each sparse scaling operator is used to scale the output of its corresponding specific structure.
[0031] Step 102, using preset training sample data to train the weights of the initial neural network and the sparse scaling operators of the specific structures to obtain an intermediate neural network.
[0032] Step 103, deleting each specific structure in the intermediate neural network whose sparse scaling operator is zero to obtain a target neural network.
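Steps 101 to 103 can be illustrated with a minimal sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: each "specific structure" is reduced to a hypothetical one-weight branch, the training data is synthetic, and the operators are pushed toward zero with an L1 penalty applied through a soft-threshold (proximal) step, a technique this text does not specify. The names `Branch`, `forward`, `soft_threshold`, `train`, and `prune` are likewise hypothetical.

```python
import random


class Branch:
    """Hypothetical stand-in for one 'specific structure': computes w * x[i]."""

    def __init__(self, index, weight, scale=1.0):
        self.index = index    # which input this branch reads
        self.weight = weight  # ordinary trainable weight
        self.scale = scale    # sparse scaling operator on the branch output

    def output(self, x):
        return self.scale * self.weight * x[self.index]


def forward(branches, x):
    # Network output is the sum of the scaled branch outputs.
    return sum(b.output(x) for b in branches)


def soft_threshold(value, threshold):
    # Proximal step for an L1 penalty (assumed here): can yield exact zeros.
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def train(branches, samples, lr=0.1, l1=0.05, epochs=300):
    # Step 102: jointly train weights and scaling operators (full-batch).
    n = len(samples)
    for _ in range(epochs):
        grad_w = [0.0] * len(branches)
        grad_s = [0.0] * len(branches)
        for x, y in samples:
            err = forward(branches, x) - y
            for k, b in enumerate(branches):
                xi = x[b.index]
                grad_w[k] += err * b.scale * xi / n
                grad_s[k] += err * b.weight * xi / n
        for k, b in enumerate(branches):
            b.weight -= lr * grad_w[k]
            b.scale = soft_threshold(b.scale - lr * grad_s[k], lr * l1)
    return branches


def prune(branches):
    # Step 103: delete every structure whose scaling operator is zero.
    return [b for b in branches if b.scale != 0.0]


rng = random.Random(0)
# Synthetic data (assumption): only inputs 0 and 1 influence the target.
samples = []
for _ in range(50):
    x = [rng.uniform(-1.0, 1.0) for _ in range(4)]
    samples.append((x, 2.0 * x[0] - 1.5 * x[1]))

# Step 101: initial network, one scaling operator per structure.
initial = [Branch(i, rng.uniform(0.5, 1.0)) for i in range(4)]
intermediate = train(initial, samples)
target = prune(intermediate)
print(len(intermediate), "structures before pruning,", len(target), "after")
```

Because a deleted structure has a scaling operator of exactly zero, it contributed nothing to the output, so the target network produces the same output as the intermediate network on any input.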
[0033] Preferably, the aforementioned step 101 can be realized through the following steps A1 to A3:
[0034] Step A1, selecting a neural network model.
[...
Embodiment 2
[0083] Based on the same inventive concept as the method for constructing a neural network provided in Embodiment 1, Embodiment 2 of the present invention provides a device for constructing a neural network, the structure of which is shown in Figure 6 and includes:
[0084] The first construction unit 61 is configured to construct an initial neural network, wherein each of a plurality of preset specific structures in the initial neural network is provided with a corresponding sparse scaling operator, wherein each sparse scaling operator is used to scale the output of its corresponding specific structure;
[0085] The training unit 62 is configured to use preset training sample data to train the weights of the initial neural network and the sparse scaling operators of the specific structures to obtain an intermediate neural network;
[0086] The second construction unit 63 is configured to delete a specific structure in the intermediate neural network i...