Model migration training method, device and equipment, and storage medium
A training method and model technology, applied in computational models, character and pattern recognition, instruments, etc., can solve the problems of easy overfitting and poor generalization ability of target models, so as to avoid overfitting and improve generalization. effect of ability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] figure 1 It is a flowchart of a model migration training method in the first embodiment of the present application. The embodiment of the present application is applicable to the case where the source model in the source domain is transferred to the target model in the target domain, and the target model is trained. The method is executed by a model migration training device, which is implemented by software and / or hardware, and is specifically configured in an electronic device.
[0054] Such as figure 1 A model migration training method shown includes:
[0055] S101. Use network parameters of at least two migration layers in the source model as initial parameters of associated migration layers in the target model.
[0056] Among them, the source model can be understood as a stable network model that is successfully trained through a large number of source training samples in the source domain. The target model can be understood as a model to be trained in the target domain ...
Embodiment 2
[0079] figure 2 It is a flowchart of a model migration training method in the second embodiment of the present application. The embodiment of the present application is optimized and improved on the basis of the technical solutions of the foregoing embodiments.
[0080] Further, the operation "construct an objective function based on the distance between the training parameters associated with the at least two migration layers and the initial parameters" is refined into "according to the weights of the at least two migration layers, and The distance between the training parameters associated with the at least two migration layers and the initial parameters, construct an objective function" to improve the objective function construction mechanism.
[0081] Such as figure 2 A model migration training method shown includes:
[0082] S201: Use network parameters of at least two migration layers in the source model as initial parameters of associated migration layers in the target model...
Embodiment 3
[0096] Figure 3A It is a flowchart of a model migration training method in the third embodiment of the present application. The embodiment of the present application provides a preferred implementation on the basis of the technical solutions of the foregoing embodiments.
[0097] Such as Figure 3A A model migration training method shown includes:
[0098] S301. Use the network parameters of each migration layer in the source model as the initial parameters of the corresponding migration layer in the target model. Among them, the migration layer is the image feature extraction layer.
[0099] S302. Divide the migration layer into multiple network blocks.
[0100] S303: Based on the weight function, determine the weight of the migration layer according to the sequence number of the network block to which each migration layer belongs.
[0101] Specifically, the weight of the migration layer is determined according to the following formula:
[0102] W i =softmax(N-i);
[0103] Where W i Is...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com