Model training method, device, computer equipment, and computer-readable storage medium
A model training and model technology, applied in computing, neural learning methods, biological neural network models, etc., to achieve the effect of improving generalization ability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0054] Please refer to figure 1 , figure 1 It shows a schematic block diagram of the steps of a model training method provided by the embodiment of the present application.
[0055] Such as figure 1 As shown, the first model training method provided by the embodiment of the present application can be applied to a cross-domain slot filling model (Label-aware Transfer learning for Cross-domain Slot Filling, LTCS) incorporating label-aware transfer learning, including S110 to S140.
[0056] S110: Input a preset number of training samples into the embedded coding layer of the cross-domain slot filling model to obtain the hidden information of each word segment, wherein the training samples include samples in the first domain and samples in the second domain, and each training The samples all include real BIO tags.
[0057] In this embodiment, the BIO label marks each element as "B-X", "I-X" or "O". Among them, "B-X" indicates that the segment where this element is located belo...
Embodiment 2
[0097] Please refer to figure 2 , figure 2 A schematic structural block diagram of a model training device provided by an embodiment of the present application is shown. The model training device 500 includes an obtaining module 510 , a calculating module 520 and a training module 530 .
[0098] Wherein, the obtaining module 510 is configured to input a preset number of training samples into the embedded coding layer of the cross-domain slot filling model to obtain the hidden information of each word segment, wherein the training samples include the first domain samples and Second domain samples, each training sample includes real BIO labels;
[0099] The calculation module 520 is configured to calculate the maximum average difference value between the hidden information of the first domain samples and the second domain samples having the same real BIO label based on a first preset formula;
[0100] The calculation module 520 is further configured to add the maximum avera...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



