Language model training method and device and computer equipment
A language model and training method technology, applied in computing, neural learning methods, biological neural network models, etc., can solve problems such as difficult migration of vertical fields, difficult task data labeling, and difficulty in ensuring application scenarios, and achieve better recognition results. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0045] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.
[0046] refer to figure 1 , a method for training a language model in this embodiment, comprising:
[0047]S1: Input the modified MLM task and the modified NSP task into the first Bert model for training, and obtain the first model parameters corresponding to the first Bert model;
[0048] S2: Apply the first model parameters to the second Bert model, and train the second Bert model through the modified MLM task and the modified NSP task, wherein the second Bert model is compared with the Describe the first Bert model, compress the parameter quantity of FFN layer, expand ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com