Language model training method and system in self-reconstruction mode and computer readable medium
A language model and training method technology, applied in natural language data processing, computing, neural learning methods, etc., can solve the problems of low prediction accuracy and high cost of language model, reduce the number of model parameters, reduce model size, speed up The effect of calculating speed
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0042] In order to make the objectives, technical solutions and advantages of the present invention clearer, the following further describes the present invention in detail with reference to the accompanying drawings and implementation examples. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
[0043] See figure 1 , The first embodiment of the present invention provides a language model training method in a self-reconstruction manner, which includes the following steps:
[0044] Step S1: Extract at least one sentence to be trained from the pre-training text and segment it into a single word sequence, and map the corresponding single subsequence into a text matrix through position coding;
[0045] Step S2: Combine the transformer model and the self-attention mechanism to establish a neural network structure;
[0046] Step S3: The text matrix is used as the input sample of th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap