Language model-oriented double-unit search space structure search method
A search space and language model technology, applied in the field of artificial intelligence, can solve the problems of gradient disappearance, interruption of sequence semantic information, difficult back-propagation of gradients at the far end of the sequence, etc., to achieve the effect of increasing continuity and expanding search space.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0027] Embodiment 1: as Figure 1-Figure 5 As shown, the structure search method of language model-oriented dual-unit search space includes: firstly, constructing a dual-unit search space; secondly, searching on the PTB data set, and selecting the structure with the smallest loss on the verification set during the search process as the structure to be Select the unit structure; finally, enter the evaluation stage, conduct a short-term evaluation on the candidate unit structure obtained in the search stage on the language model task, and obtain the optimal unit structure
[0028] The specific implementation steps of the structure search method based on the dual-unit search space are as follows:
[0029] Step1. A dual-unit search space is proposed for the language model task, a search unit is set, and the final cyclic neural network is formed through the connection of units, and then the search space is constructed;
[0030] The dual-unit search space proposed in Step1 is to co...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com