Experimental method for verifying influence of common sub-words on XLM translation model effect
A technology of translation models and experimental methods, applied in the field of natural language processing, can solve problems such as poor results, and achieve the effect of improving the performance of machine translation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0026] Embodiment 1: as Figure 1-3 As shown, verify the experimental method of the impact of the common subword on the effect of the XLM translation model, and the method includes:
[0027] Step1. Preprocess the corpus of XLM translation model pre-training;
[0028] Step2. Verify whether the performance of the XLM translation model is degraded: use the preprocessed corpus to pre-train the XLM translation model, initialize the translation model with the pre-trained model, and observe the BLEU value of the new translation model.
[0029] The Step1 preprocessing includes the following:
[0030] First obtain the common subwords and all subword frequencies in English and French subwords; then randomly separate the common subwords according to the separation ratio; then read the vocabulary of all English and French subwords and save them in the dictionary for subsequent generation Separate subword files; use the generated separated subword files to initialize the dictionary, and ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com