BPE encoding method and system based on Chinese subword unit, and machine translation system
A coding method and coding system technology, applied in the field of computer software, can solve the problems of low readability of translated text and degradation of translation quality.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0019] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
[0020] The invention improves the existing BPE encoding method. A better and more theoretical method for generating Chinese BPE codes is provided, so that Chinese BPE codes can solve the problem of Chinese unregistered words. While making full use of Chinese word information, it avoids the shortcomings caused by the traditional BPE encoding method. Due to the existence of Chinese characters constructed by Wubi typing, Chinese characters can be easily converted into English alphabets whose Chinese root corresponds to Wubi typing, thus solving the problem of Chinese splitting and effectively reducing the occurrence of Chines...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



