An English word and case joint prediction method based on neural machine translation
The invention applies capitalization technology in the field of machine translation. It addresses three problems of existing approaches: additional processing steps and time overhead, failure to consider the source corpus, and interference when restoring word case information. Its effects are reduced size, fewer model parameters, and improved quality.
Embodiment Construction
[0027] 1) The parallel corpus is the English-Chinese machine translation evaluation corpus of the 2017 China Workshop on Machine Translation (CWMT). After noise reduction, deduplication, and removal of unreasonable sentences, 7 million sentence pairs remain. The training data set consists of a Chinese corpus and an English corpus, and each Chinese sentence in the Chinese corpus corresponds to one English translation sentence in the English corpus. The case of each English word is assigned to one of four categories: a) other, b) lowercase, c) first letter uppercase, d) all uppercase.
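As an illustration of the four case categories, the following is a minimal Python sketch, not the patent's own implementation. The function name `case_category` and the tag strings are hypothetical. The source does not define "other", so the sketch assumes it covers tokens without letters (digits, punctuation) and mixed-case tokens such as "iPhone"; it also assumes that a single uppercase letter such as "I" counts as "first letter uppercase".

```python
def case_category(word: str) -> str:
    """Map one token to a case tag: 'other', 'lower', 'title', or 'upper'.

    Assumptions (not stated in the source): only alphabetic characters
    are inspected, and anything that fits no other category is 'other'.
    """
    letters = [ch for ch in word if ch.isalpha()]
    if not letters:
        # No letters at all, e.g. "2017" or "," (assumed to be 'other').
        return "other"
    if all(ch.islower() for ch in letters):
        return "lower"
    if letters[0].isupper() and all(ch.islower() for ch in letters[1:]):
        # First letter uppercase; a lone uppercase letter also lands here.
        return "title"
    if all(ch.isupper() for ch in letters):
        return "upper"
    # Mixed case such as "iPhone" (assumed to be 'other').
    return "other"


# Tagging a whitespace-tokenized sentence yields one tag per word.
tags = [case_category(w) for w in "The NASA rover landed in 2021".split()]
```

Here `tags` holds one case tag for each of the six tokens. Whitespace tokenization is used only to keep the example short.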
[0028] A case label is generated for each word of the English corpus, forming an English label corpus. Each word corresponds to one case tag, so each English translation corresponds to a sequence of case tags. The entire English corpus is then converted to lowercase, the frequency of each English word in the corpus is counted, and the words are arranged in descending order of frequency ...
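The lowercasing and frequency-counting step could be sketched as follows. This is a hedged illustration under stated assumptions: the function name `frequency_sorted_words` is hypothetical, tokens are split on whitespace, and the sketch stops at the sorted word list because the source text is truncated after that point.

```python
from collections import Counter


def frequency_sorted_words(sentences):
    """Lowercase every sentence, count word frequencies, and return the
    words in descending order of frequency.

    Assumption: whitespace tokenization; the source does not specify
    the tokenizer.
    """
    counts = Counter()
    for sentence in sentences:
        counts.update(sentence.lower().split())
    # most_common() sorts by count, highest first; ties keep the order
    # in which the words were first seen.
    return [word for word, _ in counts.most_common()]


words = frequency_sorted_words(["The cat saw the dog", "the DOG ran"])
```

Because the corpus is lowercased before counting, "The" and "the" are counted as the same word, while the case information is carried by the separate tag sequence.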