Bilingual corpus resource acquisition method and bilingual corpus resource acquisition system
A technology of parallel corpus and acquisition method, which is applied in the fields of instruments, computing, and electrical digital data processing, and can solve problems such as scarcity of corpus resources.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0028] Embodiments of the present invention will be described below with reference to the drawings.
[0029] When there are not enough parallel corpus resources between the two languages, in order to obtain the translation rules between the two languages, the intermediate language can be used to merge the translation rules, so as to indirectly obtain the translation rules between the two languages. For example, two sets of translation models M1 and M2 are currently known, where:
[0030] M1 is the translation model between the first language and the intermediate language
[0031] M2 is the translation model between the intermediate language and the second language
[0032] Both sets of translation models M1 and M2 contain a certain number of translation rules. The translation model of statistical machine translation is mainly divided into four parts: first language rules, second language rules, alignment relationship information and rule probability. figure 1 Shown is a sch...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com