Parallel corpus construction method and device
A parallel corpus construction method and device, applied in the field of machine translation. The technology addresses the small size and low domain coverage of existing parallel corpora, which restrict the effect of machine translation models, and achieves the effect of expanding the scale of the parallel corpus.
Embodiment Construction
[0021] To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the embodiments and the accompanying drawings.
[0022] Existing parallel corpora are mostly obtained from parallel websites. Such corpora are small and cover few domains, which limits further improvement of machine translation models. In examining this problem, the inventors have found in practice that bilingual non-parallel corpora are large and cover many domains. However, a non-parallel corpus consists simply of monolingual corpora in two languages, with no alignment relationship between the two languages. If more parallel phrase pairs can be trained from non-parallel corpora, the scale of the parallel corpus can be expanded further. Therefore, this application provides figure 1 The constructi...