WeChat public platform-based Chinese-Mongolian corpus crowdsourcing construction method
A WeChat public platform and construction method technology, applied in the field of corpus resource construction, can solve the problems of large investment and high cost of oral speech, and achieve the effects of simple interaction, mitigation of adverse effects, and good user experience
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0032] The method in the embodiment of the present invention will be described in detail and completely below in conjunction with the accompanying drawings.
[0033] The Chinese-Mongolian corpus crowdsourcing construction method based on the WeChat public platform used in this embodiment, such as figure 1 shown. The specific steps are:
[0034] The corpus in the embodiment of the present invention includes text corpus and speech corpus, including Chinese-Mongolian bilingual alignment corpus and monolingual text alignment corpus in the fields of machine translation and natural language processing.
[0035] Step A. According to definition 1 and definition 2, the original text is preprocessed, wherein, the specific process of preprocessing the original corpus varies with the translation direction, and the purpose is to standardize the corpus, and this embodiment does not Limit the source of the corpus, segment it, and delete meaningless data. Table 1 is an example of the origi...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com