Multilingual text generation method and device, equipment and storage medium

A multilingual text generation technology, applied in neural learning methods, biological neural network models, natural language data processing, etc. It addresses problems such as multilingual text being mixed into monolingual text at a low proportion and the difficulty of obtaining multilingual text in volume.

Pending Publication Date: 2021-12-03
IFLYTEK CO LTD

AI Technical Summary

Problems solved by technology

In the fields of language discrimination, multilingual speech synthesis, and multilingual speech recognition, a large multilingual text corpus is needed. In real life, however, multilingual text is usually mixed into monolingual text at a relatively low proportion, so large volumes of multilingual text are difficult to obtain.


Examples


Example 1

[0071] See Figure 1, which shows a schematic flow chart of the multilingual text generation method provided by an embodiment of the present application. The method may include:

[0072] Step S101: Obtain a multilingual word list.

[0073] Wherein, the multilingual word list includes a plurality of entries, and each entry includes a word and language information of the word. An example of a multilingual word list is shown below:

[0074]
Language    Word
Chinese     I
English     plan
Chinese     Hello
English     document
…           …

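[0075] In the above table, each language-word pair is an entry: (Chinese, I) is an entry, (English, plan) is an entry, (Chinese, Hello) is an entry, (English, document) is an entry, and so on.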

[0076] The multilingual word list in this embodiment may be a word list of two languages, such as a Chinese-English word list, or a word list of three or more languages, such as a Chinese-English-French word list. It should be noted that the words in the multilingual word list are words that often appear in other language t...
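The word list described in Step S101 can be pictured as a simple data structure. The following is a minimal sketch, not the patent's implementation: the `Entry` class and the `languages` helper are hypothetical names introduced here for illustration, and the four entries are the examples listed in paragraph [0074].

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Entry:
    """One entry of the word list: a word plus its language information."""
    language: str
    word: str


# The four example entries from paragraph [0074]; a real list would be far larger.
word_list: List[Entry] = [
    Entry("Chinese", "I"),
    Entry("English", "plan"),
    Entry("Chinese", "Hello"),
    Entry("English", "document"),
]


def languages(entries: List[Entry]) -> List[str]:
    """Return the distinct languages in the list, in first-seen order."""
    seen: List[str] = []
    for entry in entries:
        if entry.language not in seen:
            seen.append(entry.language)
    return seen
```

A two-language list such as this one yields two distinct languages; a Chinese-English-French list would yield three.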

Example 2

[0089] The embodiment above mentions that multilingual text can be generated on the basis of the multilingual word list, or that several entries can be sampled from the multilingual word list to form a target word list, from which the multilingual text is then generated. The method of "generating multilingual text" is the same in both cases. This embodiment takes "generating multilingual text based on the target word list" as an example to introduce the process of generating multilingual text.

[0090] See Figure 2, which shows a schematic flow chart of generating multilingual text based on the target word list. The process may include:
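Forming a target word list by sampling can be sketched as follows. The source does not say how entries are sampled, so uniform sampling without replacement is an assumption made here; `sample_target_word_list`, the `Entry` alias, and the `seed` parameter are hypothetical names used only for illustration.

```python
import random
from typing import List, Optional, Tuple

Entry = Tuple[str, str]  # (language, word)

# Example entries taken from paragraph [0074].
word_list: List[Entry] = [
    ("Chinese", "I"),
    ("English", "plan"),
    ("Chinese", "Hello"),
    ("English", "document"),
]


def sample_target_word_list(
    entries: List[Entry], k: int, seed: Optional[int] = None
) -> List[Entry]:
    """Draw up to k distinct entries to form a target word list.

    Assumption: uniform sampling without replacement. The source only
    states that several entries are sampled from the word list.
    """
    rng = random.Random(seed)      # a seed makes the draw reproducible
    k = min(k, len(entries))       # never request more entries than exist
    return rng.sample(entries, k)


target_word_list = sample_target_word_list(word_list, k=3, seed=0)
```

Passing a fixed `seed` is only a convenience for repeatable experiments; omit it to get a different target word list on each call.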

[0091] Step S201: Determine the feature vector of each entry in the target word list and the feature vector of the target word list.

[0092] In this embodiment, the word embedding vector of each entry in the target word list can be determined (i.e., the representation vector of each entry), and then according to the word embedding vect...
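Step S201 can be illustrated with a toy sketch. The excerpt is truncated before it explains how the list-level vector is derived, so the mean pooling used below is purely an assumption, and the seeded random vectors stand in for a trained embedding table. `embed_entry`, `embed_word_list`, and `DIM` are hypothetical names.

```python
import random
from typing import List, Tuple

Entry = Tuple[str, str]  # (language, word)
DIM = 4                  # toy embedding size, chosen arbitrarily


def embed_entry(entry: Entry, dim: int = DIM) -> List[float]:
    """Toy stand-in for an embedding lookup.

    A generator seeded with the entry text yields the same vector for the
    same entry on every call; a real system would use learned embeddings.
    """
    language, word = entry
    rng = random.Random(f"{language}:{word}")
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def embed_word_list(entry_vectors: List[List[float]]) -> List[float]:
    """Pool entry vectors into one list-level vector.

    Assumption: element-wise mean. The source does not specify this.
    """
    if not entry_vectors:
        raise ValueError("target word list is empty")
    n = len(entry_vectors)
    return [sum(column) / n for column in zip(*entry_vectors)]


target_word_list: List[Entry] = [("Chinese", "I"), ("English", "plan")]
entry_vectors = [embed_entry(e) for e in target_word_list]
list_vector = embed_word_list(entry_vectors)
```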

Example 3

[0116] The first embodiment mentions that multilingual text generation based on the target word list can be realized by a pre-established multilingual text generation model. On the basis of the second embodiment above, this embodiment introduces the structure of the multilingual text generation model and the process of establishing it.

[0117] This embodiment first introduces the structure of the multilingual text generation model.

[0118] See Figure 4, which shows a possible structural diagram of the multilingual text generation model provided by this embodiment. The model may include an input encoding module 401, a plan processing module 402 and a sentence generation module 403, where:

[0119] The input of the input encoding module 401 is the target word list. After the target word list is fed into the input encoding module 401, the module encodes it and outputs the feature vector of each entry in the target word list, the ...
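The three-module structure can be sketched as a pipeline of callables. This is a structural illustration only: the excerpt does not describe what modules 402 and 403 exchange, so the interfaces below (a "plan" represented as an ordered list of entries, a string as the final output) and the toy stand-in functions are assumptions, and all names are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Entry = Tuple[str, str]  # (language, word)
Vector = List[float]


@dataclass
class Encoded:
    entry_vectors: List[Vector]  # one feature vector per entry
    list_vector: Vector          # feature vector of the whole target word list


@dataclass
class MultilingualTextGenerator:
    input_encoder: Callable[[List[Entry]], Encoded]                # module 401
    plan_processor: Callable[[List[Entry], Encoded], List[Entry]]  # module 402
    sentence_generator: Callable[[List[Entry], Encoded], str]      # module 403

    def generate(self, target_word_list: List[Entry]) -> str:
        encoded = self.input_encoder(target_word_list)
        plan = self.plan_processor(target_word_list, encoded)
        return self.sentence_generator(plan, encoded)


# Toy stand-ins so the skeleton runs; real modules would be neural networks.
def toy_encoder(entries: List[Entry]) -> Encoded:
    vectors = [[float(len(word))] for _, word in entries]
    mean = sum(v[0] for v in vectors) / len(vectors) if vectors else 0.0
    return Encoded(entry_vectors=vectors, list_vector=[mean])


def toy_planner(entries: List[Entry], encoded: Encoded) -> List[Entry]:
    return list(entries)  # assumption: the plan simply keeps the input order


def toy_sentence_generator(plan: List[Entry], encoded: Encoded) -> str:
    return " ".join(word for _, word in plan)


model = MultilingualTextGenerator(toy_encoder, toy_planner, toy_sentence_generator)
text = model.generate([("Chinese", "I"), ("English", "plan")])
```

Keeping each module behind a plain callable makes it easy to swap a toy stand-in for a trained component without touching `generate`.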

Abstract

The invention provides a multilingual text generation method, device, equipment and storage medium. The method comprises: acquiring a multilingual word list, where the multilingual word list comprises a plurality of entries and each entry comprises a word and the language information of the word; and using a pre-established multilingual text generation model to generate multilingual text on the basis of the multilingual word list, where the model takes as its generation target the production of multilingual text that matches the characteristics of real multilingual text. The method provided by the invention can generate smooth, natural multilingual text that conforms to human expression habits.

Description

Technical field

[0001] The present application relates to the technical field of text generation, and in particular to a multilingual text generation method, device, equipment and storage medium.

Background

[0002] Text generation is a difficult research direction in natural language processing, and its application scenarios are many and wide-ranging. In recent years, text generation has made great progress in information extraction, dialogue systems, novel synthesis and advertising copy generation.

[0003] With the development of globalization, in important application scenarios of text generation such as daily communication and informal information, the phenomenon of mixing different languages in text or speech has become more and more obvious. In addition, in the fields of language discrimination, multilingual speech synthesis, and multilingual speech recognition, a large multilingual text corpus is needed. However, in real life, multilingual ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/216, G06N3/04, G06N3/08
CPC: G06F40/216, G06N3/08, G06N3/047, G06N3/045
Inventors: 陈梦楠, 高丽, 祖漪清
Owner: IFLYTEK CO LTD