Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

126results about How to "Improve translation performance" patented technology

Translation model establishing method and system

The invention discloses a translation model establishing method and system. The translation model establishing method comprises the following steps: respectively generating a regular alignment table, a word semantic vector table and a phrase table according to alignment information of a double-language parallel corpus, subsequently generating a source language phrase semantic vector table of a source language semantic space and a target language phrase semantic vector table of a target language semantic space by using the word semantic vector table and the phrase table, and finally training by using phrase semantic vector tables of different semantic spaces, thereby generating a translation model integrated with semantic information. The result shows that phrase semantic information can be integrated in statistic machine translation, the research shows that the relevance of words or phrases to context words or phrases can be reflected in the semantic information, and compared with a conventional translation method based on words or phrases, the translation model is relatively high in translation quality after the phrase semantic information is integrated, so that the translation property of the statistic machine translation is further improved as compared with that of the prior art.
Owner:SUZHOU UNIV

Method of translation, method for determining target information and related devices

The invention discloses a method for determining target information. The method includes the following steps that encoding is conducted on to-be-processed text information to obtain a source-end vector expression sequence; according to the source-end vector expression sequence, a source-end context vector corresponding to the first moment is obtained, wherein the source-end context vector is used for expressing to-be-processed source-end content; according to the source-end vector expression sequence and the source-end context vector, a first translation vector and / or a second translation vector are / is determined, wherein the first translation vector indicates the source-end content which is not translated in the source-end vector expression sequence within the first moment, and the second translation vector indicates the source-end content which is translated in the source-end vector expression sequence within the second moment; decoding is conducted on the first translation vector and / or a second translation vector and the source-end context vector so as to obtain the target information of the first moment. The invention further provides a method of translation and a device for determining the target information. According to the method for determining the target information, the model training difficulty of a decoder can be reduced, and the translation effect of a translation system is improved.
Owner:SHENZHEN TENCENT COMP SYST CO LTD

An ancient Chinese automatic translation method based on multi-feature fusion

ActiveCN109684648AIncreased accuracySolve the problem of unregistered wordsNatural language translationSpecial data processing applicationsWord listSentence pair
The invention discloses an ancient Chinese automatic translation method based on multi-feature fusion. The method comprises the following steps: 1) collecting a text, modern text translation data of the text, a text word list and modern Chinese monolingual corpus data; And 2) cleaning the data and constructing an ancient Chinese parallel corpus by using a sentence alignment method. And 3) carryingout word segmentation on the modern text and the ancient text by using a Chinese word segmentation tool; 4) performing topic modeling on the ancient text corpus to generate topics-Word distribution and word-Subject conditional probability distribution 5) using the modern Chinese monolingual corpus to train to obtain a modern Chinese language model; And obtaining an aligned dictionary by using ancient Chinese parallel corpora. 6) on the basis of the attention-based recurrent neural network translation model, fusing statistical machine translation characteristics such as a language model and analignment dictionary, and using an ancient Chinese parallel sentence pair and a word topic sequence training model, and 7) inputting a to-be-translated text by a user, and obtaining a modern text translation by using the model obtained by training in the step 6).
Owner:ZHEJIANG UNIV

Method and device for adapting a machine translation system based on language database to new field

The invention provides method and system for adapting a machine translation system based on a language database to a new field. The method comprises the following steps of: translating a plurality of source language sentences in the new field by using the machine translation system based on the language database which is trained in one field; selecting the source language sentences the evaluation of the translated result of which is lower than a pre-set first evaluation threshold from the plurality of source language sentences; recognizing a text fragment related to the new field from the source language sentences evaluation of the translated result of which is lower than the first evaluation threshold; and updating the machine translation system by using the plurality of source language sentences and the translated results thereof, as well as the text fragment related to the new field and a correct translated text thereof. In the invention, the machine translation system trained well outside the field trains the machine translation system through using the text fragment which is recognized in the process of repeatedly translating the text in the new field and is related to the new filed so as to continuously improve the translation performance of the new field by using the machine translation system.
Owner:KK TOSHIBA

Multi-field neural machine translation method based on self-attention mechanism

The invention discloses a multi-field neural machine translation method based on a self-attention mechanism. The invention discloses a multi-field neural machine translation method based on a self-attention mechanism. The multi-field neural machine translation method comprises the following steps: carrying out two important changes on a Transformer, wherein the first change is a self-attention mechanism based on domain perception, and the domain representation is added to a key and a value vector of the original self-attention mechanism, the weight of the attention mechanism is the degree of correlation of the query and domain aware keys, the second change is to add a domain representation learning module to learn a domain vector. The method has the beneficial effects that a domain-aware NMT model architecture is provided on the basis of a neural network architecture Transformer representing the most advanced level at present. A self-attention mechanism based on domain awareness is provided for multi-domain translation. It is known that this is a first attempt on a multi-domain NMT based on a self-attention mechanism. Meanwhile, experiments and analysis also verify that the model can significantly improve the translation effect of each field and can learn the field information of training data.
Owner:SUZHOU UNIV

Speech translation method, system and equipment fusing text semantic features

The invention belongs to the field of machine translation, particularly relates to a speech translation method, a system and equipment fusing text semantic features, and aims to solve the problem that the existing speech translation method is difficult to fuse information between different modals and cannot fully utilize data in the fields of speech recognition and machine translation, so that the translation performance is poor. The method comprises the following steps: acquiring to-be-translated source language voice data; extracting a voice feature sequence corresponding to the source language voice data; obtaining acoustic representation corresponding to each voice feature; mapping the implicit vector of the acoustic representation to a source language word list, and obtaining the probability that the speech feature sequence is recognized as a word in the source language word list at each moment through a softmax function; filtering the acoustic features, and obtaining semantic features corresponding to the filtered acoustic features through a second encoder; and based on the semantic representation, obtaining a target language translation text corresponding to the source language voice data through a decoder. According to the method, the speech translation performance is improved.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Machine translation method and device, electronic equipment and medium

The embodiment of the invention discloses a machine translation method and device, electronic equipment and a medium. The method comprises the steps that a to-be-translated source language sentence isacquired; wherein a pre-trained machine translation model is used for giving a target language translation result of the source language sentence and a category to which the source language sentencebelongs at the same time, the category comprises coarse classification and fine classification under the coarse classification, and the machine translation model is used for giving the target languagetranslation result in combination with a classification task of the source language sentence. According to the embodiment of the invention, the trouble of manually selecting the field by a user is avoided; meanwhile, a classification task and a translation task are executed; coarse classification and fine classification to which the coarse classification belongs are given in the classification task. Compared with the prior art, the method has the advantages that each field is classified in the same dimension instead of in the same dimension, so that the model is more targeted in a category classification stage, categories and fields described by source language sentences can be automatically identified, meanwhile, translation content boundaries responsible for each field are clearer, andthe translation effect is improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Multi-modal machine translation data enhancement method based on image description generation

The invention discloses a multi-modal machine translation data enhancement method based on image description generation, which comprises the following steps of: training an attention mechanism-based image description generation model by using pre-trained image coding information and corresponding image description; encoding and decoding pictures in the existing multi-modal training data by using the trained image description generation model to generate a corresponding source language image description text; translating the generated source language image description text into a target language, and constructing pseudo data; and adding the constructed pseudo data into the multi-modal training data, fusing the picture information in the multi-modal training data with the source language description information, sending the fused information into a multi-modal machine translation model, and generating a target language translation assisted by the image context information in an autoregressive manner. The diversity of the pseudo data is enriched, the performance can be improved from knowledge refinement, and compared with a common data enhancement method adopting a random replacementmode and the like, the invention has great advantages.
Owner:沈阳雅译网络技术有限公司

Translation method based on multi-modal machine translation model

The invention provides a translation method based on a multi-modal machine translation model, which comprises the following steps: obtaining a source end sentence and a corresponding translation image, and preprocessing the source end sentence and the translation image to obtain the processed source end sentence, the global feature of the translation image and the local feature of the translationimage; establishing a multi-modal machine translation model, and training the multi-modal machine translation model according to the multi-modal machine translation model, the multi-modal machine translation model comprising an encoder and a decoder, and the decoder comprising a context-guided capsule network; translating the processed to-be-translated source end sentence and the corresponding translation image based on a trained multi-modal machine translation model to generate a target end sentence corresponding to the to-be-translated source end sentence. Therefore, the context is introduced into the decoder of the multi-modal machine translation model to guide the capsule network to translate, and introduction of a large number of parameters can be avoided while rich multi-modal representation is dynamically generated, so that the performance of multi-modal machine translation is effectively improved.
Owner:XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products