Text processing method and device, electronic equipment and readable storage medium

A text processing and text technology, applied in digital data processing, natural language data processing, instruments, etc., can solve problems such as mixed corpus, affecting text accuracy, and difficulty in ensuring model accuracy, so as to improve quality and Completeness, Increased Precision, Effects of Increased Accuracy

Pending Publication Date: 2020-07-03
INDUSTRIAL AND COMMERCIAL BANK OF CHINA
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, considering the professionalism of the field and the extensiveness of parallel corpus acquisition, the corpus used for training the model is often mixed, and it is difficult to guarantee the accuracy of the trained model, which affects the accuracy of the translated text to a certain extent.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and device, electronic equipment and readable storage medium
  • Text processing method and device, electronic equipment and readable storage medium
  • Text processing method and device, electronic equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Hereinafter, embodiments of the present disclosure will be described with reference to the drawings. It should be understood, however, that these descriptions are exemplary only, and are not intended to limit the scope of the present disclosure. In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the present disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concepts of the present disclosure.

[0030] The terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting of the present disclosure. The terms "comprising", "comprising", etc. used herein indicate the presence of stated features,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text processing method. The method comprises the steps of obtaining a mixed parallel corpus and a target parallel corpus; taking the mixed parallel corpus and the target parallel corpus as training samples, and training a predetermined model to obtain a first translation model; and taking the to-be-processed text as the input of the first translation model to obtain a translation text for the to-be-processed text. Wherein the target parallel corpus is a parallel corpus for the target field, and the target parallel corpus comprises a parallel corpus screened by a secondtranslation model. The second translation model is obtained by training by taking the mixed parallel corpus as a training sample. The invention further provides a text processing device, electronic equipment and a computer readable storage medium.

Description

technical field [0001] The present disclosure relates to the field of text translation and system monitoring, and more specifically, to a text processing method, device, electronic equipment and readable storage medium. Background technique [0002] With the development of electronic technology, language processing based on machine learning models has developed rapidly in order to improve processing efficiency and reduce labor costs. Among them, machine translation is an important branch of this language processing. [0003] In the process of realizing the disclosed concept, the inventors have found at least the following technical problems in the prior art: machine translation can be applied to various professional fields in addition to daily spoken language translation. When applied to various professional fields, a large amount of parallel corpus is often required as prior knowledge to train machine models. However, considering the field specialization and the extensive...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/58G06F40/211G06F40/289G06F40/30
Inventor 徐晨灿袁宁宫晨石建勋
Owner INDUSTRIAL AND COMMERCIAL BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products