Text information translation method, device and terminal equipment

A text information translation method, applied in natural language translation and related fields. It addresses the problem of low translation accuracy, gives the translation result strong theme-description characteristics, and improves translation accuracy.

Active Publication Date: 2022-04-12
INST OF SCI & TECHN INFORMATION OF CHINA

AI Technical Summary

Problems solved by technology

Although existing machine translation methods have improved translation quality to some extent, translation accuracy remains low for certain texts, such as foreign-language scientific and technological texts.



Examples


Embodiment 1

[0032] This embodiment of the present application provides a method for translating text information which, as shown in Figure 1, includes:

[0033] Step S100, analyze the text information to be translated, and determine the subject text and layout information of the text information to be translated.

[0034] Specifically, the text to be translated in this embodiment can be a thesis text or a patent text, and it can also be a text recorded in another form. Here, thesis texts and patent texts mainly refer to scientific and technological texts that record scientific research activities and research methods. That is, the text to be translated in this embodiment mainly refers to the text of scientific and technological papers or patents. It can be saved in PDF format, Word format, or another existing storage format such as .txt; this application does not limit the format.

[0035] Further, in this embodiment, the text infor...

Embodiment 2

[0044] This embodiment of the present application provides another possible implementation. It builds on the first embodiment and additionally includes the method described in this second embodiment, wherein,

[0045] Step S100 includes steps S1001, S1002, S1003, and S1004 (none of which is marked in the figure), wherein,

[0046] Step S1001: Determine the full-text characters of the text information to be translated and the position information of each character, and divide the full-text characters into words and combine them into lines according to the position information of each character to obtain corresponding line fragments.
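Step S1001 can be illustrated with a minimal sketch. The source does not specify how characters are grouped, so everything here is an assumption made for illustration: the `Char` record, the `group_into_lines` helper, and the `y_tolerance` and `word_gap` parameters are hypothetical names, and the grouping rule (characters with nearly equal vertical position share a line; a horizontal gap larger than `word_gap` starts a new word) is one plausible reading, not the applicant's stated procedure.

```python
from dataclasses import dataclass


@dataclass
class Char:
    text: str     # the character itself
    x: float      # left edge of the character box (assumed unit: points)
    y: float      # vertical position of the character on the page
    width: float  # width of the character box


def group_into_lines(chars, y_tolerance=2.0, word_gap=1.5):
    """Group positioned characters into words, then into line fragments.

    Hypothetical helper: both thresholds are illustrative assumptions.
    """
    # 1. Bucket characters into rows by vertical position.
    rows = []
    for ch in sorted(chars, key=lambda c: c.y):
        if rows and abs(ch.y - rows[-1][0].y) <= y_tolerance:
            rows[-1].append(ch)
        else:
            rows.append([ch])

    # 2. Within each row, order left to right and split words on large gaps.
    fragments = []
    for row in rows:
        row.sort(key=lambda c: c.x)
        words, current = [], row[0].text
        for prev, ch in zip(row, row[1:]):
            if ch.x - (prev.x + prev.width) > word_gap:
                words.append(current)
                current = ch.text
            else:
                current += ch.text
        words.append(current)
        fragments.append(" ".join(words))
    return fragments


sample = [
    Char("H", 0, 10, 5), Char("i", 5, 10, 5),
    Char("y", 15, 10.5, 5), Char("o", 20, 10.5, 5),
    Char("o", 0, 30, 5), Char("k", 5, 30, 5),
]
print(group_into_lines(sample))
```

A real parser would take character boxes from a PDF extraction library and would likely need per-font thresholds; the fixed tolerances above only keep the sketch short.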

[0047] Step S1002: Determine the number of line fragments whose length difference is smaller than a preset length threshold.

[0048] Step S1003: Merge the line fragments into corresponding paragraphs according to the topological structure of the line fragments, and record t...

Embodiment 3

[0057] This embodiment of the present application provides another possible implementation. It builds on the first and second embodiments and additionally includes the method described in this third embodiment, wherein,

[0058] Step S200 includes steps S2001 and S2002 (neither of which is marked in the figure), wherein,

[0059] Step S2001: According to a pre-established document content organization framework template, divide the subject text and layout information into content modules to obtain multiple subject frames of the text information to be translated.

[0060] Step S2002: Based on preset subject element expression patterns and the multiple subject frames, determine at least one subject element included in each subject frame through regular-expression pattern matching.
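The matching in step S2002 can be sketched as follows. The source does not disclose the actual expression patterns, so the frame name `abstract`, the element names `purpose` and `method`, the regular expressions, and the `find_elements` helper are all hypothetical and chosen only to show the mechanism: each subject (theme) frame has its own set of patterns, and every match inside the frame text is recorded as an instance of the corresponding element.

```python
import re

# Hypothetical expression patterns, keyed by frame name and then element name.
# Real patterns would be designed per document type and per language.
ELEMENT_PATTERNS = {
    "abstract": {
        "purpose": re.compile(
            r"\b(?:aims? to|in order to|the purpose of)\b[^.]*\.", re.I
        ),
        "method": re.compile(
            r"\b(?:we propose|is proposed|by using|based on)\b[^.]*\.", re.I
        ),
    },
}


def find_elements(frame_name, frame_text):
    """Return {element name: [matched spans]} for one frame (illustrative)."""
    found = {}
    for element, pattern in ELEMENT_PATTERNS.get(frame_name, {}).items():
        matches = [m.group(0).strip() for m in pattern.finditer(frame_text)]
        if matches:
            found[element] = matches
    return found


text = (
    "This paper aims to improve translation accuracy. "
    "We propose a frame-based method."
)
print(find_elements("abstract", text))
```

A frame with no registered patterns yields an empty result, which matches the case where only subject frames, and no elements, are determined.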

[0061] Specifically, when step S200 only determines multiple subject frames of the text information to be translated based on the subject text and layout information, step S200 only needs to...


Abstract

This application relates to the field of natural language processing and discloses a text information translation method, device, terminal equipment, and computer-readable storage medium. The text information translation method includes: analyzing the text information to be translated and determining its subject text and format information; based on the subject text and format information, determining multiple subject frames of the text information to be translated, or at least one subject element in each subject frame; and translating the subject text frame by frame or element by element, using trained translation models that correspond to each subject frame or each subject element respectively. The method not only extracts the content of the text to be translated at a finer granularity, but also translates each subject frame or subject element in a targeted way, so that the translation result has a clear theme and strong theme-description characteristics, which improves translation accuracy.
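The dispatch described in the abstract, one trained model per subject frame, can be sketched under stated assumptions. The `translate_by_frame` function, the `Translator` type, and the `fallback` model for frames without a dedicated model are hypothetical; the source does not say what happens when no model exists for a frame. The stand-in "models" below only tag their input so that the routing is visible; they do not translate anything.

```python
from typing import Callable, Dict, List, Tuple

# A translator is any callable mapping source text to target text.
Translator = Callable[[str], str]


def translate_by_frame(
    frames: List[Tuple[str, str]],
    models: Dict[str, Translator],
    fallback: Translator,
) -> List[Tuple[str, str]]:
    """Route each (frame name, text) pair to the model registered for it.

    The fallback model is an assumption added for this sketch.
    """
    translated = []
    for name, text in frames:
        model = models.get(name, fallback)
        translated.append((name, model(text)))
    return translated


# Stand-in models that only label their input (not real translation models).
models = {"abstract": lambda s: "[abstract-model] " + s}
fallback = lambda s: "[general-model] " + s

frames = [("abstract", "text A"), ("claims", "text B")]
print(translate_by_frame(frames, models, fallback))
```

The same routing would apply at element granularity by keying `models` on (frame, element) pairs instead of frame names.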

Description

Technical field

[0001] The present application relates to the technical field of natural language processing, and in particular to a text information translation method, device, terminal equipment, and computer-readable storage medium.

Background

[0002] A text is a record of events in written language. Texts can be divided into scientific and technological texts, documentary texts, and narrative texts. Among them, scientific and technological texts are an important carrier for recording scientific research activities and research methods, and are the main literature from which personnel acquire scientific and technological experience and learn about cutting-edge technologies in the industry. At present, a large number of scientific and technological texts are written in English, Japanese, German, French, and Chinese. Faced with a large number of scientific and technological text resources, it is becoming more and more diff...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/58, G06F40/55
CPC: G06F40/55, G06F40/58
Inventors: 石崇德, 何彦青, 许德山
Owner INST OF SCI & TECHN INFORMATION OF CHINA