Text processing method and device, and model training method and device

A text processing technology for language text, applied in the field of data processing, that addresses the problem of existing schemes ignoring the semantic information between sentences, which leads to poor construction of parallel sentence pairs.

Active Publication Date: 2021-05-14
PATSNAP CN SUZHOU LTD

AI Technical Summary

Problems solved by technology

[0003] However, existing schemes for constructing parallel sentence pairs rely mainly on vocabulary and sentence-length information and ignore the potential semantic information between sentences, so the construction effect is poor.

Embodiment Construction

[0053] Hereinafter, exemplary embodiments according to the present disclosure will be described in detail with reference to the accompanying drawings. The described embodiments are only some, rather than all, of the embodiments of the present disclosure, and it should be understood that the present disclosure is not limited by the exemplary embodiments described here.

[0054] The technical solutions provided in this disclosure can be applied to smart terminals (such as tablet computers, mobile phones, etc.), so that the smart terminals can have related functions, such as cross-language patent retrieval functions, rapid translation functions of patent texts, etc.

[0055] The application scenario of the text processing method provided by the present disclosure is briefly introduced below with reference to Figure 1.

[0056] Figure 1 is a schematic diagram of an application scenario of the text processing method provided by a...

Abstract

The invention provides a text processing method and device, and a model training method and device, and relates to the technical field of data processing. The text processing method comprises: determining a first language text module based on a first language text, and determining a second language text module based on a second language text; performing a sentence-and-word splitting operation on the first language text module and the second language text module to generate a plurality of first language text units and a plurality of second language text units; and determining the parallel sentence pairs corresponding to the first language text module and the second language text module based on the plurality of first language text units and the plurality of second language text units. The method makes full use of the structural features of the text: the splitting operation converts each text module into text units comprising segmented sentences and words, so that the potential semantic information between sentences can be fully considered and the construction effect of the parallel sentence pairs can be effectively improved.
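The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the visible text does not say how text modules are delimited or how pairs are scored, so the function names (`to_text_units`, `lexicon_score`, `build_parallel_pairs`), the punctuation-based splitting, the toy bilingual lexicon, and the `threshold` parameter are all assumptions made for illustration. The lexicon-overlap score is only a stand-in for whatever semantic matching the disclosure actually uses.

```python
import re

# A text unit pairs a segmented sentence with its words (assumed representation).
TextUnit = tuple[str, list[str]]


def to_text_units(module: str) -> list[TextUnit]:
    """Split a text module into sentences, then each sentence into words."""
    sentences = [s.strip() for s in re.findall(r"[^.!?]+[.!?]?", module)]
    return [(s, re.findall(r"\w+", s.lower())) for s in sentences if s]


def lexicon_score(src_words: list[str], tgt_words: list[str],
                  lexicon: dict[str, str]) -> float:
    """Fraction of source words whose lexicon translation occurs in the target.

    Hypothetical stand-in for a semantic similarity measure.
    """
    if not src_words:
        return 0.0
    tgt = set(tgt_words)
    hits = sum(1 for w in src_words if lexicon.get(w) in tgt)
    return hits / len(src_words)


def build_parallel_pairs(first_module: str, second_module: str,
                         lexicon: dict[str, str],
                         threshold: float = 0.5) -> list[tuple[str, str]]:
    """Pair each first-language sentence with its best-scoring counterpart."""
    first_units = to_text_units(first_module)
    second_units = to_text_units(second_module)
    pairs = []
    for sentence, words in first_units:
        best = max(second_units,
                   key=lambda unit: lexicon_score(words, unit[1], lexicon),
                   default=None)
        # Keep the pair only if the best candidate clears the threshold.
        if best is not None and lexicon_score(words, best[1], lexicon) >= threshold:
            pairs.append((sentence, best[0]))
    return pairs


# Toy English/French modules with a hand-written lexicon (illustrative data only).
first = "The cat sleeps. The dog runs."
second = "Le chien court. Le chat dort."
lexicon = {"the": "le", "cat": "chat", "sleeps": "dort",
           "dog": "chien", "runs": "court"}
pairs = build_parallel_pairs(first, second, lexicon)
```

Because the words are scored rather than the sentence lengths, the sketch pairs sentences correctly even though the two modules list them in a different order, which is the kind of case a purely length-based scheme would miss.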

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, and specifically to a text processing method and device, a model training method and device, a computer-readable storage medium, and electronic equipment.

Background

[0002] In recent years, with the accelerated development of globalization, text processing requirements such as text translation and text retrieval have emerged. Parallel sentence pairs are an important basis for such text processing.

[0003] However, existing schemes for constructing parallel sentence pairs rely mainly on vocabulary and sentence-length information and ignore the potential semantic information between sentences, so the construction effect is poor.

Summary of the invention

[0004] In order to solve the above-mentioned technical problems, the present disclosure is proposed. Embodiments of the present disclosure provide a text processing method and de...

Application Information

IPC(8): G06F40/44, G06F40/289, G06F40/211, G06F40/30, G06N3/08
CPC: G06N3/08, G06F40/211, G06F40/289, G06F40/30, G06F40/44
Inventors: 王超超, 王为磊, 屠昶旸
Owner PATSNAP CN SUZHOU LTD