Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text processing method and device, model training method and device

A text processing and text technology, applied in the field of data processing, can solve the problems of ignoring the semantic information of sentences and poor construction effect, and achieve the effect of improving the construction effect

Active Publication Date: 2021-08-31
PATSNAP CN SUZHOU LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the existing construction schemes of parallel sentence pairs mainly rely on the length information of vocabulary and sentences, ignoring the potential semantic information between sentences, so the construction effect is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and device, model training method and device
  • Text processing method and device, model training method and device
  • Text processing method and device, model training method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] Hereinafter, exemplary embodiments according to the present disclosure will be described in detail with reference to the accompanying drawings. Apparently, the described embodiments are only some of the embodiments of the present disclosure, rather than all the embodiments of the present disclosure, and it should be understood that the present disclosure is not limited by the exemplary embodiments described here.

[0054] The technical solutions provided in this disclosure can be applied to smart terminals (such as tablet computers, mobile phones, etc.), so that the smart terminals can have related functions, such as cross-language patent retrieval functions, rapid translation functions of patent texts, etc.

[0055] Combine below figure 1 The application scenarios of the text processing method provided by the present disclosure are briefly introduced.

[0056] figure 1 Shown is a schematic diagram of an application scenario of the text processing method provided by a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The disclosure provides a text processing method and device, a model training method and device, and relates to the technical field of data processing. The text processing method includes: determining a first language text module based on a first language text, and determining a second language text module based on a second language text; respectively performing sentence splitting operations on the first language text module and the second language text module , to generate a plurality of text units in the first language and a plurality of text units in the second language; based on a plurality of text units in the first language and a plurality of text units in the second language, determine the corresponding parallelism between the first language text module and the second language text module Sentence right. The present disclosure makes full use of the structural features of the text, and converts the corresponding text modules into text units including fragmented sentence and word segmentation by means of the sentence-word splitting operation. Therefore, the present disclosure can fully take into account the potential semantic information between sentences, Thus, the construction effect of parallel sentence pairs can be effectively improved.

Description

technical field [0001] The present disclosure relates to the technical field of data processing, and specifically relates to a text processing method and device, a model training method and device, a computer-readable storage medium, and electronic equipment. Background technique [0002] In recent years, with the accelerated development of globalization, text processing requirements such as text translation and text retrieval have emerged. The importance of parallel sentence pairs as an important basis for text processing is self-evident. [0003] However, the existing parallel sentence pair construction schemes mainly rely on vocabulary and sentence length information, ignoring the potential semantic information between sentences, so the construction effect is poor. Contents of the invention [0004] In order to solve the above-mentioned technical problems, the present disclosure is proposed. Embodiments of the present disclosure provide a text processing method and de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/44G06F40/289G06F40/211G06F40/30G06N3/08
CPCG06N3/08G06F40/211G06F40/289G06F40/30G06F40/44
Inventor 王超超王为磊屠昶旸
Owner PATSNAP CN SUZHOU LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products