Text splicing method and device thereof

A text and segment technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem of lower sentence intelligibility, improve intelligibility, avoid long and difficult sentences, and have appropriate sentence length Effect

Active Publication Date: 2018-03-06
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF14 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] For this reason, the first purpose of the present invention is to propose a text splicing method to improve the accuracy of sentence segmentation, ensure that the sentence semantics are

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text splicing method and device thereof
  • Text splicing method and device thereof
  • Text splicing method and device thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which the same or similar reference numerals indicate the same or similar elements or elements with the same or similar functions. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to explain the present invention, but should not be construed as limiting the present invention.

[0035] The text splicing method and device according to the embodiments of the present invention are described below with reference to the drawings.

[0036] For longer texts, it usually increases the difficulty for readers to understand the meaning of the text. For example, for the text "Give me a chance. How to give you a chance. I didn't have a choice before, now I want to be a good person. Okay, go and tell the judge to see if he lets you be a good person. That is to let me go Death. Sorry, I am a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text splicing method and a device thereof. The method comprises the following steps of: obtaining a to-be-spliced current text fragment, obtaining an average sentence lengthcorresponding to the current text fragment, obtaining a first semantic score, in a semantic model, of the current text fragment, and obtaining a second semantic score, in the semantic model, of a candidate sentence comprising the current text fragment, wherein the current text fragment is a starting fragment of the candidate sentence; and splicing the current text fragment according to the averagesentence length, the first semantic score and the second semantic score, so as to obtain a target sentence corresponding to the current text fragment. During sentence segmentation, sentence lengths are considered, so that the sentence lengths are proper, long difficult sentences or massive short sentences are avoided, and the sentence lengths are relatively stable; and scoring of the semantic model is also considered, so that the sentence segmentation correctness can be improved, semantic meanings of the sentences are not damaged and the intelligibility of the sentences is improved.

Description

Technical field [0001] The present invention relates to the field of natural language processing, in particular to a text splicing method and device. Background technique [0002] In the processing of speech synthesis data, the length of natural language text is long, which is not convenient for the processing of speech synthesis data. It is often necessary to segment the text to obtain the most reasonable sentence. Inaccurate text segmentation methods, the resulting segmented sentences are not reasonable enough, which seriously affects the semantics of the segmented sentences, reduces the comprehensibility of the sentences, and then reduces the quality of data processing. Therefore, how to improve the accuracy of sentence segmentation becomes an urgent problem to be solved. Summary of the invention [0003] The present invention aims to solve one of the technical problems in the related art at least to a certain extent. [0004] For this reason, the first purpose of the present i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
CPCG06F40/211G06F40/30
Inventor 孔德威
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products