Audio and text synchronization method, computing device and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A text and audio technology, applied in the field of data processing, can solve problems such as poor user experience, lack of synchronous sentences, and inability to understand book content well, to improve accuracy, facilitate speech recognition, and improve reading effects and user experience. Effect

Active Publication Date: 2020-12-25

ZHANGYUE TECH CO LTD

View PDF9 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, resulting in the inability to display the read sentences synchronously during audio playback. When the user cannot hear or understand the content read in the audio , due to the lack of display of synchronous statements, the content of the book cannot be well understood, resulting in poor user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0023] figure 1 A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 1 of the present invention, as shown in figure 1 As shown, the method includes the following steps:

[0024] In step S101, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0025] Among them, the original book text of each book and the audio recorded by a real person for each book are stored in the book platform. In this embodiment, the original book text of the book is called the first text. The resulting book text is called the second text. In step S101, the audio and the first text corresponding to the same book that need to be synchronized are obtained from the book platform as the audio and the first text to be matched, and then the first text is segmented to obtain multiple sentences The first statement set of .

[0026] Step S102, segment the audio to obtain a set of audio clips, perf...

Embodiment 2

[0034] Figure 2a A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 2 of the present invention, as shown in Figure 2a As shown, the method includes the following steps:

[0035] In step S201, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0036]The audio and the first text corresponding to the same book that need to be synchronously processed are obtained from the book platform, and then the first text can be segmented according to the specified punctuation marks to obtain the first statement set. Wherein, the specified punctuation mark may be a punctuation mark used to indicate the end of a sentence, such as a period, a question mark, an exclamation mark, and the like. Specifically, the symbol position of the specified punctuation mark contained in the first text can be identified, and the first segmentation point is determined according to the symbol p...

Embodiment 3

[0072] Embodiment 3 of the present invention provides a non-volatile computer storage medium, the storage medium stores at least one executable instruction, and the executable instruction can execute the method for synchronizing audio and text in any of the above method embodiments.

[0073] Specifically, the executable instruction can be used to make the processor perform the following operations: obtain the audio to be matched and the first text, segment the first text to obtain the first statement set; segment the audio to obtain a collection of audio fragments, Perform speech recognition on each audio segment in the segment set to obtain each segment sentence, combine each segment sentence to obtain the second text, and obtain the character sequence corresponding to the second text; sequentially extract the first sentence from the first sentence set, and obtain the first sentence For the corresponding first character sequence, extract the second character sequence from the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an audio and text synchronization method, computing equipment and a storage medium, and the method comprises the steps: obtaining an audio to be matched and a first text, and segmenting the first text to obtain a first statement set; segmenting the audio to obtain an audio fragment set, performing voice recognition on each audio fragment in the audio fragment set to obtaineach fragment statement, combining the fragment statements to obtain a second text, and obtaining a character sequence corresponding to the second text; sequentially extracting a first statement fromthe first statement set, obtaining a first character sequence corresponding to the first statement, extracting a second character sequence from a character sequence corresponding to the second text according to a preset window, matching the first character sequence with the second character sequence, and determining a third character sequence matched with the first character sequence, and establishing a synchronization relationship between the audio fragment corresponding to the third character sequence and the first statement. According to the scheme, accurate determination of the synchronization relationship between the audio fragments and the statements is realized.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for synchronizing audio and text, a computing device and a storage medium. Background technique [0002] With the continuous development of e-book technology, users can not only read book content with their eyes, but also obtain book content by playing audiobooks. Among them, the way of obtaining book content by playing audiobooks can also be called the way of listening to books, and this way of listening to books brings a new reading experience to users. However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, resulting in the inability to display the read sentences synchronously during audio playback. When the user cannot hear or understand the content read in the audio , due to the lack of display of synchronous statements, the content of the book cannot be well understood, resulting in poor ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/26G06F40/289G11B20/10

CPCG10L15/26G11B2020/10953G06F40/289

Inventor陈梦瑶唐旺

OwnerZHANGYUE TECH CO LTD

Audio and text synchronization method, computing device and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology