Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio and text synchronization method, computing device and storage medium

A text and audio technology, applied in the field of data processing, can solve problems such as poor user experience, lack of synchronous sentences, and inability to understand book content well, to improve accuracy, facilitate speech recognition, and improve reading effects and user experience. Effect

Active Publication Date: 2020-12-25
ZHANGYUE TECH CO LTD
View PDF9 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, resulting in the inability to display the read sentences synchronously during audio playback. When the user cannot hear or understand the content read in the audio , due to the lack of display of synchronous statements, the content of the book cannot be well understood, resulting in poor user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio and text synchronization method, computing device and storage medium
  • Audio and text synchronization method, computing device and storage medium
  • Audio and text synchronization method, computing device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0023] figure 1 A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 1 of the present invention, as shown in figure 1 As shown, the method includes the following steps:

[0024] In step S101, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0025] Among them, the original book text of each book and the audio recorded by a real person for each book are stored in the book platform. In this embodiment, the original book text of the book is called the first text. The resulting book text is called the second text. In step S101, the audio and the first text corresponding to the same book that need to be synchronized are obtained from the book platform as the audio and the first text to be matched, and then the first text is segmented to obtain multiple sentences The first statement set of .

[0026] Step S102, segment the audio to obtain a set of audio clips, perf...

Embodiment 2

[0034] Figure 2a A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 2 of the present invention, as shown in Figure 2a As shown, the method includes the following steps:

[0035] In step S201, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0036]The audio and the first text corresponding to the same book that need to be synchronously processed are obtained from the book platform, and then the first text can be segmented according to the specified punctuation marks to obtain the first statement set. Wherein, the specified punctuation mark may be a punctuation mark used to indicate the end of a sentence, such as a period, a question mark, an exclamation mark, and the like. Specifically, the symbol position of the specified punctuation mark contained in the first text can be identified, and the first segmentation point is determined according to the symbol p...

Embodiment 3

[0072] Embodiment 3 of the present invention provides a non-volatile computer storage medium, the storage medium stores at least one executable instruction, and the executable instruction can execute the method for synchronizing audio and text in any of the above method embodiments.

[0073] Specifically, the executable instruction can be used to make the processor perform the following operations: obtain the audio to be matched and the first text, segment the first text to obtain the first statement set; segment the audio to obtain a collection of audio fragments, Perform speech recognition on each audio segment in the segment set to obtain each segment sentence, combine each segment sentence to obtain the second text, and obtain the character sequence corresponding to the second text; sequentially extract the first sentence from the first sentence set, and obtain the first sentence For the corresponding first character sequence, extract the second character sequence from the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio and text synchronization method, computing equipment and a storage medium, and the method comprises the steps: obtaining an audio to be matched and a first text, and segmenting the first text to obtain a first statement set; segmenting the audio to obtain an audio fragment set, performing voice recognition on each audio fragment in the audio fragment set to obtaineach fragment statement, combining the fragment statements to obtain a second text, and obtaining a character sequence corresponding to the second text; sequentially extracting a first statement fromthe first statement set, obtaining a first character sequence corresponding to the first statement, extracting a second character sequence from a character sequence corresponding to the second text according to a preset window, matching the first character sequence with the second character sequence, and determining a third character sequence matched with the first character sequence, and establishing a synchronization relationship between the audio fragment corresponding to the third character sequence and the first statement. According to the scheme, accurate determination of the synchronization relationship between the audio fragments and the statements is realized.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for synchronizing audio and text, a computing device and a storage medium. Background technique [0002] With the continuous development of e-book technology, users can not only read book content with their eyes, but also obtain book content by playing audiobooks. Among them, the way of obtaining book content by playing audiobooks can also be called the way of listening to books, and this way of listening to books brings a new reading experience to users. However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, resulting in the inability to display the read sentences synchronously during audio playback. When the user cannot hear or understand the content read in the audio , due to the lack of display of synchronous statements, the content of the book cannot be well understood, resulting in poor ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/26G06F40/289G11B20/10
CPCG10L15/26G11B2020/10953G06F40/289
Inventor 陈梦瑶唐旺
Owner ZHANGYUE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products