Audio and text synchronization method, computing device and storage medium
A text and audio technology, applied in the field of data processing, can solve problems such as poor user experience, lack of synchronous sentences, difficulty in synchronizing audio and book text, etc., to improve user experience, facilitate speech recognition, and improve reading effects.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] figure 1 A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 1 of the present invention, as shown in figure 1 As shown, the method includes the following steps:
[0024] In step S101, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.
[0025] Among them, the original book text of each book and the audio recorded by a real person for each book are stored in the book platform. In this embodiment, the original book text of the book is called the first text. The resulting book text is called the second text. In step S101, the audio and the first text corresponding to the same book that need to be synchronized are obtained from the book platform as the audio and the first text to be matched, and then the first text is segmented to obtain multiple sentences The first statement set of .
[0026] Step S102, segment the audio to obtain a set of audio clips, perf...
Embodiment 2
[0034] Figure 2a A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 2 of the present invention, as shown in Figure 2a As shown, the method includes the following steps:
[0035] In step S201, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.
[0036]The audio and the first text corresponding to the same book that need to be synchronously processed are obtained from the book platform, and then the first text can be segmented according to the specified punctuation marks to obtain the first statement set. Wherein, the specified punctuation mark may be a punctuation mark used to indicate the end of a sentence, such as a period, a question mark, an exclamation mark, and the like. Specifically, the symbol position of the specified punctuation mark contained in the first text can be identified, and the first segmentation point is determined according to the symbol p...
Embodiment 3
[0072] Embodiment 3 of the present invention provides a non-volatile computer storage medium, the storage medium stores at least one executable instruction, and the executable instruction can execute the method for synchronizing audio and text in any of the above method embodiments.
[0073] Specifically, the executable instruction can be used to make the processor perform the following operations: obtain the audio to be matched and the first text, segment the first text to obtain the first statement set; segment the audio to obtain a collection of audio fragments, Perform speech recognition on each audio segment in the segment set to obtain each segment sentence, combine each segment sentence to obtain the second text, and obtain the character sequence corresponding to the second text; sequentially extract the first sentence from the first sentence set, and obtain the first sentence For the corresponding first character sequence, extract the second character sequence from the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com