Method and system for generating audio and video subtitles
An audio-video and subtitle technology, applied in the field of audio-video subtitle generation method and system, can solve the problems of low subtitle generation efficiency and high labor cost, and achieve the effects of improving generation efficiency, reducing labor cost, and facilitating digestion and understanding
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0021] figure 1 It shows the flowchart of the method for generating audio and video subtitles provided by Embodiment 1 of the present invention. For the convenience of description, only the parts related to the embodiment of the present invention are shown. The method for generating audio and video subtitles provided by the embodiment of the present invention, the method Including the following steps:
[0022] Step S1, acquiring audio and video data, and extracting audio data in the audio and video data.
[0023] In this embodiment, the acquired audio and video data to be processed may be video files or video streams, and the sources of the video files or video streams include but are not limited to: detected downloaded files, video files found by searching storage devices , The detected video stream (for example: live video stream, http video stream). The audio data in the extracted audio and video data may be audio data without segmentation processing, or audio data after ...
Embodiment 2
[0032] figure 2 A flow chart of the method for generating audio and video subtitles provided by Embodiment 2 of the present invention is shown, and the details are as follows:
[0033] Step S1, acquiring audio and video data, and extracting audio data in the audio and video data.
[0034] Step S2: Segment the audio data according to the speaking time interval and the size of the video frame to obtain audio data segments that conform to the speaking style and the size of the video frame, and record the time information of the audio data segments.
[0035] In step S3, the corresponding text data segment is obtained from the audio data segment through speech recognition, and the start time and end time of the corresponding text data segment are obtained according to the time information of the audio data segment to form subtitle text.
[0036] In step S4, each audio data segment is synchronized with its corresponding text data segment according to the time information of the au...
Embodiment 3
[0039] image 3 It shows a schematic structural diagram of the audio-video subtitle generation system provided by the third embodiment of the present invention. For the convenience of description, only the parts related to the embodiment of the present invention are shown. The audio-video subtitle generation system provided by the embodiment of the present invention, the system It includes: an audio data extraction unit 31 , a segmentation unit 32 , and a subtitle text formation unit 33 .
[0040]Specifically, the audio data extraction unit 31 is used to obtain audio and video data, and extract audio data in the audio and video data;
[0041] The segmentation unit 32 is used to segment the audio data according to the time interval of speaking and the size of the video image, to obtain audio data segments that conform to the speaking style and the size of the video image, and record the time of the audio data segments information; and
[0042] The subtitle text forming unit 3...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com