Method and apparatus for generating mixed-language-based subtitle file
A file generation and language technology, applied in the computer field, can solve the problems of high cost, low efficiency in generating mixed-language subtitle files, and inability to provide a method for generating mixed-language subtitle files, etc., so as to reduce the generation cost and improve the recognition accuracy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] figure 1 It shows the implementation flow of the mixed language subtitle file generation method provided by Embodiment 1 of the present invention. For the convenience of description, only the parts related to the embodiment of the present invention are shown, and the details are as follows:
[0024] In step S101, an audio and video file input by a user is received, and audio in the audio and video file is extracted.
[0025] The embodiments of the present invention are applicable to audio and video playback devices such as computers, mobile phones, and tablet computers that support audio and video playback. Audio and video files are audio files or video files without subtitles, and multiple languages may appear in the audio or video, for example, Chinese and English appear simultaneously in one sentence, or one or more Chinese and English sentences appear simultaneously in a paragraph. Specifically, when the audio-video file is a video file, the audio stream in the v...
Embodiment 2
[0043] figure 2 The structure of the mixed-language subtitle file generation device provided by the second embodiment of the present invention is shown. For the convenience of description, only the parts related to the embodiment of the present invention are shown, including:
[0044] The audio extraction module 21 is configured to receive audio and video files input by the user, and extract audio from the audio and video files.
[0045] In the embodiment of the present invention, when the audio and video file is a video file, the audio stream in the video file needs to be separated, and the audio stream is saved as an audio file for operations such as audio segmentation and language recognition in subsequent steps .
[0046] Preferably, after extracting the audio in the audio and video file, the audio can be preprocessed. The preprocessing method can include audio signal denoising, decibel value setting, etc., to remove noise interference and weaken the background sound in ...
Embodiment 3
[0062] image 3 The preferred structure of the mixed language subtitle file generation structure provided by the third embodiment of the present invention is shown. For the convenience of description, only the parts related to the embodiment of the present invention are shown, including:
[0063] The audio extraction module 31 is configured to receive audio and video files input by the user, and extract audio from the audio and video files.
[0064] In the embodiment of the present invention, when the audio and video file is a video file, the audio stream in the video file needs to be separated, and the audio stream is saved as an audio file for operations such as audio segmentation and language recognition in subsequent steps .
[0065] The audio segmentation module 32 is configured to divide the audio into multiple audio segments, and store the time information corresponding to each audio segment.
[0066] In the embodiment of the present invention, the voice pause interva...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com