Method and system for generating audio and video subtitles

An audio and video subtitle technology, applied to methods and systems for generating audio and video subtitles. It addresses the low subtitle generation efficiency and high labor cost of existing approaches, improving generation efficiency, reducing labor cost, and making content easier to digest and understand.

Status: Inactive
Publication date: 2016-06-22
GUANGDONG XIAOTIANCAI TECH CO LTD
Cites: 6; Cited by: 52

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a method and system for generating audio and video subtitles, aiming to solve the high labor cost and low subtitle generation efficiency that result from relying on manual entry of subtitle text in the prior art.

Examples

Embodiment 1

[0021] Figure 1 shows the flowchart of the method for generating audio and video subtitles provided by Embodiment 1 of the present invention. For convenience of description, only the parts related to this embodiment are shown. The method includes the following steps:

[0022] Step S1: acquire audio and video data, and extract the audio data from the audio and video data.

[0023] In this embodiment, the audio and video data to be processed may be video files or video streams. Sources of the video files or video streams include, but are not limited to: detected downloaded files, video files found by searching storage devices, and detected video streams (for example, live video streams or HTTP video streams). The audio data extracted from the audio and video data may be audio data without segmentation processing, or audio data after ...
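
As a rough illustration of step S1, the sketch below builds a command line for the ffmpeg tool that drops the video stream and writes the audio track as mono 16-bit PCM. The patent does not name any extraction tool; the use of ffmpeg, the function name build_extract_command, the 16 kHz sample rate, and the file names are all assumptions made for this example.

```python
from typing import List


def build_extract_command(source: str, wav_out: str, sample_rate: int = 16000) -> List[str]:
    """Return an ffmpeg argument list that extracts the audio track of `source`.

    `source` may be a local file path or a stream URL (hypothetical inputs).
    The choice of ffmpeg and of 16 kHz mono PCM is an assumption, not part of the patent.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", source,            # input video file or stream
        "-vn",                   # discard the video stream
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM audio
        "-ar", str(sample_rate), # output sample rate in Hz
        "-ac", "1",              # downmix to a single channel
        wav_out,
    ]


if __name__ == "__main__":
    # Print the command; pass the list to subprocess.run(..., check=True) to execute it.
    print(" ".join(build_extract_command("lesson.mp4", "lesson.wav")))
```

Returning the argument list rather than running it keeps the sketch usable on machines without ffmpeg installed.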

Embodiment 2

[0032] Figure 2 shows a flowchart of the method for generating audio and video subtitles provided by Embodiment 2 of the present invention. The details are as follows:

[0033] Step S1: acquire audio and video data, and extract the audio data from the audio and video data.

[0034] Step S2: segment the audio data according to the speaking time interval and the size of the video frame to obtain audio data segments that conform to the speaking style and the size of the video frame, and record the time information of the audio data segments.

[0035] Step S3: obtain the corresponding text data segment from each audio data segment through speech recognition, and obtain the start time and end time of that text data segment from the time information of the audio data segment, to form the subtitle text.

[0036] Step S4: each audio data segment is synchronized with its corresponding text data segment according to the time information of the au...
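
Steps S2 and S3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: speech is detected with a simple per-frame energy threshold, a pause of at least min_pause_ms closes a segment, and a maximum segment duration (max_len_ms) stands in for the "size of the video frame" constraint, which the source does not quantify. The recognizer is a caller-supplied stub, and the SubRip (SRT) output format is also an assumption. All function and parameter names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Span = Tuple[int, int]  # (start_ms, end_ms) time information of one audio segment


def segment_by_pauses(energies: Sequence[float], frame_ms: int = 100,
                      threshold: float = 0.01, min_pause_ms: int = 300,
                      max_len_ms: int = 5000) -> List[Span]:
    """Split per-frame energies into speech segments and record their times.

    A segment closes after `min_pause_ms` of silence or once it reaches
    `max_len_ms` (an assumed proxy for fitting the text on screen).
    """
    segments: List[Span] = []
    start = None   # start time of the currently open segment, if any
    last_end = 0   # end time of the most recent voiced frame
    for i, energy in enumerate(energies):
        t0, t1 = i * frame_ms, (i + 1) * frame_ms
        if energy >= threshold:
            if start is None:
                start = t0
            last_end = t1
            if last_end - start >= max_len_ms:
                segments.append((start, last_end))
                start = None
        elif start is not None and t1 - last_end >= min_pause_ms:
            segments.append((start, last_end))
            start = None
    if start is not None:
        segments.append((start, last_end))
    return segments


def srt_time(ms: int) -> str:
    """Format milliseconds as an SRT timestamp, HH:MM:SS,mmm."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def build_srt(segments: List[Span], recognize: Callable[[int, int], str]) -> str:
    """Attach recognized text to each segment's start and end time."""
    blocks = []
    for number, (start, end) in enumerate(segments, 1):
        text = recognize(start, end)  # speech recognition is stubbed by the caller
        blocks.append(f"{number}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    frames = [0, 1, 1, 0, 0, 0, 1, 1, 1, 0]  # toy energies, one value per 100 ms
    spans = segment_by_pauses(frames, threshold=0.5)
    print(build_srt(spans, lambda s, e: f"(speech from {s} ms to {e} ms)"))
```

Working in integer milliseconds avoids floating-point drift when comparing pause lengths against the thresholds.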

Embodiment 3

[0039] Figure 3 shows a schematic structural diagram of the audio and video subtitle generation system provided by Embodiment 3 of the present invention. For convenience of description, only the parts related to this embodiment are shown. The system includes: an audio data extraction unit 31, a segmentation unit 32, and a subtitle text forming unit 33.

[0040] Specifically, the audio data extraction unit 31 is used to obtain audio and video data, and extract the audio data from the audio and video data;

[0041] The segmentation unit 32 is used to segment the audio data according to the speaking time interval and the size of the video image, to obtain audio data segments that conform to the speaking style and the size of the video image, and to record the time information of the audio data segments; and

[0042] The subtitle text forming unit 3...
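
One way to picture how the three units fit together is a small container that wires them as interchangeable callables. The class name SubtitleSystem, the field names, and the data types are hypothetical; the patent describes the units functionally and does not prescribe this structure.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Span = Tuple[int, int]             # (start_ms, end_ms)
Subtitle = Tuple[int, int, str]    # (start_ms, end_ms, text)


@dataclass
class SubtitleSystem:
    """Hypothetical wiring of units 31 to 33 as injectable functions."""
    extract_audio: Callable[[str], bytes]     # role of audio data extraction unit 31
    segment: Callable[[bytes], List[Span]]    # role of segmentation unit 32
    recognize: Callable[[bytes, Span], str]   # speech recognition used by unit 33

    def run(self, source: str) -> List[Subtitle]:
        """Form timed subtitle entries, the role of subtitle text forming unit 33."""
        audio = self.extract_audio(source)
        return [(start, end, self.recognize(audio, (start, end)))
                for start, end in self.segment(audio)]


if __name__ == "__main__":
    # Stand-in implementations so the sketch runs without real audio or a recognizer.
    demo = SubtitleSystem(
        extract_audio=lambda src: b"pcm-bytes",
        segment=lambda audio: [(0, 900), (1200, 2000)],
        recognize=lambda audio, span: f"text for {span[0]}-{span[1]} ms",
    )
    for entry in demo.run("lesson.mp4"):
        print(entry)
```

Passing the units in as functions lets each one be swapped or tested in isolation, for example replacing the stub recognizer with a real speech recognition service.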

Abstract

The present invention is applicable to the field of computer technology, and provides a method and system for generating audio and video subtitles. The method includes: acquiring audio and video data, and extracting the audio data from the audio and video data; segmenting the audio data to obtain audio data segments that conform to the speaking style and the size of the video screen, and recording the time information of the audio data segments; and obtaining the corresponding text data segments through speech recognition, with the start time and end time of each text data segment obtained from the time information of the audio data segment, to form the subtitle text. The invention removes the heavy workload of entering subtitles manually, obtains text data by recognizing audio data, and generates complete subtitles simply and efficiently.

Description

Technical field

[0001] The invention belongs to the technical field of computers, and in particular relates to a method and system for generating audio and video subtitles.

Background technique

[0002] With the continuous development of Internet technology, audio and video content has attracted a large number of users through its convenient access, diverse video sources, and real-time updates, making it an indispensable part of users' lives. Subtitles help people understand audio and video content in a more intuitive and reliable way, and more and more users are accustomed to audio and video files with subtitles. A segment of voice data and a large segment of text data are used to generate subtitles. For audio and video without subtitles, users can rely only on what they hear, and the user experience is poor.

[0003] Under the condition of no text manuscript, the existing method of generating audio and vid...

Application Information

Patent type & authority: Applications (China)
IPC(8): H04N21/43, H04N21/439, H04N21/81, H04N21/845, G10L15/26
CPC: H04N21/4307, G10L15/26, H04N21/4394, H04N21/8133, H04N21/8456
Inventors: 王金龙, 丁小响
Owner: GUANGDONG XIAOTIANCAI TECH CO LTD