Method and system for generating audio and video subtitles

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
An audio-video and subtitle technology, applied in the field of audio-video subtitle generation method and system, can solve the problems of low subtitle generation efficiency and high labor cost, and achieve the effects of improving generation efficiency, reducing labor cost, and facilitating digestion and understanding

Inactive Publication Date: 2016-06-22

GUANGDONG XIAOTIANCAI TECH CO LTD

View PDF6 Cites 52 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The purpose of the present invention is to provide a method and system for generating audio and video subtitles, aiming to solve the problems of high labor cost and low subtitle generation efficiency caused by relying on manual input of subtitle texts in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0021] figure 1 It shows the flowchart of the method for generating audio and video subtitles provided by Embodiment 1 of the present invention. For the convenience of description, only the parts related to the embodiment of the present invention are shown. The method for generating audio and video subtitles provided by the embodiment of the present invention, the method Including the following steps:

[0022] Step S1, acquiring audio and video data, and extracting audio data in the audio and video data.

[0023] In this embodiment, the acquired audio and video data to be processed may be video files or video streams, and the sources of the video files or video streams include but are not limited to: detected downloaded files, video files found by searching storage devices , The detected video stream (for example: live video stream, http video stream). The audio data in the extracted audio and video data may be audio data without segmentation processing, or audio data after ...

Embodiment 2

[0032] figure 2 A flow chart of the method for generating audio and video subtitles provided by Embodiment 2 of the present invention is shown, and the details are as follows:

[0033] Step S1, acquiring audio and video data, and extracting audio data in the audio and video data.

[0034] Step S2: Segment the audio data according to the speaking time interval and the size of the video frame to obtain audio data segments that conform to the speaking style and the size of the video frame, and record the time information of the audio data segments.

[0035] In step S3, the corresponding text data segment is obtained from the audio data segment through speech recognition, and the start time and end time of the corresponding text data segment are obtained according to the time information of the audio data segment to form subtitle text.

[0036] In step S4, each audio data segment is synchronized with its corresponding text data segment according to the time information of the au...

Embodiment 3

[0039] image 3 It shows a schematic structural diagram of the audio-video subtitle generation system provided by the third embodiment of the present invention. For the convenience of description, only the parts related to the embodiment of the present invention are shown. The audio-video subtitle generation system provided by the embodiment of the present invention, the system It includes: an audio data extraction unit 31 , a segmentation unit 32 , and a subtitle text formation unit 33 .

[0040]Specifically, the audio data extraction unit 31 is used to obtain audio and video data, and extract audio data in the audio and video data;

[0041] The segmentation unit 32 is used to segment the audio data according to the time interval of speaking and the size of the video image, to obtain audio data segments that conform to the speaking style and the size of the video image, and record the time of the audio data segments information; and

[0042] The subtitle text forming unit 3...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention is applicable to the field of computer technology, and provides a method and system for generating audio and video subtitles. The method includes: acquiring audio and video data, and extracting audio data in the audio and video data; Segment the audio data to obtain audio data segments that conform to the speaking style and the size of the video screen, and record the time information of the audio data segment; obtain the corresponding text data segment through voice recognition, and according to the audio data segment The time information is used to obtain the start time and end time of the corresponding text data segment to form subtitle text. The invention gets rid of the complicated workload of manually entering subtitles, realizes obtaining text data by identifying audio data, and generates complete subtitles simply and efficiently.

Description

technical field [0001] The invention belongs to the technical field of computers, and in particular relates to a method and system for generating audio and video subtitles. Background technique [0002] With the continuous development of Internet technology, audio and video has attracted a large number of users with its convenient access experience, diverse video sources and real-time update speed, making audio and video an indispensable part of users' lives. The emergence of subtitles makes audio and video help people understand the content of audio and video in a more intuitive and reliable way. More and more users are used to audio and video files with subtitles. A segment of voice data and a large segment of text data are used to generate subtitles. For audio and video without subtitles, users can only rely on what they hear to understand, and the user experience is poor. [0003] Under the condition of no text manuscript, the existing method of generating audio and vid...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): H04N21/43H04N21/439H04N21/81H04N21/845G10L15/26

CPCH04N21/4307G10L15/26H04N21/4394H04N21/8133H04N21/8456

Inventor王金龙丁小响

OwnerGUANGDONG XIAOTIANCAI TECH CO LTD

Method and system for generating audio and video subtitles

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology