Method and system for realizing caption and speech synchronization in video-audio frequency processing

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A processing system, video and audio technology, applied to TV system components, speech analysis, speech recognition, etc., can solve the difficulty of synchronizing subtitle display and corresponding sound, etc., to save editing time, improve work efficiency, and avoid errors Effect

Inactive Publication Date: 2007-03-21

BEIJING FOUNDER ELECTRONICS CO LTD +1

View PDF0 Cites 10 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The present invention provides a method and system for synchronizing subtitles and voice in video and audio processing to solve the problem in the prior art that it is difficult to synchronize the display of subtitles with the corresponding sound when subtitles are added to film and television programs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0055] The present invention scans the voice file in the video and audio processing system, compares the characteristic parameters of the scanned voice signal with the parameter characteristics of the voice signal preset in the system, and determines the voiced sound in the voice file according to the voiced sound existing in each syllable. The start time and end time of a syllable, and finally determine the appearance time of the corresponding text in the subtitle text according to the start time of each syllable in the voice file, and achieve the purpose of subtitle and voice synchronization through synthesis.

[0056] In the present invention, the video and audio processing system first scans the voice file, and by calculating the short-term average amplitude of the voice signal in each frame of voice, the system judges whether the voice signal in a frame of voice represents voiced sound, and if so, saves The start time and end time of the frame of voice, and then the video ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A method for synchronization of caption and sound during video and audio frequency dispose. The video and audio frequency dispose system scans sound document in audio frequency document. During the scanning process, confirm beginning time and close time of each syllable with sonant by characteristic parameter of sound signal to sonant. Keep the beginning time of each syllable into caption text as the appearance time for corresponding words in caption text and realize the synchronization to caption and sound. This also opens a system for synchronization of caption and sound during video and audio frequency dispose. The system relates to user end and incepting module, dispose module and sending module of video and audio frequency dispose system. It changes the handle operation about synchronization to caption and sound during video and audio frequency dispose into this system to reduce error taking by handle operation and greatly enhance the work efficiency of anaphase compilation.

Description

technical field [0001] The invention relates to the field of radio and television production, in particular to a method and a system for realizing synchronization of subtitles and voice in video and audio processing. Background technique [0002] With the development of artificial intelligence technology, computer speech recognition has also reached the level of practicality. There are two main categories of speech recognition: speech recognition and speaker recognition. The basic task of the speech recognition system is to accurately recognize what the speaker said. The task of a speaker recognition system is to identify the speaker or to distinguish the speaker from a known collection of people. The basic work of speech recognition is to decompose the input speech into basic elements, calculate the characteristics of speech elements and confirm the start and end points of speech. [0003] Speech recognition must involve speech. According to the different principles of s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/06G10L21/00G10L15/00H04N5/278

Inventor王常波杨列森郭宗明高国连张磊

OwnerBEIJING FOUNDER ELECTRONICS CO LTD

Method and system for realizing caption and speech synchronization in video-audio frequency processing

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology