Method and system for realizing caption and speech synchronization in video-audio frequency processing

A video and audio processing technology, applied to TV system components, speech analysis, speech recognition, etc. It addresses the difficulty of synchronizing subtitle display with the corresponding sound, saving editing time, improving work efficiency, and avoiding errors.

Status: Inactive
Publication Date: 2007-03-21
BEIJING FOUNDER ELECTRONICS CO LTD +1

AI Technical Summary

Problems solved by technology

[0006] The present invention provides a method and system for synchronizing subtitles and voice in video and audio processing to solve the problem in the prior art that it is difficult to synchronize the display of subtitles with the corresponding sound when subtitles are added to film and television programs.

Examples

Embodiment Construction

[0055] The present invention scans the voice file in the video and audio processing system and compares the characteristic parameters of the scanned voice signal with the voice-signal parameters preset in the system. Because every syllable contains a voiced sound, the system uses the detected voiced sounds to determine the start time and end time of each syllable in the voice file. It then determines the appearance time of the corresponding text in the subtitle text from the start time of each syllable, and synchronizes subtitles and voice through synthesis.
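The pairing of syllable start times with subtitle text described in [0055] can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names `merge_voiced_frames` and `time_subtitle` are hypothetical, the rule that back-to-back voiced frames form one syllable is an assumption, and so is the one-character-per-syllable mapping (which is reasonable for Chinese subtitle text).

```python
from typing import List, Tuple

Span = Tuple[float, float]  # (start_seconds, end_seconds)


def merge_voiced_frames(frames: List[Span], max_gap: float = 0.0) -> List[Span]:
    """Group voiced frames into syllables.

    Assumption: frames separated by at most `max_gap` seconds belong to the
    same syllable. The source does not state the grouping rule.
    """
    syllables: List[Span] = []
    for start, end in sorted(frames):
        if syllables and start - syllables[-1][1] <= max_gap:
            prev_start, prev_end = syllables[-1]
            syllables[-1] = (prev_start, max(prev_end, end))
        else:
            syllables.append((start, end))
    return syllables


def time_subtitle(text: str, syllables: List[Span]) -> List[Tuple[float, str]]:
    """Attach each syllable's start time to the corresponding character.

    Assumption: one non-space character per syllable, in order. Extra
    characters or extra syllables are dropped by zip().
    """
    chars = [c for c in text if not c.isspace()]
    return [(start, ch) for ch, (start, _end) in zip(chars, syllables)]


if __name__ == "__main__":
    frames = [(0.0, 0.02), (0.02, 0.04), (0.10, 0.12)]
    for appear_at, ch in time_subtitle("你好", merge_voiced_frames(frames)):
        print(f"{appear_at:.2f}s  {ch}")
```

Keeping the grouping step separate from the pairing step means a different syllable-boundary rule can be swapped in without touching the subtitle timing logic.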

[0056] In the present invention, the video and audio processing system first scans the voice file. By calculating the short-term average amplitude of the voice signal in each frame of voice, the system judges whether the voice signal in that frame represents voiced sound and, if so, saves the start time and end time of the frame of voice, and then the video ...
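The frame-level check in [0056] could look something like the sketch below. It computes the short-term average amplitude as the mean absolute sample value of each frame and keeps the start and end time of every frame that exceeds a threshold. The frame length, the threshold value, and the function names are assumptions chosen for illustration; the source does not give the preset parameters.

```python
from typing import List, Sequence, Tuple


def short_time_average_amplitude(frame: Sequence[float]) -> float:
    """Mean absolute sample value of one frame (0.0 for an empty frame)."""
    return sum(abs(s) for s in frame) / len(frame) if frame else 0.0


def voiced_frames(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: float = 20.0,   # assumed frame length, not from the source
    threshold: float = 0.1,   # assumed amplitude threshold, not from the source
) -> List[Tuple[float, float]]:
    """Return (start_s, end_s) for each frame judged to be voiced."""
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    spans: List[Tuple[float, float]] = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        if short_time_average_amplitude(frame) > threshold:
            spans.append((i / sample_rate, (i + len(frame)) / sample_rate))
    return spans


if __name__ == "__main__":
    # Silence, then a loud frame, then silence, at 1 kHz with 10 ms frames.
    signal = [0.0] * 10 + [0.5, -0.5] * 5 + [0.0] * 10
    print(voiced_frames(signal, sample_rate=1000, frame_ms=10.0))
```

A single amplitude threshold is the simplest possible classifier; a practical system would likely calibrate it against the preset voice-signal parameters mentioned in the text.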

Abstract

A method for synchronizing captions and speech in video and audio processing. The video and audio processing system scans the voice file in an audio file. During scanning, it uses the characteristic parameters of voiced sound in the speech signal to determine the start time and end time of each syllable containing a voiced sound. The start time of each syllable is saved into the caption text as the appearance time of the corresponding word, which synchronizes captions and speech. A system for synchronizing captions and speech in video and audio processing is also disclosed. The system comprises a user end and the receiving module, processing module, and sending module of the video and audio processing system. It replaces the manual synchronization of captions and speech with automated processing, which reduces errors caused by manual operation and greatly improves the efficiency of post-production editing.

Description

Technical field

[0001] The invention relates to the field of radio and television production, in particular to a method and a system for realizing synchronization of subtitles and voice in video and audio processing.

Background

[0002] With the development of artificial intelligence technology, computer speech recognition has reached a practical level. Voice recognition falls into two main categories: speech recognition and speaker recognition. The basic task of a speech recognition system is to accurately recognize what the speaker said. The task of a speaker recognition system is to identify the speaker or to distinguish the speaker from a known collection of people. The basic work of speech recognition is to decompose the input speech into basic elements, calculate the characteristics of the speech elements, and confirm the start and end points of speech.

[0003] Speech recognition must involve speech. According to the different principles of s...

Application Information

IPC(8): G10L21/06, G10L21/00, G10L15/00, H04N5/278
Inventor: 王常波, 杨列森, 郭宗明, 高国连, 张磊
Owner: BEIJING FOUNDER ELECTRONICS CO LTD