Technology for realizing real-time subtitle overlay during video call and applications of technology

A technology of video calls and subtitles, which is applied to the parts of color TVs, parts of TV systems, TVs, etc. It can solve problems such as inaudible, inaudible to the other party, and having to play the sound outside, and achieve the goal of solving distress Effect

Inactive Publication Date: 2019-11-05
BEE SMART INFORMATION TECH CO LTD
View PDF14 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the limitations of the form of the smartphone, both the speaker and the microphone are on the body. If you do not use headphones or other equipment, if you need the camera to capture the user's head, or you need to look at the other party on the screen and speak, you need to put the device on the If the place is far away from the ear, but you need to hear the other party’s voice clearly, you have to put the sound out, that is, turn on the speaker. This has the following disadvantages: If yo

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Technology for realizing real-time subtitle overlay during video call and applications of technology
  • Technology for realizing real-time subtitle overlay during video call and applications of technology
  • Technology for realizing real-time subtitle overlay during video call and applications of technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0032] Such as Figure 1-3 Shown, a kind of technology of superimposing subtitles in real time in video call, comprises subtitle software, comprises following implementation steps:

[0033] S1: Speech recognition algorithm, through machine learning algorithm, captures the audio data in the video in real time, and converts the audio data into meaningful language data;

[0034] S2: text conversion algorithm, which converts the extracted voice data into text information in real time after being processed by the algorithm;

[0035] S3: Subtitle display algorithm, real-time word-by-word or word-by-word display of text information;

[0036] S4: Automatic sentence segmentation algorithm, through the analysis of audio files, to obtain the start and stop points of a sentence;

[0037] S5: text and audio and video superimposition algorithm, the text is directly superimposed and displayed on the video interface to form video subtitles, the video interface does not specify the text subt...

Embodiment 2

[0039] On the basis of Embodiment 1, the text conversion algorithm also includes a complete sentence conversion algorithm. With the continuous improvement of voice data, after a complete sentence is recognized, the recognized text content is updated and displayed according to the approximate meaning of the sentence.

Embodiment 3

[0041] On the basis of Embodiments 1 and 2, the text and audio and video overlay algorithm also includes gravity sensing, which can recognize the direction of gravity of the device, so the subtitle can adjust the direction of superimposed display according to the direction of gravity, and automatically adjust according to the screen size with the current font size The number of characters displayed on a single line.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a technology for realizing real-time subtitle overlay during a video call and applications of the technology. The technology comprises subtitle software, and comprises the following steps: S1, a voice recognition algorithm, namely, through a machine learning algorithm, capturing audio data in a video in real time, and converting the audio data into language data with practical meaning; S2, a character converting algorithm, namely, carrying out algorithm processing on the acquired voice data so that character information is obtained through converting in real time; S3, asubtitle display algorithm, namely, carrying out real-time character-by-character or word-by-word display on the character information; S4, an automatic sentence punctuating algorithm, namely, analyzing an audio file, so that the starting and pausing points of one sentence are acquired; and S5, a character and audio/video overlay method, namely, directly displaying characters to a video interfacein an overlay manner, so that video captions are formed, wherein the video interface does not assign the display positions of the character subtitles. Through the technology for realizing real-time subtitle overlay during a video call and the applications of the technology, after the complete sentence is acquired, all the displayed characters can be updated according to the meaning of the complete sentence.

Description

technical field [0001] The invention relates to the technical field of video calls, in particular to a technology for superimposing subtitles in real time during a video call and an application thereof. Background technique [0002] The times are advancing, and the way we communicate is constantly changing. The 2G era of long-distance communication using only SMS and telephone voice has become history. With the emergence of 3G communication, people's communication methods have entered the era of video long-distance communication, followed by the explosion of smart devices and the development of 4G communication, until now With the advent of the 5G era, the frequency of video communication has become extremely widespread, and people are increasingly dependent on video communication. [0003] Although video communication has gradually replaced only text and voice communication, there are often some disadvantages and unsatisfactory places in video communication. Due to the li...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/26G10L15/22H04N5/278H04N7/14
CPCG10L15/22G10L15/26H04N5/278H04N7/141
Inventor 谢锋黄胜男李璟苏耀飞乐程胜张意
Owner BEE SMART INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products