Subtitle generation method and apparatus, and device for generating subtitles

A subtitle generation technology based on word segmentation, applied in the computer field. It addresses problems such as the difficulty of distinguishing expression pauses, inaccurate recognition results, and the poor accuracy and fluency of subtitles, so as to ensure the semantic integrity and rationality of the subtitles and improve their accuracy and fluency.

Pending Publication Date: 2021-09-03
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, a method that segments the audio stream based on a single audio signal feature, such as silent segments, has difficulty distinguishing expression pauses within a sentence from expression pauses between sentences in a character's speech. As a result, the semantics expressed by the segmented speech segments are often incomplete. When speech recognition is performed on these segments, the recognition results are often not accurate enough, and the accuracy and fluency of the generated subtitles are poor, which hinders users' understanding of the audio and video content.

Embodiment Construction

[0074] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0075] Method embodiment

[0076] Referring to Figure 1, a flow chart of the steps of an embodiment of the subtitle generation method of the present invention is shown. The method may specifically include the following steps:

[0077] Step 101: Perform speech recognition processing on the audio-video signal to be processed, and obtain the text sequence corresponding to the audio-video signal and the timestamp mapping table of the text sequence,...

Abstract

The embodiment of the invention provides a subtitle generation method and device and a device for generating subtitles. The method comprises the steps of carrying out voice recognition processing on audio and video signals to be processed, obtaining a text sequence corresponding to the audio and video signals and a timestamp mapping table of the text sequence, wherein the timestamp mapping table comprises timestamps corresponding to all segmented words in the text sequence; determining boundary segmented words in the text sequence; and splitting the text sequence according to the boundary segmented words and the timestamps corresponding to the boundary segmented words to generate subtitle line files corresponding to the audio and video signals. According to the embodiment of the invention, the accuracy and fluency of the generated subtitle file can be improved.
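The splitting step described in the abstract can be illustrated with a minimal sketch. It assumes the recognizer has already produced segmented words with start and end timestamps, and that the positions of the boundary segmented words are supplied by the caller, since this excerpt does not state how they are determined. The `TimedWord` and `SubtitleLine` types, the `split_into_lines` function, and the sample values are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class TimedWord:
    """A segmented word with start/end timestamps in seconds (hypothetical type)."""
    text: str
    start: float
    end: float


@dataclass(frozen=True)
class SubtitleLine:
    """One subtitle line with its display interval (hypothetical type)."""
    start: float
    end: float
    text: str


def split_into_lines(words: Sequence[TimedWord],
                     boundary_indices: Sequence[int]) -> List[SubtitleLine]:
    """Split the word sequence after each boundary word.

    boundary_indices lists the positions of words that end a line. How those
    positions are chosen is outside this sketch and is supplied by the caller.
    """
    lines: List[SubtitleLine] = []
    begin = 0
    # Always close the final line at the last word, even if it is not marked.
    ends = sorted(set(boundary_indices) | {len(words) - 1}) if words else []
    for end in ends:
        chunk = words[begin:end + 1]
        if chunk:
            # A line starts with its first word and ends with its last word.
            lines.append(SubtitleLine(chunk[0].start, chunk[-1].end,
                                      " ".join(w.text for w in chunk)))
        begin = end + 1
    return lines


# Illustrative data only; the words and timestamps are made up.
words = [
    TimedWord("hello", 0.0, 0.4),
    TimedWord("everyone", 0.4, 0.9),
    TimedWord("welcome", 1.5, 2.0),
    TimedWord("back", 2.0, 2.3),
]
subtitle_lines = split_into_lines(words, boundary_indices=[1])
for line in subtitle_lines:
    print(f"{line.start:.2f} --> {line.end:.2f}  {line.text}")
```

Joining words with spaces is an assumption suited to English text; for a language written without spaces, such as Chinese, the words would be concatenated directly.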

Description

Technical field

[0001] The present invention relates to the field of computer technology, in particular to a method and device for generating subtitles and a device for generating subtitles.

Background technique

[0002] When a user is watching some audio and video, such as webcast or movie, he can understand the audio and video content through the subtitles displayed on the audio and video display screen.

[0003] In traditional methods for generating audio and video subtitles, audio streams are mainly processed according to silent segments so as to generate subtitles. The silent segment may be a segment without voice in the audio stream of the audio and video. According to the silent segment, the audio stream is segmented into multiple voice segments, and then speech recognition is performed on the segmented voice segments to obtain subtitles corresponding to the voice segments.

[0004] However, the method of segmenting the audio stream based on a single audio signal fea...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/258, G06F40/279, G06F40/30, G06F40/56, G06F40/58, G06F16/33
CPC: G06F40/258, G06F40/279, G06F40/30, G06F40/56, G06F40/58, G06F16/3343, G06F16/3344
Inventors: 卫林钰, 陈伟, 张旭
Owner: BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD