Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio noise-tolerant sentence segmentation processing method and system

A processing method and audio technology, applied in the field of subtitle processing and speech, can solve the problems such as the inability to automatically segregate noise, etc., and achieve the effects of fast calculation speed, fast cutting speed, and saving workload

Active Publication Date: 2019-04-23
HUAKEFEIYANG
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this way, the problem of inability to automatically segment sentences and high noise in the existing subtitle correspondence process is solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio noise-tolerant sentence segmentation processing method and system
  • Audio noise-tolerant sentence segmentation processing method and system
  • Audio noise-tolerant sentence segmentation processing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0046] Audio noise-tolerant sentence sentence processing method among the present invention, as figure 1 shown, including:

[0047] Step S101, acquiring multiple frame segments according to the audio.

[0048] The present invention may be installed on a server, or on a personal computer or mobile computing device. The computing terminal referred to below may be a server, a personal computer, or a mobile computing device. First, upload the audio and video files to the server, or open the a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an audio noise tolerance punctuation processing method and a system. The method comprises steps that multiple framing segments are acquired according to an audio; an energy threshold is acquired according to an energy value of each framing segment, a framing segment with an energy value surpassing the energy threshold Et is acquired from the framing segments according to the energy threshold, the frame segment with the energy value surpassing the energy threshold Et is taken as a middle sentence frame to scan a front sequence frame or a back sequence frame, if an energy threshold of the front sequence frame or the back sequence frame is smaller the set energy threshold Et, the frame with the energy threshold smaller than the set energy threshold Et and the middle sentence frame are merged according to the start order into an independent sentence, entropy spectrum analysis on each independent sentence is then carried out, and a final analysis sentence is acquired. Through the method, a problem of automatic punctuation incapability existing in a caption corresponding process in the prior art is solved, recorded audios and videos can not only be processed, but also audios and videos which are presently played can be further processed, for network broadcast flows, network broadcast voice cutting can be automatically carried out, subsequent links such as listening and writing links can be conveniently processed parallelly, and the processing time is shortened.

Description

technical field [0001] The invention relates to the technical field of voice and subtitle processing, in particular to a method and system for processing audio noise-tolerant sentence segmentation. Background technique [0002] At present, in the field of subtitle production, sentence segmentation is mainly performed manually. The premise of artificial voice segmentation is to listen to all the voices, and mark the start and end points of a sentence by tapping shortcut keys while dictating. Due to the delay of slapping, the obtained start point and end point are misaligned and need to be adjusted manually. The whole process takes a lot of time. For example, 30 minutes of audio requires 40 minutes to 1 hour of sentence segmentation time, which is extremely low in productivity. In the field of webcasting, if sentences are not segmented and dictation is performed manually, it is difficult to parallelize, and the speed of human dictation will be slower than the speed of live b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/48H04N5/278
CPCG10L25/48H04N5/278
Inventor 胡飞
Owner HUAKEFEIYANG
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products