Method and device for adaptive discontinuous voice transmission

An adaptive discontinuous transmission technology, applied in speech analysis, instruments, etc. It addresses problems such as high computational complexity and the inability to flexibly track signal changes, achieving a low average bit rate while preserving sound quality.

Active Publication Date: 2013-01-30
ZTE CORP

AI Technical Summary

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to provide a method and device for voice adaptive discontinuous transmission that overcomes two shortcomings of the prior art: the fixed-interval method cannot flexibly track signal changes, and the variable-interval method must calculate multiple parameters, such as linear prediction, which leads to high computational complexity.

Examples

Embodiment approach 1

[0040] In the first embodiment, the silence insertion description frame processing unit is further configured to send the silence insertion description frame when it determines that the absolute value of the spectral energy of the speech signal frame and/or the absolute value of the spectral energy of the last silence insertion description frame is greater than the single-frame energy threshold, and that the difference between the spectral energy of the speech signal frame and the spectral energy of the last silence insertion description frame is greater than preset limit one.

[0041] The silence insertion description frame processing unit is further configured to determine that the absolute value of the spectral energy of the speech signal frame and/or the absolute value of the spectral energy of the last silence insertion description frame is greater than a single-frame energy threshold and the speech signal frame has an absolute value of the spectral energy. The difference ...
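The energy-based decision of paragraph [0040] could be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the function names, the use of a linear sum of squared magnitudes as "spectral energy", the reading of "and/or" as a logical OR, and the use of an absolute difference are all hypothetical choices that the text above does not specify.

```python
def spectral_energy(spectrum):
    """Sum of squared magnitudes (assumed definition of spectral energy)."""
    return sum(m * m for m in spectrum)


def should_send_sid_by_energy(cur_spectrum, last_sid_spectrum,
                              frame_energy_threshold, diff_limit):
    """Return True if a new SID frame should be sent (energy criterion).

    Assumptions: "and/or" is treated as OR, and the energy difference is
    compared as an absolute value against diff_limit ("preset limit one").
    """
    cur = spectral_energy(cur_spectrum)
    last = spectral_energy(last_sid_spectrum)
    # Gate: at least one of the two frames must exceed the single-frame threshold.
    above_floor = (abs(cur) > frame_energy_threshold
                   or abs(last) > frame_energy_threshold)
    # Trigger: the energy has moved far enough from the last SID frame.
    return above_floor and abs(cur - last) > diff_limit
```

With this sketch, a frame whose energy stays close to that of the last SID frame does not trigger a new SID frame, and neither does any frame pair that stays below the single-frame threshold.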

Embodiment approach 2

[0044] In Embodiment 2, when the silence insertion description frame processing unit determines that the absolute value of the spectral energy of the speech signal frame and/or the absolute value of the spectral energy of the last silence insertion description frame is greater than the single-frame energy threshold, it calculates the spectral correlation value between the current speech signal frame and the last silence insertion description frame from their spectral energies, and sends the silence insertion description frame when the spectral correlation value is less than the spectral correlation threshold.
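A minimal sketch of the correlation-based decision in Embodiment 2 is given below. The excerpt does not define how the spectral correlation value is computed, so this example assumes a normalized correlation (cosine similarity) between per-band energy vectors; the function names and the per-band representation are likewise hypothetical.

```python
import math


def spectral_correlation(cur_bands, last_sid_bands):
    """Normalized correlation of two per-band energy vectors (assumed formula)."""
    dot = sum(a * b for a, b in zip(cur_bands, last_sid_bands))
    norm = (math.sqrt(sum(a * a for a in cur_bands))
            * math.sqrt(sum(b * b for b in last_sid_bands)))
    # If either vector is all zeros, treat the frames as uncorrelated.
    return dot / norm if norm > 0.0 else 0.0


def should_send_sid_by_correlation(cur_bands, last_sid_bands,
                                   frame_energy_threshold,
                                   correlation_threshold):
    """Return True if a new SID frame should be sent (correlation criterion)."""
    cur_energy = sum(cur_bands)
    last_energy = sum(last_sid_bands)
    # Gate on the single-frame energy threshold ("and/or" read as OR).
    if (abs(cur_energy) <= frame_energy_threshold
            and abs(last_energy) <= frame_energy_threshold):
        return False
    # A low correlation means the spectral shape has changed since the last SID.
    return spectral_correlation(cur_bands, last_sid_bands) < correlation_threshold
```

Under this assumed formula, two frames with the same spectral shape but different overall level remain highly correlated, so only a change in shape triggers a new SID frame.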

Embodiment approach 3

[0045] In the third embodiment, the silence insertion description frame processing unit determines whether to send the silence insertion description frame based on both the spectral energy difference and the spectral correlation value.
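The combined decision of the third embodiment might look like the sketch below. The excerpt does not state how the two criteria are combined, so the `require_both` flag, the default of sending when either criterion fires, and all names here are assumptions made purely for illustration.

```python
def should_send_sid_combined(energy_diff, correlation,
                             diff_limit, correlation_threshold,
                             require_both=False):
    """Combine the energy-difference and spectral-correlation criteria.

    energy_diff: spectral energy of the current frame minus that of the last SID.
    correlation: spectral correlation value between the two frames.
    require_both: hypothetical switch; False sends when either criterion
    fires, True sends only when both fire.
    """
    energy_changed = abs(energy_diff) > diff_limit
    shape_changed = correlation < correlation_threshold
    if require_both:
        return energy_changed and shape_changed
    return energy_changed or shape_changed
```

Requiring both criteria would send fewer SID frames and so lower the average bit rate, while accepting either would track background changes more closely; which trade-off the patent intends is not recoverable from this excerpt.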

[0046] As shown in Figure 2, the apparatus may further include a smoothing filtering unit. The smoothing filtering unit smooths and filters the frequency domain signal of the speech signal and then inputs it to the silence insertion description frame processing unit, which performs the above-mentioned processing on the smoothed frequency domain signal. The silence insertion description frame storage unit must then also store the smoothed frequency domain signal.
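One way to picture the smoothing filtering unit is a first-order recursive filter applied to each frequency band across frames, as sketched here. The excerpt does not name the filter type, so the exponential smoothing, the `alpha` coefficient, and the class name are assumptions rather than the patent's design.

```python
class SpectrumSmoother:
    """Per-band exponential smoothing of successive frame spectra (assumed filter)."""

    def __init__(self, alpha=0.9):
        if not 0.0 <= alpha < 1.0:
            raise ValueError("alpha must be in [0, 1)")
        self.alpha = alpha   # weight given to the previous smoothed value
        self.state = None    # last smoothed spectrum, one value per band

    def smooth(self, bands):
        """Update the smoothed spectrum with a new frame and return a copy."""
        if self.state is None:
            # The first frame initializes the filter state directly.
            self.state = list(bands)
        else:
            self.state = [self.alpha * s + (1.0 - self.alpha) * b
                          for s, b in zip(self.state, bands)]
        return list(self.state)
```

In such a setup, the smoothed output would be what the SID decision compares and what the storage unit keeps for the last SID frame, so that both sides of the comparison are filtered in the same way.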

[0047] The method for performing speech adaptive discontinuous transmission includes: in performing speech adaptive discontinuous ...

Abstract

The invention discloses a method and a device for adaptive discontinuous voice transmission. The method includes: during adaptive discontinuous voice transmission, determining whether to transmit a silence insertion descriptor according to the spectral information of the current voice signal frame and of the previous silence insertion descriptor. The method and device address two problems in the prior art: fixed-interval schemes cannot flexibly track signal changes, and variable-interval schemes must compute multiple parameters such as linear prediction, which causes high computational complexity. Because the method and device operate directly in the frequency domain, signal changes can be tracked well, and acoustic fidelity is preserved while a low average bit rate is maintained.

Description

Technical field

[0001] The present invention relates to the field of digital signal processing, and in particular to a method and device for performing voice adaptive discontinuous transmission (Discontinuous Transmission, DTX for short).

Background

[0002] In an actual user communication process, less time is generally spent transmitting the user's voice than transmitting non-voice background sound. If the whole communication is coded with the coding method used for voice signals, resources are greatly wasted. To reduce this waste, the sender in the prior art uses a Voice Activity Detector (VAD for short) algorithm to detect the signal; when an inactive segment is detected in a call, the important information of the signal in the silent segment is encoded at a lower bit rate, that is, the signal is encoded into a silence insertion description (Silence Insertion Descriptor, SID for short) frame, a...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L19/012, G10L19/02
CPC: G10L19/012, G10L19/02
Inventor: 顾彩霞, 袁浩, 江东平, 黎家力
Owner: ZTE CORP