Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and voice activity detector for a speech encoder

A voice activity and detector technology, applied in voice analysis, instruments, etc., to achieve the effect of maintaining quality

Inactive Publication Date: 2012-11-28
TELEFON AB LM ERICSSON (PUBL)
View PDF4 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Another problem: loud noises can have spectrally varying properties that are very similar to some pieces of music that are not suppressed by the VAD algorithm
[0012] Although importance thresholding is used to enhance VAD performance, it has been noted that it may also cause occasional speech truncation, mainly front-end truncation of low SNR non-speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and voice activity detector for a speech encoder
  • Method and voice activity detector for a speech encoder
  • Method and voice activity detector for a speech encoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Embodiments of the present invention will be described more fully hereinafter with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. Embodiments may, however, be embodied in many different forms and should not be construed as limited to those set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will The scope of the present invention is fully conveyed to those skilled in the art. In the drawings, like reference numerals refer to like elements.

[0029] Additionally, those skilled in the art will appreciate that the means and functions described below can be implemented using software functionality in conjunction with programming a microprocessor or general purpose computer, and / or using application specific integrated circuits (ASICs). It will also be appreciated that although the present embodiments have been described primarily in terms of methods and appar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiments of the present invention relates to a primary voice activity detector and a method thereof. By using the method of the embodiments it is possible to determine whether frames of an input signal comprise voice. That is achieved by receiving a frame of the input signal, determining a first SNR of the received frame, comparing the determined first SNR with an adaptive threshold, and detecting whether the received frame comprises voice based on said comparison. The adaptive threshold is at least based on total noise energy of a noise level, an estimate of a second SNR and an energy variation between different frames.

Description

technical field [0001] Embodiments of the invention relate to methods and voice activity detectors, in particular to threshold adaptation of voice activity detectors. Background technique [0002] In speech coding systems for conversational speech, discontinuous transmission (DTX) is often used to increase the efficiency of coding. The reason is that conversational speech contains a large number of pauses embedded in the speech, such as when one person is speaking while another is listening. Thus, with DTX, the vocoder is only active about 50% of the time on average, and comfort noise can be used to encode the rest of the time. Comfort noise is artificial noise generated on the decoder side, only similar in characteristics to encoder side noise, and thus requires less bandwidth. Some example codecs with this feature are AMR NB (Adaptive Multi-Rate Narrowband) and EVRC (Enhanced Variable Rate CODEC). Note that AMR NB uses DTX, while EVRC uses Variable Rate (VBR), where a R...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/02
CPCG10L2025/786G10L25/78G10L21/0208G10L25/87G10L25/18G10L25/51
Inventor 马丁·绍尔斯戴德
Owner TELEFON AB LM ERICSSON (PUBL)