Method for enhancing excitation signal naturalism based on judgment and processing of transition frames

A technique based on excitation signals and transition frames, applied in the field of speech coding, that addresses the underutilization of bit information.

Inactive Publication Date: 2011-03-30
TSINGHUA UNIV
0 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0013] As shown in Figure 1, when quantizing the sub-band unvoiced/voiced decision parameters, the original technique uses simple 1-bit quantization for each sub-band decision parameter (5 bits in total), which leaves a certain degree of redundancy in the coded stream, so the bits are underutilized.

Examples

Embodiment Construction

[0034] The proposed method for improving the naturalness of the excitation signal through transition frame judgment and processing is described further below with reference to the accompanying drawings and embodiments:

[0035] The flow of the method is shown in Figure 2 and includes the following steps:

[0036] On the encoding side, follow the steps below:

[0037] Step (1): divide the input speech signal samples into frames in time order;

[0038] Step (2): extract the pitch period parameter from the current frame;

[0039] Step (3): extract the energy parameters from the current frame;

[0040] Step (4): extract the residual spectrum amplitude parameter from the current frame;

[0041] Step (5): extract the 5 sub-band unvoiced/voiced decision parameters from the current frame;

[0042] Step (6): compute the average energy of the 60 sample points before and the 60 sample points after the current frame; when the average energy of the rear 60 sample points is greater tha...
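
Step (1) above can be illustrated with a minimal Python sketch. The function name `split_into_frames` and the `frame_len` parameter are hypothetical; the text shown here does not state the frame length, so the caller must supply it, and trailing samples that do not fill a whole frame are simply dropped in this sketch.

```python
def split_into_frames(samples, frame_len):
    """Split samples into consecutive, non-overlapping frames in time order.

    frame_len is an assumption supplied by the caller; the source text
    does not give the frame length. Incomplete trailing frames are dropped.
    """
    if frame_len <= 0:
        raise ValueError("frame_len must be positive")
    n_frames = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(n_frames)]
```

Each returned frame would then be passed to the parameter extraction of steps (2) to (5).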


Abstract

The invention provides a method for improving the naturalness of an excitation signal on the basis of transition frame judgment and processing, which belongs to the technical field of low-rate speech compression coding. If the ratio of the average energy of the 60 sampling points before the current frame to the average energy of the 60 sampling points after the current frame is less than 1/32, the current frame is judged to be a transition frame and is represented using a redundant mode of the sub-band unvoiced/voiced vector. The decoder dequantizes the parameters and uses the decoded sub-band unvoiced/voiced vector to judge whether the current frame is a transition frame; if not, it then judges whether the current frame is a voiced frame and the previous frame is an unvoiced frame. If so, the parameters of the current frame at the decoder are not interpolated with the parameters of the previous frame when the excitation signal is synthesized. The method can improve the naturalness of synthesized speech and is suitable for a SELP 2.4 kbps vocoder.
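
The energy-ratio test and the decoder-side rule described in the abstract can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: the function names are hypothetical, "average energy" is taken to be the mean of the squared samples, the caller is assumed to supply the two 60-sample blocks, and the abstract's "if yes" is read as "skip interpolation when the frame is a transition frame, or when a voiced frame follows an unvoiced one".

```python
def average_energy(samples):
    """Mean of squared sample values (assumed definition of average energy)."""
    if not samples:
        raise ValueError("samples must not be empty")
    return sum(x * x for x in samples) / len(samples)


def is_transition_frame(front, rear, ratio_threshold=1 / 32):
    """Encoder side: True when E(front) / E(rear) < ratio_threshold.

    front and rear are the 60-sample blocks before and after the frame.
    The comparison is written without a division so that a silent rear
    block (zero energy) cannot cause a division by zero.
    """
    return average_energy(front) < ratio_threshold * average_energy(rear)


def skip_interpolation(transition, current_voiced, previous_voiced):
    """Decoder side, under the assumed reading of the abstract:
    do not interpolate with the previous frame's parameters when the
    frame is a transition frame, or when it is voiced and the previous
    frame was unvoiced."""
    return transition or (current_voiced and not previous_voiced)
```

For example, a quiet block followed by a loud one (amplitude 0.01 versus 1.0) gives an energy ratio of about 1/10000, well under 1/32, so the frame would be flagged as a transition frame and the decoder would synthesize its excitation without blending in the previous frame's parameters.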

Description

Technical field

[0001] The invention belongs to the technical field of speech coding and is aimed in particular at SELP 2.4 kb/s vocoder technology.

Background technique

[0002] Speech coding is widely used in communication systems, voice storage and playback systems, and consumer products with voice functions. In recent years, the International Telecommunication Union (ITU), some regional organizations and some countries have successively formulated a series of speech compression coding standards that achieve satisfactory speech quality at coding rates from 2.4 kb/s to 16 kb/s. At present, research at home and abroad focuses mainly on high-quality speech compression coding at rates below 2.4 kb/s, which is used mainly for wireless communication, secure communication, and large-capacity voice storage and playback. The synthesis of the excitation signal is very important in low-rate speech coding. The SELP vocoder uses a mixed excitation signal, and uses pitch period parameters, e...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L19/14, G10L19/16
Inventor: 崔慧娟, 唐昆, 计哲, 李晔
Owner: TSINGHUA UNIV