Speech coding and decoding apparatus

Inactive Publication Date: 2000-05-30
KK TOSHIBA
View PDF8 Cites 45 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

With this shortcoming in mind, it is an object of the present invention to provide a speech coding apparatus capable of providing high-quality synthesized sounds even at a low transfer rate.
According to the present invention, in a speech coding apparatus for driving a synthesis filter by an excitation signal to acquire a synthesized sound, the frame of the excitation signal is divided into plural subframes of an equal length or different lengths, a pulse interval is variable subframe by subframe, the excitation signal is formed by a train of excitation pulses with equal intervals in each subframe, the amplitude or the amplitude and phase of the excitation pulse train

Problems solved by technology

However, the results of the experiments by this conventional system show that when the transfer rate becomes low, particularly, 10 Kb/s or below, noise in the synthesized sound becomes prominent, deteriorating the quality.
In particular, the quality degradation is noticeable in the experiments with female voices with short pitch.
According to the conventional system, however, even though the power of the prediction residual signal changes within a frame, the synthesis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech coding and decoding apparatus
  • Speech coding and decoding apparatus
  • Speech coding and decoding apparatus

Examples

Experimental program
Comparison scheme
Effect test

Example

FIG. 4 is a block diagram showing a coding apparatus according to the first embodiment. A speech signal s(n) after A / D conversion is input to a frame buffer 102, which accumulates the speech signal s(n) for one frame. Individual elements in FIG. 4 perform the following processes frame by frame.

A prediction parameter calculator 108 receives the speech signal s(n) from the frame buffer 102, and computes a predetermined number, p, of prediction parameters (LPC parameter or reflection coefficient) by an autocorrelation method or covariance method. The acquired prediction parameters are sent to a prediction parameter coder 110, which codes the prediction parameters based on a predetermined number of quantization bits, and outputs the codes to a decoder 112 and a multiplexer 118. The decoder 112 decodes the received codes of the prediction parameters and sends decoded values to a prediction filter 106 and an excitation signal generator 104. The prediction filter 106 receives the speech si...

Example

Now the second embodiment will be explained. Although the excitation pulse is computed by the A-b-S (Analysis by Synthesis) method in the first embodiment, the excitation pulse may be analytically calculated as another method.

Here, first, let N (samples) be the frame length, M be the number of subframes, L (samples) be the subframe length, N.sub.m (1.ltoreq.m.ltoreq.M) be the interval of the excitation pulse in the m-th subframe, Q.sub.m be the number of excitation pulses, g.sub.i.sup.(m) (1.ltoreq.i.ltoreq.Q.sub.m) be the amplitude of the excitation pulse, and K.sub.m be the phase of the excitation pulse. Here there is the following relation.

where .left brkt-bot..multidot..right brkt-bot. indicates computation to provide an integer portion by rounding off.

FIG. 7 illustrates an example of the excitation signal in a case where M=5, L=8, N.sub.1 =N.sub.3 =1, N.sub.2 =N.sub.4 =N.sub.5 =2, Q.sub.1 =Q.sub.3 =8, Q.sub.2 =Q.sub.4 =Q.sub.5 =4, and K.sub.1 =K.sub.2 =K.sub.3 =K.sub.4 =1. Let ...

Example

FIG. 9 is a block diagram showing a coding apparatus according to the third embodiment, and FIG. 11 is a block diagram of a decoding apparatus according to the third embodiment. In FIG. 9, a speech signal after A / D conversion is input to a frame buffer 202, which accumulates the speech signal for one frame. Therefore, individual elements in FIG. 9 perform the following processes frame by frame.

A prediction parameter calculator 204 calculates prediction parameters using a known method. When a prediction filter 206 is constituted to have a long-term prediction filter (pitch prediction filter) 240 and a short-term prediction filter 242 cascade-connected as shown in FIG. 10, he prediction parameter calculator 204 calculates a pitch period, a pitch prediction coefficient, and a linear prediction coefficient (LPC parameter or reflection coefficient) by a know method, such as an autocorrelation method or covariance method. The calculation method is described in the document 2.

The calculate...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A speech signal is input to an excitation signal generating section, a prediction filter and a prediction parameter calculator. The prediction parameter calculator calculates a predetermined number of prediction parameters (LPC parameter or reflection coefficient) by an autocorrelation method or covariance method, and supplies the acquired prediction parameters to a prediction parameter coder. The codes of the prediction parameters are sent to a decoder and a multiplexer. The decoder sends decoded values of the codes of the prediction parameters to the prediction filter and the excitation signal generating section. The prediction filter calculates a prediction residual signal, which is the difference between the input speech signal and the decoded prediction parameter, and sends it to the excitation signal generating section. The excitation signal generating section calculates the pulse interval and amplitude for each of a predetermined number of subframes based on the input speech signal, the prediction residual signal and the quantized value of the prediction parameter, and sends them to the multiplexer. The multiplexer combines these codes and the codes of the prediction parameters, and send the results as an output signal of a coding apparatus to a transmission path or the like.

Description

TECHNICAL FIELDThe present invention relates to a speech coding apparatus which compresses a speech signal with a high efficiency and decodes the signal. More particularly, this invention relates to a speech coding apparatus based on a train of adaptive density excitation pulses and whose transfer bit rate can be set low, e.g., to 10 Kb / s or lower.BACKGROUND ARTTodays, coding technology for transferring a speech signal at a low bit rate of 10 Kb / s or lower has been extensively studied. As a practical method is known using a system in which an excitation signal of a speech synthesis filter is represented by a train of pulses aligned at predetermined intervals and the excitation signal is used for coding the speech signal. The details of this method are explained in the paper titled "Regular-Pulse Excitation--A Novel Approach to Effective and Efficient Multipulse Coding of Speech," written by Peter Kroon et al. in the IEEE Report, October 1986, Vol. ASSP-34, pp. 1054-1063 (Document 1)...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L19/04G10L19/10G10L19/113
CPCG10L19/113
Inventor AKAMINE, MASAMIMISEKI, KIMIO
Owner KK TOSHIBA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products