Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Linear spectral frequency parameter quantization bit allocation method and system

A technology for quantizing bits and line spectrum frequencies, applied in the field of speech coding, can solve the problems of not fully considering the influence degree of synthesized speech quality and the difference in the influence degree of synthesized speech quality, and achieve the effect of improving quality and improving quantization efficiency.

Active Publication Date: 2019-11-08
南京梧桐微电子科技有限公司
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, there are existing methods for allocating scalar quantization bits of line spectrum frequency parameters, but it is easy to fall into a local optimal value, and quantization distortion is used as the basis for bit allocation, which does not fully consider the influence of quantization distortion on the quality of synthesized speech. The actual situation is , each dimension of the line spectrum frequency parameter has a large difference in the degree of influence on the quality of the synthesized speech, so there are still defects in the existing technology that need to be overcome

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Linear spectral frequency parameter quantization bit allocation method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0028] Embodiment: the speech training set is sampled by 8KHz frequency, 16 bits are quantized, and the line spectrum frequency parameter dimension is 10, adopts LBG algorithm to generate the quantizer of 4,8,16,32 different quantization layer numbers, calculates each with P.862 software The synthesized speech quality MOS score corresponding to the quantizer is stored. B is determined by the total number of bits allocated to the line spectrum frequency parameters by the vocoder; in this step, when calculating the MOS value corresponding to different quantizers for each dimension parameter, the values ​​of other dimension parameters are not quantized;

[0029] (2) Set the initial bit allocation number of each dimension of the line spectrum frequency parameter to 5, that is, the number of quantization layers is 32; add and sum the quantization bits of all dimensions of the line spectrum frequency parameter to obtain b;

[0030] Example: the initial bit allocation of the 10-dimen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a linear spectral frequency parameter quantization bit allocation method and system. The method includes the following steps: an objective voice mean opinion score (MOS) is used as the basis of a linear spectral frequency parameter quantization bit allocation solution, initial bit allocation is obtained by applying quantized bit subtraction and an MOS comparing method, andthen an optimal bit allocation solution is searched by a simulated annealing algorithm. The method has the following advantages: the method takes into full account the influence of the difference between all dimensions of linear spectral frequency parameters on the quality of a synthetic voice, and applies the simulated annealing algorithm to search a globally optimal solution, which can further improve the quantization efficiency of the linear spectral frequency parameters and the quality of the synthetic voice.

Description

technical field [0001] The invention relates to a method and system for allocating quantized bits of line spectrum frequency parameters, and belongs to the technical field of speech coding. Background technique [0002] Speech coding is widely used in communication systems, recording and playback systems, and consumer products with voice functions. In recent years, the International Telecommunication Union (ITU), 3GPP, some regional organizations and countries have successively formulated a series of voice compression coding standards, the coding rate is getting lower and lower, and the quality of synthesized voice is getting higher and higher. At present, research at home and abroad mainly focuses on low-to-medium rate high-quality speech compression coding, which is mainly used in wireless communication, secure communication, underwater acoustic communication and other fields. In the above-mentioned speech coding algorithm, it is extremely important to efficiently quantiz...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/032G10L25/60
CPCG10L19/032G10L25/60
Inventor 颜夕宏张生平王主磊吴子晧颜明
Owner 南京梧桐微电子科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products