Voice signal processing method, device and system

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voice signal processing and voice signal technology, applied in the field of communication, can solve the problems of limited, inability to effectively improve the efficiency of voice coding and compression, and reduce the transmission delay, so as to reduce coding bits, improve the efficiency of voice coding and compression, and reduce the transmission delay. Effect

Active Publication Date: 2014-07-09

CHINA MOBILE COMM GRP CO LTD

View PDF11 Cites 10 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] Even if variable rate speech coding is adopted, since the judgment of speech frame activity is based on speech frames with a fixed time length, there may still be speech signals that have no communication significance in the speech frames that pass the activity detection. Therefore, it cannot be effectively improved. Speech coding compression efficiency, transmission delay reduction is also relatively limited

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0051] Embodiment 1 of the present invention provides a voice signal processing method, the step flow of the method can be as follows Figure 4 shown, including:

[0052] Step 101. Receive a speech signal to be encoded.

[0053] In this step, a speech signal to be encoded may be received, and the speech signal to be encoded includes at least two sampling signals.

[0054] Step 102, dividing sub-signals.

[0055] In this step, the speech signal to be encoded can be sequentially divided into sub-signals according to the number of sampling signals included in each sub-signal set, so that the speech energy of the divided sub-signals in the logarithmic domain can be used to characterize the speech The short-term energy of the signal. The number of sampled signals included in each of the set sub-signals may be determined according to training or empirical values.

[0056] Wherein, the number of sampling signals included in each sub-signal is set to be small enough to reflect cha...

Embodiment 2

[0089] The schematic diagram of the speech signal processing method provided by Embodiment 2 of the present invention can be as follows Figure 5 As shown, for the received speech signal to be encoded, the energy curve of the speech signal to be encoded in the logarithmic domain can be obtained, so as to obtain the energy envelope information of the speech signal to be encoded. In addition, the speech signal to be encoded can be divided into sub-signals, and the speech energy of each sub-signal in the logarithmic domain can be obtained, so that the frame length can be judged. After determining the sub-signals included in each speech frame (which can be interpreted as determining the sampling signal included in each speech frame) through the frame length determination, the speech signal to be encoded can be divided into frames and divided into multiple speech frames. And it can perform parameter / waveform / hybrid coding on each speech frame to obtain coding parameters. The final...

Embodiment 3

[0091] Embodiment 3 of the present invention provides a voice signal processing method, the step flow of the method can be as follows Image 6 shown, including:

[0092] Step 201, receiving information.

[0093] In this step, each speech frame encoded by the method as described in Embodiment 1 can be received, the sampling signal information included in each speech frame, and the energy envelope information (that is, the coded code stream in Embodiment 2) .

[0094] Step 202, perform decoding.

[0095] In this step, each encoded speech frame may be decoded according to the sampling signal information included in each speech frame.

[0096] Step 203, perform synthesis.

[0097] In this step, according to each decoded speech frame, the speech signal can be synthesized by using the energy envelope information. Thus, the speech signal to be encoded before encoding can be obtained.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice signal processing method, device and system. The voice signal processing method comprises the steps of dividing a voice signal to be coded into subsignals according to the set quantity of sampled signals included by each of the subsignals; dividing the voice signal to be coded into voice frames according to voice energy of each subsignal in a log domain, wherein the voice energy of each subsignal included by each of the divided voice frames in the log domain is not less than a set value or less than the set value. Therefore, the voice signal with high communication meaning and the voice signal with low communication meaning can be distinguished well by the aid of high or low voice energy of each subsignal in the log domain. More coding bits are allocated for voice frames with high communication meaning (namely, a first type of the voice frames), fewer coding bits are allocated for voice frames with low communication meaning (namely, a second type of the voice frames), therefore, the quantity of the coding bits is decreased, compression efficiency in voice coding is increased remarkably, and transmission delay is reduced.

Description

technical field [0001] The present invention relates to the communication field, in particular to a voice signal processing method, device and system. Background technique [0002] With the rapid development of mobile networks, voice applications based on the Internet Protocol (IP) domain on mobile terminals are gradually increasing. In addition to the traditional Voice over Internet Protocol (VoIP) technology, voice applications used in conjunction with instant messaging applications have also developed rapidly in recent years. Represented by Tencent's WeChat, Xiaomi's Michat, and China Mobile's Feichat, these instant messaging tools are equipped with voice applications that allow users to record voice signals and send them to communication receivers through the IP domain. Currently, these voice applications have been accepted by a large number of users, and are generally considered to be a development direction of the mobile Internet with great potential. [0003] For vo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L19/002G10L13/00

Inventor刘霖俞小良

OwnerCHINA MOBILE COMM GRP CO LTD

Voice signal processing method, device and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology