Intelligent voice mixing method and device for multi-party voice communication

A voice call and voice channel technology, applied in the multimedia field, that addresses low speech volume, interference, and the difficulty listeners have in identifying the content of the speech and the identity of the speaker.

Active Publication Date: 2015-04-22

GUANGZHOU HUADUO NETWORK TECH

6 Cites, 22 Cited by


AI Technical Summary

Problems solved by technology

[0005] When many participants are involved in the audio mixing process, each participant's environment contributes background noise, so even when no participant is speaking, a "humming" sound can be heard in the final mixed audio data. In addition, because of the large number of participants, the speaker's voice becomes very quiet due to attenuation, and it is difficult for the audience to identify the content of the speech and the identity of the speaker.
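The two effects described in [0005] can be reproduced with a small numeric sketch. The source does not state which mixing rule produces the attenuation, so this example assumes plain sample-wise summation for the noise build-up and averaging (dividing by the channel count) for the attenuation. The frame size, the noise level, and the 440 Hz tone standing in for speech are all illustrative assumptions.

```python
import math
import random

FRAME_LEN = 160  # assumed frame size (20 ms at 8 kHz); not taken from the source


def rms(frame):
    """Root-mean-square level of one frame."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def mix_sum(frames):
    """Superpose equal-length frames sample by sample."""
    return [sum(samples) for samples in zip(*frames)]


def mix_average(frames):
    """Superpose, then divide by the channel count (assumed mixing rule)."""
    return [s / len(frames) for s in mix_sum(frames)]


def noise_frame(rng, level=0.01):
    """Synthetic background noise standing in for one silent participant."""
    return [rng.uniform(-level, level) for _ in range(FRAME_LEN)]


def speech_frame(amplitude=0.5):
    """A 440 Hz tone standing in for the one participant who is speaking."""
    return [amplitude * math.sin(2 * math.pi * 440 * t / 8000)
            for t in range(FRAME_LEN)]


if __name__ == "__main__":
    rng = random.Random(0)
    for n in (2, 8, 32):
        silent = [noise_frame(rng) for _ in range(n)]
        talking = [speech_frame()] + [noise_frame(rng) for _ in range(n - 1)]
        print(n,
              "noise-only sum RMS:", rms(mix_sum(silent)),
              "averaged speech RMS:", rms(mix_average(talking)))
```

As the participant count grows, the noise-only sum gets louder while the averaged speech gets quieter, which is the motivation for mixing only a selected subset of channels.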

Examples


Embodiment 1

[0099] An embodiment of the present invention provides an intelligent mixing method for multi-party voice calls; see Figure 1.

[0100] The method flow includes:

[0101] 101: During a voice call, obtain the current frame data of each active voice channel except the local end;

[0102] 102: Obtain the voice activity detection results of the current frame data of each active voice channel and the short-term average energy of each active voice channel;

[0103] 103: Select the voice channels for mixing according to the voice activity detection results of the current frame data of each active voice channel, the short-term average energy of each active voice channel, the number of voice channels carrying effective voice, and the gating identifier corresponding to each active voice channel. The gating identifier is the selection result recorded for each active voice channel during the previous voice channel selection;

[0104] 104: Perform superimposed sound mixing process...
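Steps 103 and 104 can be sketched as follows. The excerpt names the inputs to the selection (voice activity detection result, short-term average energy, number of channels with effective voice, and the gating identifier from the previous selection) but does not give the selection rule itself. The rule below is therefore an assumption made only for illustration: keep all voiced channels when there are few of them, otherwise rank by energy with a bonus for channels that were selected last time. `ChannelState`, `MAX_MIXED`, and `GATED_BONUS` are hypothetical names and values.

```python
from dataclasses import dataclass

MAX_MIXED = 3      # hypothetical cap on simultaneously mixed channels
GATED_BONUS = 1.5  # hypothetical weight favouring previously selected channels


@dataclass
class ChannelState:
    channel_id: str
    frame: list          # current frame as 16-bit PCM sample values
    vad_active: bool     # voice activity detection result for this frame
    energy: float        # short-term average energy of the channel
    gated: bool = False  # gating identifier: selected in the previous round?


def select_channels(channels, max_mixed=MAX_MIXED):
    """Pick channels to mix and record the result as the new gating identifiers."""
    voiced = [c for c in channels if c.vad_active]
    if len(voiced) <= max_mixed:
        selected = voiced  # few enough voiced channels: mix them all
    else:
        # Assumed rule: rank by energy, biased toward channels gated last time.
        ranked = sorted(
            voiced,
            key=lambda c: c.energy * (GATED_BONUS if c.gated else 1.0),
            reverse=True,
        )
        selected = ranked[:max_mixed]
    chosen = {c.channel_id for c in selected}
    for c in channels:
        c.gated = c.channel_id in chosen
    return selected


def mix_superpose(selected, frame_len):
    """Sum the selected frames sample by sample and clip to the 16-bit range."""
    if not selected:
        return [0] * frame_len  # nobody selected: output silence
    summed = [sum(samples) for samples in zip(*(c.frame for c in selected))]
    return [max(-32768, min(32767, s)) for s in summed]
```

Because the gating identifiers are updated on every call, a channel that was mixed in the previous frame is less likely to be dropped by a small dip in energy, which avoids audible switching between speakers. This hysteresis is a design choice of the sketch, not something the excerpt confirms.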

Embodiment 2

[0113] An embodiment of the present invention provides an intelligent mixing method for multi-party voice calls; see Figure 3.

[0114] 301: During a voice call, obtain the current frame data of each active voice channel except the local end.

[0115] When the voice data sent by each active voice channel is received, this step starts to be executed for each frame of data of each active voice channel. The voice data sent by each active voice channel is divided into frames to obtain the current frame data.

[0116] Step 301 can be realized through the following process:

[0117] Obtain the voice data stream of each active voice channel except the local end, and perform frame division processing on the voice data stream of each active voice channel to obtain the current frame data in the voice data stream of each active voice channel.
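The frame division in [0117] can be sketched as a small per-channel buffer that accepts arbitrary-sized chunks of the incoming voice data stream and hands back complete frames. The class name `FrameSplitter` and the default frame length of 160 samples are assumptions for illustration; the excerpt does not specify a frame size.

```python
class FrameSplitter:
    """Buffers one channel's sample stream and emits fixed-length frames."""

    def __init__(self, frame_len=160):
        self.frame_len = frame_len  # assumed frame size, not from the source
        self._pending = []          # samples waiting for a full frame

    def push(self, samples):
        """Add newly received samples and return every complete frame."""
        self._pending.extend(samples)
        frames = []
        while len(self._pending) >= self.frame_len:
            frames.append(self._pending[:self.frame_len])
            del self._pending[:self.frame_len]
        return frames


# One splitter per active voice channel, excluding the local end.
splitters = {"remote-1": FrameSplitter(), "remote-2": FrameSplitter()}
current = splitters["remote-1"].push([0] * 400)
print(len(current))  # two complete frames; 80 samples stay buffered
```

Keeping leftover samples in the buffer means network packets do not have to align with frame boundaries.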

[0118] 302: Acquire the voice activity detection results of the current frame data of each active voice channel and the short-term a...

Embodiment 3

[0166] An embodiment of the present invention provides an intelligent mixing method for multi-party voice calls; see Figure 4.

[0167] 401: During a voice call, obtain the current frame data of each active voice channel except the local end.

[0168] When the voice data sent by each active voice channel is received, this step starts to be executed for each frame of data of each active voice channel. The voice data sent by each active voice channel is divided into frames to obtain the current frame data.

[0169] Step 401 can be realized through the following process:

[0170] Obtain the voice data stream of each active voice channel except the local end, and perform frame division processing on the voice data stream of each active voice channel to obtain the current frame data in the voice data stream of each active voice channel.

[0171] 402: Obtain the voice activity detection results of the current frame data of each active voice channel and the short-term a...


Abstract

The invention discloses an intelligent voice mixing method and device for multi-party voice communication, belonging to the technical field of multimedia. The method comprises: during voice communication, obtaining the current frame data of all active voice channels except the local end; obtaining the voice activity detection results of the current frame data of all active voice channels and the short-term average energy of all active voice channels; selecting the voice channels for mixing according to the voice activity detection results of the current frame data of all active voice channels, the short-term average energy of all active voice channels, the number of voice channels with effective voice, and the gating identifiers corresponding to all active voice channels; and performing superposition mixing on the current frame data of the selected voice channels and outputting the resulting mixed data. The method and device reduce the noise generated in multi-party voice communication, improve the clarity of voice in multi-party voice communication, and improve the execution efficiency of multi-party voice communication.
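The two per-channel measurements named in the abstract can be illustrated with a minimal sketch. The excerpt does not define how the voice activity detection result or the short-term average energy is computed, so this example assumes a mean-square frame energy, exponential smoothing across frames, and a fixed energy threshold as a stand-in detector. `EnergyTracker`, `alpha`, and `vad_threshold` are hypothetical names and parameters.

```python
def frame_energy(frame):
    """Mean-square energy of one frame of PCM samples."""
    return sum(s * s for s in frame) / len(frame)


class EnergyTracker:
    """Tracks one channel's smoothed energy and a threshold-based VAD flag."""

    def __init__(self, alpha=0.9, vad_threshold=1.0e5):
        self.alpha = alpha                  # smoothing factor (assumed)
        self.vad_threshold = vad_threshold  # energy threshold (assumed)
        self.short_term_energy = 0.0

    def update(self, frame):
        """Return (vad_active, short_term_energy) for the current frame."""
        energy = frame_energy(frame)
        # Exponential moving average as the short-term average energy.
        self.short_term_energy = (
            self.alpha * self.short_term_energy + (1.0 - self.alpha) * energy
        )
        vad_active = energy > self.vad_threshold
        return vad_active, self.short_term_energy
```

A production detector would normally use more than a fixed energy threshold; the point here is only to show the shape of the per-frame inputs that the channel selection consumes.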

Description

Technical field

[0001] The invention relates to the field of multimedia technology, in particular to an intelligent sound mixing method and device for multi-party voice calls.

Background art

[0002] With the increasing demand for long-distance communication, VoIP (Voice over Internet Protocol) technology based on voice packet switching is increasingly popular with users because of its low cost, easy expansion, and excellent call quality. Multi-party voice call services on the Internet are also becoming more widespread. A multi-party voice call needs to transmit the voice of any party to every other party, and any party can hear the voices of multiple other parties at the same time, so the voice data of all parties must be mixed.

[0003] At present, the audio mixing process is that the audio mixing server receives the voice data sent by the terminals of each participant, performs audio mixing processing on all the audio data of each pa...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): H04M3/56, H04L12/64

Inventor: 林成保, 黄博贤, 梁俊斌

Owner: GUANGZHOU HUADUO NETWORK TECH
