Intelligent voice mixing method and device for multi-party voice communication

A voice call and voice channel technology, applied in the multimedia field, that addresses low speech volume, interference, and the audience's difficulty in identifying the content of the speech and the identity of the speaker.

Active Publication Date: 2015-04-22
GUANGZHOU HUADUO NETWORK TECH

AI Technical Summary

Problems solved by technology

[0005] When many participants are involved in the audio mixing process, the background noise in each participant's environment means that a "humming" sound is audible in the final mixed audio even when no participant is speaking. In addition, because of the large number of participants in the conversation, the speech is attenuated to a very low volume, making it difficult for the audience to identify the content of the speech and the identity of the speaker.

Examples

Embodiment 1

[0099] An embodiment of the present invention provides an intelligent mixing method for multi-party voice calls, see Figure 1.

[0100] Wherein, the method flow includes:

[0101] 101: During a voice call, obtain the current frame data of each active voice channel except the local end;

[0102] 102: Obtain the voice activity detection results of the current frame data of each active voice channel and the short-term average energy of each active voice channel;

[0103] 103: According to the voice activity detection results of the current frame data of each active voice channel, the short-term average energy of each active voice channel, the number of voice channels with effective voice, and the gating identifier corresponding to each active voice channel, select the voice channels for mixing; the gating identifier is the selection result recorded for each active voice channel during the previous voice channel selection;

[0104] 104: Perform superimposed sound mixing process...
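The selection in step 103 can be illustrated with a minimal sketch. The excerpt above names the inputs (voice activity detection result, short-term average energy, number of channels with effective voice, and the previous gating identifier) but not the exact rule, so the ranking below is only one plausible reading. The names `ChannelFrame`, `select_channels`, `max_mixed`, and `hysteresis` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class ChannelFrame:
    channel_id: str
    vad_active: bool  # voice activity detection result for the current frame
    energy: float     # short-term average energy of the channel
    gated: bool       # gating identifier recorded at the previous selection


def select_channels(frames, max_mixed=3, hysteresis=1.5):
    """Return (selected channel ids, updated gating identifiers).

    Assumed rule, not the patent's: keep only channels whose current frame
    contains voice, rank them by energy, and give previously selected
    channels a bonus so the selection does not flip on every frame.
    """
    voiced = [f for f in frames if f.vad_active]

    def score(f):
        return f.energy * (hysteresis if f.gated else 1.0)

    # If no more than max_mixed channels carry voice, all of them are kept.
    ranked = sorted(voiced, key=score, reverse=True)
    selected = [f.channel_id for f in ranked[:max_mixed]]

    # Record the result so it can serve as the gating identifier next time.
    gating = {f.channel_id: f.channel_id in selected for f in frames}
    return selected, gating
```

With this rule, a loud channel that contains only background noise is never mixed, because it fails the voice activity check before energy is compared.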

Embodiment 2

[0113] An embodiment of the present invention provides an intelligent mixing method for multi-party voice calls, see Figure 3.

[0114] 301: During a voice call, obtain the current frame data of each active voice channel except the local end.

[0115] When the voice data sent by each active voice channel is received, this step starts to be executed for each frame of data of each active voice channel. The voice data sent by each active voice channel is divided into frames to obtain the current frame data.

[0116] Wherein, step 301 can be realized through the following process:

[0117] Obtain the voice data stream of each active voice channel except the local end, and perform frame division processing on the voice data stream of each active voice channel to obtain the current frame data in the voice data stream of each active voice channel.
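The frame division described in [0117] might look like the following sketch. The frame length is an assumption (320 samples, which would be 20 ms at a 16 kHz sampling rate); the patent excerpt does not state one, and `split_into_frames` is a hypothetical helper name.

```python
def split_into_frames(samples, frame_len=320):
    """Split a PCM sample sequence into complete frames plus a remainder.

    The remainder holds trailing samples that do not yet fill a frame, so a
    caller can prepend it to the next chunk received on the same channel.
    """
    n_full = len(samples) // frame_len
    frames = [samples[i * frame_len:(i + 1) * frame_len] for i in range(n_full)]
    remainder = samples[n_full * frame_len:]
    return frames, remainder
```

The last element of `frames` for each channel would then play the role of that channel's current frame data.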

[0118] 302: Acquire the voice activity detection results of the current frame data of each active voice channel and the short-term a...
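The remainder of step 302 is not shown here, so the sketch below only illustrates the two quantities it names, using common textbook definitions rather than the patent's own. The frame energy is taken as the mean of the squared samples, the short-term average energy as an exponentially smoothed frame energy, and voice activity detection as a plain energy threshold. The function names and the `alpha` and `threshold` values are assumptions.

```python
def frame_energy(frame):
    """Mean of squared samples for one frame (0.0 for an empty frame)."""
    return sum(s * s for s in frame) / len(frame) if frame else 0.0


def update_short_term_energy(prev_energy, frame, alpha=0.9):
    """Exponentially smooth the per-frame energy across frames.

    alpha close to 1.0 makes the average react slowly to new frames.
    """
    return alpha * prev_energy + (1.0 - alpha) * frame_energy(frame)


def simple_vad(frame, threshold=1.0e4):
    """Energy-threshold stand-in for a voice activity detector."""
    return frame_energy(frame) > threshold
```

A production detector would typically use more than energy (for example spectral features), so `simple_vad` should be read as a placeholder for whatever detector the method actually uses.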

Embodiment 3

[0166] An embodiment of the present invention provides an intelligent mixing method for multi-party voice calls, see Figure 4.

[0167] 401: During a voice call, obtain the current frame data of each active voice channel except the local end.

[0168] When the voice data sent by each active voice channel is received, this step starts to be executed for each frame of data of each active voice channel. The voice data sent by each active voice channel is divided into frames to obtain the current frame data.

[0169] Wherein, step 401 can be realized through the following process:

[0170] Obtain the voice data stream of each active voice channel except the local end, and perform frame division processing on the voice data stream of each active voice channel to obtain the current frame data in the voice data stream of each active voice channel.

[0171] 402: Obtain the voice activity detection results of the current frame data of each active voice channel and the short-term a...

Abstract

The invention discloses an intelligent voice mixing method and device for multi-party voice communication, belonging to the technical field of multimedia. The method comprises: during voice communication, obtaining the current frame data of all active voice channels except the local end; obtaining the voice activity detection results of the current frame data of all the active voice channels and the short-term average energy of all the active voice channels; selecting the voice channels for voice mixing according to the voice activity detection results of the current frame data of all the active voice channels, the short-term average energy of all the active voice channels, the number of voice channels with effective voice, and the gating identifiers corresponding to all the active voice channels; and performing superposition voice mixing on the current frame data of the selected voice channels and outputting the resulting mixed voice data. The method and device reduce the noise generated in multi-party voice communication, improve the clarity of the voice, and improve the execution efficiency of multi-party voice communication.
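The final superposition step can be sketched as a sample-by-sample sum of the selected channels' current frames. The 16-bit signed PCM format and the saturation to its range are assumptions, since the abstract does not state a sample format or how overflow is handled; `mix_frames` is a hypothetical name.

```python
def mix_frames(frames):
    """Sum equal-length 16-bit PCM frames sample by sample with saturation."""
    if not frames:
        return []
    length = len(frames[0])
    if any(len(f) != length for f in frames):
        raise ValueError("all frames must have the same length")

    mixed = []
    for samples in zip(*frames):
        total = sum(samples)
        # Clamp to the int16 range so loud overlapping speech does not wrap.
        mixed.append(max(-32768, min(32767, total)))
    return mixed
```

Because only the selected channels are summed, channels carrying just background noise contribute nothing to the output, which is the noise reduction the abstract describes.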

Description

Technical field

[0001] The invention relates to the field of multimedia technology, in particular to an intelligent sound mixing method and device for multi-party voice calls.

Background technique

[0002] With the increasing demand for long-distance communication, VOIP (Voice over Internet Protocol) technology based on voice packet switching is more and more popular with users because of its low cost, easy expansion and excellent call quality. The application of multi-party voice call service on the Internet is also becoming more and more extensive. A multi-party voice call needs to transmit the voice of any party to any other party, and any party can hear the voices of multiple other parties at the same time, so it is necessary to mix the voice data of all parties.

[0003] At present, the audio mixing process is that the audio mixing server receives the voice data sent by the terminals of each participant, performs audio mixing processing on all the audio data of each pa...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04M3/56; H04L12/64
Inventors: 林成保, 黄博贤, 梁俊斌
Owner: GUANGZHOU HUADUO NETWORK TECH