Voice processing method and device

A voice processing technology for voice data, applied in the field of data processing, that addresses problems such as the inability to separate voices correctly

Pending Publication Date: 2020-11-03
SHANGHAI MININGLAMP ARTIFICIAL INTELLIGENCE GRP CO LTD

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a voice processing method and device that at least solve the problem in the related art that voices cannot be separated correctly in scenes with complex environmental sounds.



Examples


Embodiment 1

[0059] The method embodiment of the present application can be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking execution on a mobile terminal as an example, Figure 1 is a block diagram of the hardware configuration of a mobile terminal for the voice processing method according to an embodiment of the present invention. As shown in Figure 1, the mobile terminal can include one or more processors 102 (only one is shown in Figure 1; processor 102 can include, but is not limited to, a processing device such as a microprocessor (MCU) or a programmable logic device (FPGA)) and a memory 104 for storing data. Optionally, the mobile terminal may also include a transmission device 106 and an input/output device 108 for communication functions. One of ordinary skill in the art will appreciate that the structure shown in Figure 1 is only schematic and does not limit the structure of the mobile terminal. For example, the mobile terminal may also include more or fewer com...

Embodiment 2

[0089] According to another embodiment of the present invention, a voice processing apparatus is provided. Figure 3 is a block diagram of a voice processing apparatus according to an embodiment of the present invention. As shown in Figure 3, the apparatus includes:

[0090] an obtaining module 32, configured to obtain multiple channels of voice data collected by a microphone array, wherein the microphone array comprises a plurality of microphones and the voice data collected by each microphone carries that microphone's identifier;

[0091] a determination module 34, configured to determine the sound intensity of the multiple channels of voice data;

[0092] a separation module 36, configured to perform voice separation according to the sound intensity of the multiple channels of voice data and the microphone identifiers carried by the multiple channels of voice data.
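The three modules can be pictured as a small pipeline. The following Python sketch is illustrative only and is not taken from the patent: the names VoiceData, obtain, determine_intensity and dominant_microphone are hypothetical, root-mean-square (RMS) amplitude is assumed as a stand-in for "sound intensity", and the separation rule shown (pick the loudest microphone) is an assumed simplification, since the text here does not state how intensity and identifier are combined.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass
class VoiceData:
    """One channel of audio tagged with the identifier of its microphone."""
    mic_id: str
    samples: Sequence[float]  # assumed: PCM samples normalised to [-1.0, 1.0]


def obtain(raw_channels: Dict[str, Sequence[float]]) -> List[VoiceData]:
    """Role of obtaining module 32: attach a microphone identifier to each channel."""
    return [VoiceData(mic_id, samples) for mic_id, samples in raw_channels.items()]


def determine_intensity(channel: VoiceData) -> float:
    """Role of determination module 34: RMS amplitude as an assumed intensity measure."""
    if not channel.samples:
        return 0.0
    return math.sqrt(sum(s * s for s in channel.samples) / len(channel.samples))


def dominant_microphone(channels: List[VoiceData]) -> str:
    """Role of separation module 36 (assumed rule): return the identifier of the loudest channel."""
    return max(channels, key=determine_intensity).mic_id


channels = obtain({
    "mic0": [0.5, -0.5, 0.5, -0.5],
    "mic1": [0.1, -0.1, 0.1, -0.1],
})
print(dominant_microphone(channels))
```

Keeping the identifier next to the samples means that later stages never need to rely on channel ordering to know which microphone a signal came from.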

[0093] Figure 4 is a first block diagram of a voice processing apparatus according to a preferred embodiment of the present invention. As shown in Figure 4, the separation module 36 comprises:

...

Embodiment 3

[0112] Embodiments of the present invention further provide a computer-readable storage medium that stores a computer program, wherein the computer program is arranged to perform, when run, the steps of any of the preceding method embodiments.

[0113] Optionally, in the present embodiment, the storage medium can be configured to store a computer program for performing the following steps:

[0114] S1, obtaining multiple channels of voice data collected by a microphone array, wherein the microphone array comprises a plurality of microphones and the voice data collected by each microphone carries that microphone's identifier;

[0115] S2, determining the sound intensity of the multiple channels of voice data;

[0116] S3, performing voice separation according to the sound intensity of the multiple channels of voice data and the microphone identifiers carried by the multiple channels of voice data.
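Steps S1 to S3 can be illustrated with a frame-by-frame sketch. This is one possible reading under stated assumptions, not the method claimed: the function names frame_rms and separate_by_intensity, the fixed frame length, the RMS measure, and the silence threshold floor are all hypothetical choices introduced for illustration. Each frame is attributed to the microphone whose channel is loudest in that frame, and frames below the threshold are attributed to no one.

```python
import math
from typing import Dict, List, Sequence


def frame_rms(frame: Sequence[float]) -> float:
    """RMS amplitude of one frame (assumed stand-in for sound intensity)."""
    return math.sqrt(sum(s * s for s in frame) / len(frame)) if frame else 0.0


def separate_by_intensity(
    channels: Dict[str, Sequence[float]],  # S1: samples keyed by microphone identifier
    frame_len: int,
    floor: float = 0.01,                   # assumed silence threshold
) -> Dict[str, List[int]]:
    """Return, for each microphone identifier, the indices of the frames attributed to it."""
    if not channels:
        return {}
    n = min(len(samples) for samples in channels.values())
    tracks: Dict[str, List[int]] = {mic_id: [] for mic_id in channels}
    for idx, start in enumerate(range(0, n - frame_len + 1, frame_len)):
        # S2: intensity of every channel for this frame
        levels = {
            mic_id: frame_rms(samples[start:start + frame_len])
            for mic_id, samples in channels.items()
        }
        # S3 (assumed rule): the loudest microphone owns the frame unless it is silent
        loudest = max(levels, key=levels.get)
        if levels[loudest] >= floor:
            tracks[loudest].append(idx)
    return tracks


demo = {
    "A": [0.8] * 4 + [0.0] * 4 + [0.1] * 4,
    "B": [0.1] * 4 + [0.0] * 4 + [0.6] * 4,
}
print(separate_by_intensity(demo, frame_len=4))
```

In the demo, the first frame is attributed to microphone A, the silent second frame is dropped, and the third frame is attributed to microphone B.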

[0117] Optionally, in the present embodiment, the storage medium may include, but is not limited to: a USB flash drive, read-only memory (Read-Only Memor...


Abstract

The invention provides a voice processing method and device. The method comprises the steps of: obtaining multiple channels of voice data collected by a microphone array, wherein the microphone array comprises a plurality of microphones and the voice data collected by each microphone carries a microphone identifier; determining the sound intensity of the multiple channels of voice data; and performing voice separation according to the sound intensity of the multiple channels of voice data and the microphone identifiers they carry. This can solve the problem in the related art that voices cannot be correctly separated in scenes with complex environmental sound: by using a plurality of directional microphone arrays, the speakers' voices are separated in a moderately noisy environment.

Description

Technical field

[0001] The present invention relates to the field of data processing, and in particular to a voice processing method and apparatus.

Background technique

[0002] Recording devices currently on the market that need to separate voices are used in quiet environments (for example, in a car) or in environments with regular background sound (for example, while watching TV). They use a two-dimensional multi-layer arrangement or a one-dimensional horizontal arrangement of 1-6 MICs, and determine the direction and type (voice, noise) of a sound from its propagation speed in order to separate the human voice (track). In complex environments (for example, business premises), changes in the background sound of the scene mean that this approach cannot isolate voices correctly (the result is mixed with noise and environmental sounds).

[0003] No solution has yet been proposed for the problem in the related art that voices cannot be separated correctly in scenes with complex environmental sounds.

Summary of the invention

[0004] Embodiments of the present invention provide a speech p...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/028
CPC: G10L21/028
Inventor: 李健, 沈忱, 王玉好, 梁志婷
Owner: SHANGHAI MININGLAMP ARTIFICIAL INTELLIGENCE GRP CO LTD