Voice enhancement method and device, multimedia data acquisition method and device, multimedia data playing method and device and monitoring system

A multimedia data package and voice enhancement technology, which is applied in voice analysis, closed-circuit television systems, instruments, etc., can solve the problems of affecting the sense of hearing, unsatisfactory effect, and high algorithm complexity

Active Publication Date: 2020-03-17
HANGZHOU HIKVISION DIGITAL TECH
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Speech enhancement technologies provided by related technologies include spectral subtraction, Wiener filtering, Kalman filtering, wavelet transform, etc. These algorithms suppress noise through filtering in the time domain, frequency domain, and wavelet transform domain, but the actual effect is not Ideally, for example, a related technology provides a method for enhancing the separation and enhancement of speech through blind source separation, but the implementation method of this method has high algorithm complexity, which is limited in practical applications, and the separated sounds are often The separation is not clean, which seriously affects the sense of hearing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice enhancement method and device, multimedia data acquisition method and device, multimedia data playing method and device and monitoring system
  • Voice enhancement method and device, multimedia data acquisition method and device, multimedia data playing method and device and monitoring system
  • Voice enhancement method and device, multimedia data acquisition method and device, multimedia data playing method and device and monitoring system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0151] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0152] It should be noted that, unless otherwise specified, technical terms or scientific terms used in this application shall have the usual meanings understood by those skilled in the art to which this application belongs.

[0153] In addition, the terms "first" and "second" are used to distinguish different objects, not to describe a specific order. Furthermore, the terms "include" and "have", as well as any variations thereof, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voice enhancement method and device, a voice acquisition method and device, a multimedia data acquisition method and device, a multimedia data playing method and device and amonitoring system. The voice enhancement method comprises the following steps: determining multi-channel frequency domain audio data obtained based on a microphone array; determining coordinate information of each microphone in the microphone array; determining sound source angle information according to the multi-channel frequency domain audio data and the coordinate information of each microphone; and enhancing the multi-channel frequency domain audio data according to the sound source angle information to obtain enhanced target frequency domain audio data. According to the invention, the sound source angle information can be determined according to the coordinate information of each microphone in the microphone array and the multi-channel frequency domain audio data, so that the voice emitted by the sound source can be enhanced precisely in a targeted manner, and the enhanced audio data can be played more clearly.

Description

technical field [0001] The present application relates to the technical field of speech enhancement, in particular to a speech enhancement method and device, a speech collection method and device, a multimedia data collection method and device, a multimedia data playback method and device, and a monitoring system. Background technique [0002] Speech enhancement refers to the technology of extracting useful speech signals from the noise background to suppress and reduce noise interference when the speech signal is interfered or even submerged by various noises. [0003] Speech enhancement technologies provided by related technologies include spectral subtraction, Wiener filtering, Kalman filtering, wavelet transform, etc. These algorithms suppress noise through filtering in the time domain, frequency domain, and wavelet transform domain, but the actual effect is not good. Ideally, for example, a related technology provides a method for enhancing the separation and enhancemen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0216G10L21/0232H04N7/18H04N21/43
CPCG10L21/0216G10L21/0232H04N7/18H04N21/4307G10L2021/02166
Inventor 陈扬坤钱能锋陈展
Owner HANGZHOU HIKVISION DIGITAL TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products