Voice noise reduction method for conference terminal based on neural network model

A neural network model and conference terminal technology, which is applied in the field of speech noise reduction of conference terminals based on neural network model, can solve problems such as large error and intermittent sound, and achieve real-time performance, reduction of calculation amount, and strong feature learning ability. Effect

Active Publication Date: 2018-12-21
FUJIAN STAR NET WISDOM TECH CO LTD
View PDF4 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The fewer output nodes, the smaller the amount of calculation, but at the same time, the error is larger when the difference is expanded, es

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice noise reduction method for conference terminal based on neural network model
  • Voice noise reduction method for conference terminal based on neural network model
  • Voice noise reduction method for conference terminal based on neural network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the present invention more comprehensible, a preferred embodiment is now described in detail with accompanying drawings as follows.

[0034] Such as figure 1 As shown, a kind of conference terminal voice noise reduction method based on neural network model of the present invention comprises the following steps:

[0035] Step 1. A conference terminal device with a single microphone collects the audio file and generates a digital audio signal in the time domain, which is mixed with a voice signal and a noise signal;

[0036] Step 2, subframe the digital audio signal in the time domain and transfer it from the time domain to the frequency domain after performing the short-time Fourier transform; specifically:

[0037] Divide the digital audio signal in the time domain into frames, set every 10ms as a frame, a total of N frames, and N is a positive integer; set the 0th frame before the 1st frame as the compensation frame, and set the N+th frame after the N...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voice noise reduction method for a conference terminal based on the neural network model. The method comprises steps that S1, an audio file is collected by the conference terminal device to generate a digital audio signal in the time domain; S2, the digital audio signal is framed, and short-time Fourier transform is performed; S3, the amplitude spectrum of the frequency domain is mapped into a frequency band, and a Mel-frequency cepstral coefficient is further solved; S4, first-order and second-order differential coefficients are calculated through utilizing the Mel-frequency cepstral coefficient, a pitch correlation coefficient is calculated on each frequency band, and pitch period features and VAD features are further extracted; S5, input characteristic parameters of an audio are used as the input of the neural network model, the neural network is trained offline, the frequency band gain generating the noise reduction speech is learned, and the trained weightis solidified; S6, the neural network model is utilized to learn, the frequency band gain is generated, the outputted frequency band gain is mapped to the spectrum, the phase information is added, and a noise reduction speech signal is reduced through inverse Fourier transform. The method is advantaged in that real-time noise reduction can be achieved.

Description

technical field [0001] The invention relates to the technical fields of voice processing and communication, in particular to a neural network model-based voice noise reduction method for a conference terminal. Background technique [0002] Speech noise reduction technology refers to removing noise from noisy audio signals, and has a wide range of applications, such as mobile terminals and conference terminal equipment. The research on speech noise reduction technology has a long history, and monophonic speech noise reduction is a very challenging topic. Using only one microphone for speech noise reduction can not only reduce the cost of equipment, but also make it more convenient in actual use. [0003] In the prior art, the original amplitude spectrum is used as the input of the neural network. Too many input nodes lead to a large amount of calculation, which affects real-time voice communication, and requires further compression of the amplitude spectrum. The amplitude s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/0216G10L21/0232G10L25/30G10L25/24G10L25/18
CPCG10L21/0216G10L21/0232G10L25/18G10L25/24G10L25/30G10L2021/02163
Inventor 薛建清陈东敏刘敏何志辉
Owner FUJIAN STAR NET WISDOM TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products