Voice noise reduction method for conference terminal based on neural network model

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A neural network model and conference terminal technology, which is applied in the field of speech noise reduction of conference terminals based on neural network model, can solve problems such as large error and intermittent sound, and achieve real-time performance, reduction of calculation amount, and strong feature learning ability. Effect

Active Publication Date: 2018-12-21

FUJIAN STAR NET WISDOM TECH CO LTD

View PDF4 Cites 43 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The fewer output nodes, the smaller the amount of calculation, but at the same time, the error is larger when the difference is expanded, especially when the signal-to-noise ratio is low, some weaker speech signals will be significantly suppressed, resulting in intermittent sound

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] In order to make the present invention more comprehensible, a preferred embodiment is now described in detail with accompanying drawings as follows.

[0034] Such as figure 1 As shown, a kind of conference terminal voice noise reduction method based on neural network model of the present invention comprises the following steps:

[0035] Step 1. A conference terminal device with a single microphone collects the audio file and generates a digital audio signal in the time domain, which is mixed with a voice signal and a noise signal;

[0036] Step 2, subframe the digital audio signal in the time domain and transfer it from the time domain to the frequency domain after performing the short-time Fourier transform; specifically:

[0037] Divide the digital audio signal in the time domain into frames, set every 10ms as a frame, a total of N frames, and N is a positive integer; set the 0th frame before the 1st frame as the compensation frame, and set the N+th frame after the N...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice noise reduction method for a conference terminal based on the neural network model. The method comprises steps that S1, an audio file is collected by the conference terminal device to generate a digital audio signal in the time domain; S2, the digital audio signal is framed, and short-time Fourier transform is performed; S3, the amplitude spectrum of the frequency domain is mapped into a frequency band, and a Mel-frequency cepstral coefficient is further solved; S4, first-order and second-order differential coefficients are calculated through utilizing the Mel-frequency cepstral coefficient, a pitch correlation coefficient is calculated on each frequency band, and pitch period features and VAD features are further extracted; S5, input characteristic parameters of an audio are used as the input of the neural network model, the neural network is trained offline, the frequency band gain generating the noise reduction speech is learned, and the trained weightis solidified; S6, the neural network model is utilized to learn, the frequency band gain is generated, the outputted frequency band gain is mapped to the spectrum, the phase information is added, and a noise reduction speech signal is reduced through inverse Fourier transform. The method is advantaged in that real-time noise reduction can be achieved.

Description

technical field [0001] The invention relates to the technical fields of voice processing and communication, in particular to a neural network model-based voice noise reduction method for a conference terminal. Background technique [0002] Speech noise reduction technology refers to removing noise from noisy audio signals, and has a wide range of applications, such as mobile terminals and conference terminal equipment. The research on speech noise reduction technology has a long history, and monophonic speech noise reduction is a very challenging topic. Using only one microphone for speech noise reduction can not only reduce the cost of equipment, but also make it more convenient in actual use. [0003] In the prior art, the original amplitude spectrum is used as the input of the neural network. Too many input nodes lead to a large amount of calculation, which affects real-time voice communication, and requires further compression of the amplitude spectrum. The amplitude s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/0216G10L21/0232G10L25/30G10L25/24G10L25/18

CPCG10L21/0216G10L21/0232G10L25/18G10L25/24G10L25/30G10L2021/02163

Inventor 薛建清陈东敏刘敏何志辉

Owner FUJIAN STAR NET WISDOM TECH CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice noise reduction method for conference terminal based on neural network model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology