Deep-learning-based speech tone quality enhancement method, device and system

A deep learning and voice quality technology, applied in voice analysis, instruments, etc., can solve problems such as roughness, no corresponding feasible solution, and inability to restore high-quality voice quality, so as to achieve the effect of high-efficiency voice quality

Active Publication Date: 2019-01-04
ANKER INNOVATIONS TECH CO LTD
View PDF10 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, at present, there is no corresponding feasible solution for low-quality bit-rate speech reconstruction using software methods
For the reconstruction of low-quality bit-rate speech, the method of filling or interpolating data is usually adopted, but this method is too rough to restore the sound quality of high-quality speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep-learning-based speech tone quality enhancement method, device and system
  • Deep-learning-based speech tone quality enhancement method, device and system
  • Deep-learning-based speech tone quality enhancement method, device and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the objects, technical solutions and advantages of the present invention more apparent, exemplary embodiments according to the present invention will be described in detail below with reference to the accompanying drawings. Apparently, the described embodiments are only some embodiments of the present invention, rather than all embodiments of the present invention, and it should be understood that the present invention is not limited by the exemplary embodiments described here. Based on the embodiments of the present invention described in the present invention, all other embodiments obtained by those skilled in the art without creative effort shall fall within the protection scope of the present invention.

[0038] First, refer to figure 1 An example electronic device 100 for implementing the method, device and system for enhancing speech sound quality based on deep learning according to an embodiment of the present invention will be described.

[003...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a deep-learning-based speech tone quality enhancement method, device and system. The method comprises: to-be-processed speech data are obtained and feature extraction is carriedout on the to-be-processed speech data to obtain features of the to-be-processed speech data; and on the basis of the features of the to-be-processed speech data, the to-be-processed speech data arereconstructed to be output speech data by using a trained speech reconstruction neural network, wherein the speech quality of the output speech data is higher than that of the to-be-processed speech data. According to the invention, the low-quality speech quality can be enhanced based on the deep learning method and the low-quality speech quality is reconstructed by the deep neural network to obtain the high-quality speech tone quality, so that the tone quality improvement effect that can not be realized by the traditional method is realized.

Description

technical field [0001] The present invention relates to the technical field of sound quality optimization, and more particularly to a method, device and system for enhancing speech sound quality based on deep learning. Background technique [0002] In recent years, voice wireless communication has developed rapidly and is currently widely used in various civil and industrial fields. The wireless communication is limited by the bandwidth, and it is required to compress the speech coding and reduce the sampling frequency and code rate of the speech as much as possible. Although speech coding reduces the speech quality, it also greatly saves resources. Early digital voice communication coding, such as Global System for Mobile Communications-Half Rate (GMS-HR), has a code rate of about 6.5kbps, adopts a sampling frequency of 8kHz, and has an actual bandwidth of less than 4k, which loses a lot of high-frequency information and makes the human voice lack The degree of recognitio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/007G10L25/30
CPCG10L21/007G10L25/30G10L21/0208G10L21/034G10L2021/02082
Inventor 秦宇姚青山喻浩文卢峰
Owner ANKER INNOVATIONS TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products