Artificial speech bandwidth expansion method and device based on K-SVD

A K-SVD-based bandwidth expansion technology, applied in speech analysis, instruments, and related fields, that addresses the high algorithm complexity, difficulty of adoption, and poor real-time performance of existing methods.

Active Publication Date: 2014-12-17
DALIAN UNIV OF TECH
Cites: 6 | Cited by: 19

AI Technical Summary

Problems solved by technology

[0011] In summary, existing methods built on the "source-filter" model of speech rely heavily on prior knowledge and have high algorithmic complexity. Their wideband spectral envelope estimation requires lengthy training of a codebook or statistical model, so real-time performance is poor and the methods are difficult to deploy widely.

Examples

Embodiment Construction

[0047] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0048] As shown in Figure 1, the device of the present invention includes a training unit 1 and an expansion unit 2. At the transmitting end of the bandwidth expansion system, the training unit 1 trains on the source wideband speech signal to obtain the wideband speech dictionary, the narrowband speech dictionary, and the narrowband speech sparse matrix, and transmits the wideband and narrowband speech dictionaries to the expansion unit 2 at the receiving end. The expansion unit 2 performs bandwidth expansion on the source narrowband speech signal according to the received wideband and narrowband speech dictionaries, and obtains the final expanded wideband speech signal.
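The receiving-end step in [0048] can be illustrated with a minimal sketch. It assumes (this is not stated in the excerpt) that the two dictionaries are coupled, so that the sparse code of a narrowband frame over the narrowband dictionary can be reused with the wideband dictionary. The sparse coder here is plain orthogonal matching pursuit, chosen only for illustration; the names `omp`, `expand_frame`, `D_nb`, `D_wb`, and `n_nonzero` are hypothetical, and the narrowband atoms are assumed to have unit norm.

```python
import numpy as np

def omp(D, y, n_nonzero):
    """Greedy orthogonal matching pursuit: find sparse x with D @ x close to y.

    Assumes the columns (atoms) of D have unit norm.
    """
    residual = y.astype(float).copy()
    support = []
    coef = np.zeros(0)
    x = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break
        support.append(k)
        # Least-squares fit of y on all atoms selected so far.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break
    x[support] = coef
    return x

def expand_frame(D_nb, D_wb, y_nb, n_nonzero=3):
    """Sparse-code a narrowband frame, then rebuild it with the wideband dictionary.

    Assumption for illustration: atom j of D_nb and atom j of D_wb describe
    the same speech pattern at the two bandwidths.
    """
    x = omp(D_nb, y_nb, n_nonzero)
    return D_wb @ x, x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    D_nb = rng.standard_normal((10, 30))
    D_nb /= np.linalg.norm(D_nb, axis=0)   # unit-norm narrowband atoms
    D_wb = rng.standard_normal((20, 30))   # stand-in wideband atoms
    y_nb = rng.standard_normal(10)         # stand-in narrowband feature frame
    y_wb, x = expand_frame(D_nb, D_wb, y_nb)
    print("non-zero coefficients:", np.count_nonzero(x))
```

The dictionaries and frames above are random stand-ins; in the described device they would come from the training unit 1 and from the source narrowband speech signal.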

[0049] As shown in Figure 2, the training unit 1 includes a low-pass filter module 11, a parameter extraction module 12 based on...

Abstract

The invention discloses an artificial speech bandwidth expansion method and device based on K-SVD. The method comprises: (1) at the bandwidth expansion transmitting end, training on the original wideband speech signal to obtain a wideband speech dictionary, a narrowband speech dictionary, and a narrowband speech sparse matrix; (2) at the bandwidth expansion receiving end, performing bandwidth expansion on the original narrowband speech signal using the wideband speech dictionary and the narrowband speech dictionary to obtain the final expanded wideband speech signal. The method improves wideband speech quality, greatly reduces training time and the use of prior knowledge, effectively improves the accuracy of the sparse matrix of the original narrowband speech, is highly practical, and can be widely applied in the field of speech communication.
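The abstract does not spell out the training procedure. As background, the generic K-SVD algorithm alternates a sparse coding stage with a per-atom dictionary update, in which each atom and its coefficients are replaced by the best rank-1 fit to the residual of the signals that use that atom. The following is a minimal sketch of that textbook update step only, not of the patented training unit; the names `ksvd_atom_update`, `Y`, `D`, and `X` are hypothetical.

```python
import numpy as np

def ksvd_atom_update(Y, D, X, k):
    """Textbook K-SVD update of atom k, in place.

    Y: (n_features, n_signals) training data
    D: (n_features, n_atoms) float dictionary
    X: (n_atoms, n_signals) float sparse codes
    """
    # Signals whose sparse code currently uses atom k.
    users = np.nonzero(X[k, :])[0]
    if users.size == 0:
        return  # unused atom: left unchanged in this sketch
    # Residual of those signals with atom k's own contribution added back.
    E = Y[:, users] - D @ X[:, users] + np.outer(D[:, k], X[k, users])
    # The best rank-1 approximation of E gives the new atom and coefficients.
    U, s, Vt = np.linalg.svd(E, full_matrices=False)
    D[:, k] = U[:, 0]
    X[k, users] = s[0] * Vt[0, :]

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    Y = rng.standard_normal((6, 20))
    D = rng.standard_normal((6, 8))
    D /= np.linalg.norm(D, axis=0)
    X = rng.standard_normal((8, 20)) * (rng.random((8, 20)) < 0.3)
    before = np.linalg.norm(Y - D @ X)
    for k in range(D.shape[1]):
        ksvd_atom_update(Y, D, X, k)
    after = np.linalg.norm(Y - D @ X)
    print(f"reconstruction error: {before:.3f} -> {after:.3f}")
```

Because only the already non-zero coefficients are rewritten, the sparsity pattern of `X` is preserved and the reconstruction error cannot increase at any update.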

Description

Technical field

[0001] The present invention relates to a bandwidth expansion method and device, in particular to an artificial speech bandwidth expansion method and device based on K-SVD (K-means Singular Value Decomposition).

Background

[0002] Human speech energy is mainly distributed in the frequency range of 0.05-8 kHz. In voice communication systems such as the PSTN (Public Switched Telephone Network) and GSM (Global System for Mobile Communications), the transmitted voice signal bandwidth is generally below 4 kHz for reasons of technology, cost, and system complexity; this type of speech is called narrowband speech. Narrowband voice communication reduces bandwidth requirements and preserves a certain level of clarity, but it reduces the naturalness of the speech. In some special settings, such as teleconferencing systems, narrowband voice sounds unnatural and struggles to meet ...

Application Information

IPC(8): G10L21/038, G10L19/107, G10L25/27
Inventor: 陈喆, 殷福亮, 隋经纬
Owner: DALIAN UNIV OF TECH