Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for encoding and synthesizing speech based on speech primitive

A primitive and voice technology, applied in the fields of voice coding, voice transmission, and voice telephony, it can solve the problems of voice quality loss, lossy compression of voice coding, etc.

Active Publication Date: 2012-07-04
孟智平
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although low-rate voice compression coding brings convenience to channel transmission and saves storage space, since most voice coding is lossy compression, voice quality will inevitably be lost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for encoding and synthesizing speech based on speech primitive
  • Method and system for encoding and synthesizing speech based on speech primitive
  • Method and system for encoding and synthesizing speech based on speech primitive

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0095] The speech primitives in the present invention can be phonemes, or waveforms intercepted at equal or variable frames, and different speech primitive model libraries can be established by using different speech primitives. In specific implementation, one of the model libraries can be used as the basis to encode and decode the transmitted speech; several model libraries can also be used in combination to encode complex speech in some special cases .

[0096] The basic idea of ​​the present invention is: collect a large amount of speech stream data samples, carry out the automatic segmentation of speech primitive to continuous speech stream, form speech primitive set, extract the feature of speech primitive, and adopt the method of fuzzy clustering to The speech primitive set is clustered to establish a speech primitive model library; based on the established speech primitive model library, when a continuous speech stream is obtained, the speech stream is automatically seg...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech coding and synthesizing method and a system thereof, which are based on a speech primitive and can be applied to low-bandwidth and high-tone quality speech transmission. On the basis of digital speech transmission, the constructed speech primitive is taken as a coding object and a clustering algorithm is adopted to construct a speech primitive model base by analysis on daily speech; then, a speech primitive automatic cut algorithm is utilized to carry out automatic speech primitive cutting to the obtained continuous speech stream and extract the MFCC characteristics of the speech primitive; a number corresponding to the speech primitive is obtained by carrying out matching identification to the speech primitive in the speech primitive model base, and the number carries out coding by replacing the speech primitive. During the process of speech synthesizing, the speech primitive corresponding to the number is taken out from the speech primitive model base according to the number, and processing such as interpretation fitting and the like is carried out to the spectra enveloping of the speech primitive by mathematical manipulation so as to form smoothtransited speech.

Description

technical field [0001] The invention relates to the fields of speech coding, speech transmission, speech telephony, etc., and in particular to a speech coding and synthesis method and system based on speech primitives. Background technique [0002] With the development of modern network technology, there are more and more applications to transmit voice signals through the Internet, especially the rapid popularization of online chat tools, which has made Internet telephony a popular communication tool. At present, most Internet telephony adopts general-purpose encoding technologies such as G.711, G.723, G.726, and G.729, and the speech in network transmission mostly adopts relatively high-compression medium- and low-rate speech encoding. Although low-rate speech compression coding brings convenience to channel transmission and saves storage space, since most speech coding is lossy compression, speech quality is bound to be lost. What these technologies have in common is to u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/08G10L19/14G10L19/00
Inventor 孟智平郭海锋
Owner 孟智平
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products