Phoneme changing method based on digital signal processing

A voice changing method in the field of digital signal processing and speech technology, applicable to speech synthesis, speech analysis, instruments, and related areas. It addresses the large amount of calculation and poor naturalness of existing methods, and achieves a small amount of calculation and high naturalness.

Inactive Publication Date: 2005-01-19
BEIJING KEXIN TECH +1

AI Technical Summary

Problems solved by technology

The common problems of these methods are a relatively large amount of calculation, the need for manual intervention, and relatively poor naturalness of the synthesized speech.
However, due to the limitations of the PSOLA algorithm itself, when the fundamental frequency must be changed over a relatively large range, the speech exhibits considerable aliasing, which results in a lot of noise.
After the other two methods change the fundamental frequency, the natur



Examples


Embodiment Construction

[0020] The present invention is described in further detail below with reference to the accompanying drawings and specific embodiments.

[0021] Figure 1 shows the flowchart of the voice changing method of the present invention. First, a frame of speech is input; the length of a frame can be adjusted as appropriate for the actual situation.

[0022] The fundamental frequency of the input raw speech signal is then estimated. This embodiment uses the harmonic sum method (Summation of Sub-Harmonic Method) for fundamental frequency estimation, so a fundamental frequency value is obtained whether or not the original speech is periodic. When the original speech is periodic, that is, it has a pitch period, a meaningful fundamental frequency value is obtained; when the original speech is not periodic, that is, it has no pitch period, such as in unvoiced ...
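The harmonic-summation idea in [0022] can be illustrated with a minimal sketch. This is a simplified variant and not necessarily the estimator used in the patent: the function names, the candidate range of 60 to 400 Hz, the 5 Hz step, the five harmonics, and the 0.84 decay weight are all assumptions chosen for illustration. For each candidate fundamental frequency, the sketch adds up the spectral magnitudes at its harmonics with decreasing weights and returns the candidate with the largest sum. As the text notes, a value is always returned, even for a frame with no periodicity.

```python
import cmath
import math


def spectrum_magnitude(frame, freq_hz, sample_rate):
    """Magnitude of the frame's Fourier transform at one frequency (direct DFT sum)."""
    step = -2j * math.pi * freq_hz / sample_rate
    return abs(sum(x * cmath.exp(step * n) for n, x in enumerate(frame)))


def estimate_f0(frame, sample_rate, f_min=60.0, f_max=400.0, f_step=5.0,
                n_harmonics=5, decay=0.84):
    """Return the candidate F0 (Hz) whose weighted harmonic sum is largest.

    All default values are illustrative assumptions, not values from the source.
    """
    best_f0, best_score = f_min, -1.0
    n_candidates = int((f_max - f_min) / f_step) + 1
    for i in range(n_candidates):
        f0 = f_min + i * f_step
        score = 0.0
        for h in range(1, n_harmonics + 1):
            if h * f0 >= sample_rate / 2:  # ignore harmonics above Nyquist
                break
            weight = decay ** (h - 1)  # higher harmonics contribute less
            score += weight * spectrum_magnitude(frame, h * f0, sample_rate)
        if score > best_score:
            best_f0, best_score = f0, score
    return best_f0
```

The decaying weights are what keep the estimator from preferring a sub-multiple of the true fundamental: a candidate at half the true frequency only reaches the signal's harmonics through its own higher, more weakly weighted harmonics.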



Abstract

This invention discloses a voice changing method based on digital signal processing. It comprises the steps of: (1) selecting the original speech signal to be changed; (2) finding the pitch period length of the original speech signal; (3) determining the position of every pitch period across the whole original speech signal according to the pitch period length; (4) inserting or deleting pitch periods between the pitch periods of the original speech signal to obtain a lengthened or shortened speech signal; (5) linearly compressing or stretching the lengthened or shortened speech signal to the same length as the original speech signal to obtain the changed speech signal. The method can run in real time on a DSP chip, and the changed voice sounds very natural.
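Steps (4) and (5) of the abstract can be sketched as follows. This is only an illustration under strong simplifying assumptions: the pitch period is taken to be constant and already known in samples, the periods are assumed to start at sample 0 (so step (3) is reduced to fixed-length slicing), and any trailing partial period is dropped. The function names and the `ratio` parameter are hypothetical and do not come from the patent.

```python
def change_period_count(signal, period, ratio):
    """Repeat or drop whole pitch periods so their count scales by `ratio`.

    Assumes a constant pitch period (in samples) starting at sample 0.
    """
    periods = [signal[i:i + period]
               for i in range(0, len(signal) - period + 1, period)]
    if not periods:
        raise ValueError("signal is shorter than one pitch period")
    n_out = max(1, round(len(periods) * ratio))
    out = []
    for k in range(n_out):
        # Map each output period back to a source period (insert or delete).
        src = min(len(periods) - 1, int(k / ratio))
        out.extend(periods[src])
    return out


def resample_linear(signal, target_length):
    """Linearly interpolate `signal` onto `target_length` evenly spaced points."""
    if target_length == 1 or len(signal) == 1:
        return [signal[0]] * target_length
    scale = (len(signal) - 1) / (target_length - 1)
    out = []
    for i in range(target_length):
        pos = i * scale
        left = min(int(pos), len(signal) - 2)
        frac = pos - left
        out.append((1 - frac) * signal[left] + frac * signal[left + 1])
    return out


def change_pitch(signal, period, ratio):
    """Insert or delete pitch periods, then resample to the original length."""
    modified = change_period_count(signal, period, ratio)
    return resample_linear(modified, len(signal))
```

With `ratio` greater than 1, periods are inserted and the lengthened signal is compressed back to the original length, so each period becomes shorter and the fundamental frequency rises; with `ratio` less than 1, periods are deleted and the signal is stretched, which lowers it. The overall duration stays the same in both cases.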

Description

Technical field

[0001] The present invention relates to a voice changing method, and more specifically to a voice changing method based on digital signal processing.

Background art

[0002] Fundamental frequency and formants are two very important features of speech. The fundamental frequency is the frequency at which the vocal cords vibrate when producing voiced sounds. It is directly related to the gender of the speaker: generally speaking, the fundamental frequency of a male voice is relatively low, and that of a female voice is relatively high. Age also has some influence on the fundamental frequency. The fundamental frequency of the elderly is lower than that of young adults, and that of young adults is lower than that of children. Therefore, changing the fundamental frequency changes how the speech is perceived and affects a listener's judgment of the speaker's age and even gender.

[0003] Formant refer...


Application Information

IPC(8): G10L13/00, G10L21/003
Inventor: 李明, 刘建, 汪俊杰, 庹凌云, 颜永红, 孙宝海
Owner: BEIJING KEXIN TECH