Conversion method for sound of speaker

A speaker voice conversion technique in the field of signal processing that addresses the poor voice quality and low similarity of existing methods. It improves sound quality and similarity, is easy to implement, and eliminates interference.

Active Publication Date: 2013-03-20
UNIV OF SCI & TECH OF CHINA
3 Cites, 82 Cited by

AI Technical Summary

Problems solved by technology

[0008] The technical problem to be solved by the present invention is that existing speaker voice conversion methods produce poor voice quality and low similarity.

Embodiment Construction

[0043] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0044] From a physiological point of view, research has confirmed that when the human brain perceives speech signals, the perception of speaker information and the perception of speech content take place in different areas of the cerebral cortex. This shows that the human brain decomposes speaker and content information at a high level, and that the information in the speech signal is separable. Separating speaker information from content information is of great significance to speech signal processing: the separated information can be used independently for speaker recognition, speech recognition, and other targeted applications.

[0045] The present invention starts from the essence of the speaker's voice con...

Abstract

The invention discloses a speaker voice conversion method comprising a training stage and a conversion stage. In the training stage, a fundamental frequency feature, a speaker feature, and a content feature are extracted from the training voice signals of a source speaker and a target speaker; a fundamental frequency conversion function is constructed from the fundamental frequency features, and a speaker conversion function is constructed from the speaker features. In the conversion stage, a fundamental frequency feature and a spectral feature are extracted from the source speaker's voice signal to be converted; the fundamental frequency conversion function and the speaker conversion function obtained in the training stage are applied to the fundamental frequency feature and the speaker feature extracted from that signal, yielding the converted fundamental frequency feature and speaker feature; and the target speaker's voice is synthesized from the converted fundamental frequency feature, the converted speaker feature, and the content feature of the signal to be converted. The method is easy to implement, and the converted speech has higher sound quality and similarity.
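As a rough illustration of the two-stage structure described in the abstract, the sketch below trains a fundamental frequency (F0) conversion function and then applies it to a new utterance. The abstract does not state the form of the conversion function, so the log-domain mean and variance matching used here is an assumption, chosen because it is a common baseline in voice conversion. The function name `train_f0_conversion` and the F0 values are hypothetical, and the speaker conversion function and synthesis steps are not covered.

```python
import math
from statistics import mean, pstdev


def train_f0_conversion(source_f0, target_f0):
    """Build an F0 conversion function from per-frame F0 values in Hz.

    Assumption: a linear transform in the log-F0 domain that matches the
    target speaker's mean and standard deviation. The patent abstract
    does not specify the form of its conversion function.
    """
    # Only voiced frames (F0 > 0) contribute to the statistics.
    src_log = [math.log(f) for f in source_f0 if f > 0]
    tgt_log = [math.log(f) for f in target_f0 if f > 0]
    src_mean, src_std = mean(src_log), pstdev(src_log)
    tgt_mean, tgt_std = mean(tgt_log), pstdev(tgt_log)
    scale = tgt_std / src_std if src_std > 0 else 1.0

    def convert(f0):
        # Unvoiced frames are marked with 0 and passed through as 0.
        if f0 <= 0:
            return 0.0
        return math.exp(tgt_mean + scale * (math.log(f0) - src_mean))

    return convert


# Training stage: hypothetical F0 tracks (0 marks an unvoiced frame).
source_train = [110.0, 120.0, 0.0, 130.0, 125.0]
target_train = [210.0, 230.0, 250.0, 0.0, 240.0]
convert_f0 = train_f0_conversion(source_train, target_train)

# Conversion stage: apply the trained function to a new source utterance.
converted = [convert_f0(f) for f in [115.0, 0.0, 128.0]]
```

Working in the log domain keeps the converted values positive and treats pitch differences as ratios, which is closer to how pitch is perceived than a linear shift in Hz.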

Description

Technical field

[0001] The invention belongs to the technical field of signal processing. Specifically, it relates to a speaker voice conversion method that separates the speaker information from the content information in a voice signal, so that one speaker's voice signal is converted into a signal perceived as another speaker's voice without changing the content information it carries.

Background

[0002] In today's information age, human-computer interaction has long been a research hotspot in the computer field, and an efficient, intelligent human-computer interaction environment has become an urgent need for the application and development of current information technology. Speech is one of the most important and convenient means of human communication, so voice interaction is expected to be the most "friendly" form of interaction. Human-machine speech dialogue technology based on speech recognition, speech synthesis and natural lan...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L25/30
Inventors: 陈凌辉 (Chen Linghui), 戴礼荣 (Dai Lirong), 凌震华 (Ling Zhenhua)
Owner: UNIV OF SCI & TECH OF CHINA