Voice quality conversion system, voice quality conversion device, method therefor, vocal tract information generating device, and method therefor

A voice quality conversion system and vocal tract information generation technology, applied in the field of voice quality conversion, that addresses the problem that input speech sometimes cannot be converted into smooth, natural speech.

Inactive Publication Date: 2013-10-23
PANASONIC CORP

AI Technical Summary

Problems solved by technology

[0009] However, with the voice quality conversion technologies described above, the input speech sometimes cannot be converted into smooth, natural speech.



Examples


Embodiment 1

[0162] Figure 8 is a configuration diagram of the voice quality conversion system 100 of the first embodiment.

[0163] The voice quality conversion system 100 converts the voice quality of input speech using vocal tract shape information indicating the shape of the vocal tract. As shown in Figure 8, the voice quality conversion system 100 includes: an input speech storage unit 101, a vowel accepting unit 102, an analysis unit 103, a first vowel vocal tract information storage unit 104, a mixing unit 105, a second vowel vocal tract information storage unit 107, a synthesis unit 108, an output unit 109, a mixing ratio input unit 110, and a transformation ratio input unit 111. The components are connected by wire or wirelessly and exchange information with one another. Each component is described below.

[0164] (Input speech storage unit 101)

[0165] The input speech storage unit 101 stores input speech information and accessory informatio...

Embodiment 2

[0312] Embodiment 2 will be described below.

[0313] The voice quality conversion system of this embodiment differs from that of the first embodiment in that it is composed of two devices. The description below focuses on the differences from Embodiment 1.

[0314] Figure 20 is a configuration diagram of the voice quality conversion system 200 of the second embodiment. In Figure 20, components with the same functions as those in Figure 8 are assigned the same reference numerals, and their descriptions are omitted where appropriate.

[0315] As shown in Figure 20, the voice quality conversion system 200 includes a vocal tract information generation device 201 and a voice quality conversion device 202.

[0316] The vocal tract information generation device 201 generates second vocal tract shape information, which indicates the shape of the vocal tract and is used when converting the voice quality of the input speech. The ...

Embodiment 3

[0324] Embodiment 3 will be described below.

[0325] The voice quality conversion system of this embodiment differs from that of the first embodiment in that it is composed of two devices. The description below focuses on the differences from Embodiment 1.

[0326] Figure 22 is a configuration diagram of the voice quality conversion system 300 of the third embodiment. In Figure 22, components with the same functions as those in Figure 8 are assigned the same reference numerals, and their descriptions are omitted where appropriate.

[0327] As shown in Figure 22, the voice quality conversion system 300 includes a vocal tract information generation device 301 and a voice quality conversion device 302.

[0328] The vocal tract information generation device 301 includes a first vowel vocal tract information storage unit 104, a mixing unit 105, and a mixing ratio input unit 110. The voice quality conversion devi...


Abstract

A voice quality conversion system includes: an analysis unit which analyzes sounds of plural vowels of different types to generate first vocal tract shape information for each type of the vowels; a combination unit which combines, for each type of the vowels, the first vocal tract shape information on that type of vowel and the first vocal tract shape information on a different type of vowel to generate second vocal tract shape information on that type of vowel; and a synthesis unit which (i) combines vocal tract shape information on a vowel included in input speech and the second vocal tract shape information on the same type of vowel to convert vocal tract shape information on the input speech, and (ii) generates a synthetic sound using the converted vocal tract shape information and voicing source information on the input speech to convert the voice quality of the input speech.
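The two combination steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that vocal tract shape information is a fixed-length vector of numbers per vowel, that "combining" means linear interpolation, and that the "different type of vowel" contribution is the average of the other vowels' shapes. The names `mix`, `second_shapes`, `convert_vowel`, `mixing_ratio`, and `conversion_ratio` are hypothetical, and the analysis and synthesis steps are omitted.

```python
from typing import Dict, List

# Assumption: one vocal tract shape is a fixed-length vector of floats.
Shape = List[float]


def mix(a: Shape, b: Shape, ratio: float) -> Shape:
    """Linearly interpolate two shapes: ratio=0 gives a, ratio=1 gives b."""
    if len(a) != len(b):
        raise ValueError("shapes must have the same length")
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0 and 1")
    return [(1.0 - ratio) * x + ratio * y for x, y in zip(a, b)]


def second_shapes(first: Dict[str, Shape], mixing_ratio: float) -> Dict[str, Shape]:
    """Build second shape information for each vowel type.

    Assumption: each vowel's first shape is mixed with the average of the
    first shapes of all other vowel types.
    """
    result: Dict[str, Shape] = {}
    for vowel, shape in first.items():
        others = [s for v, s in first.items() if v != vowel]
        if not others:
            # Only one vowel type available: nothing to mix with.
            result[vowel] = list(shape)
            continue
        average = [sum(column) / len(others) for column in zip(*others)]
        result[vowel] = mix(shape, average, mixing_ratio)
    return result


def convert_vowel(input_shape: Shape, vowel: str,
                  second: Dict[str, Shape], conversion_ratio: float) -> Shape:
    """Move an input vowel's shape toward the second shape of the same vowel type."""
    return mix(input_shape, second[vowel], conversion_ratio)


if __name__ == "__main__":
    first = {"a": [1.0, 0.0], "i": [0.0, 1.0]}  # made-up illustrative values
    second = second_shapes(first, mixing_ratio=0.5)
    print(second["a"])
    print(convert_vowel([1.0, 1.0], "a", second, conversion_ratio=0.5))
```

In this sketch the mixing ratio controls how far each target vowel is pulled toward the other vowels, and the conversion ratio controls how far the input speech is moved toward that target. A real system would pass the converted shapes, together with the voicing source information of the input speech, to a synthesis step.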

Description

Technical field

[0001] The present invention relates to voice quality conversion technology.

Background art

[0002] One conventional voice quality conversion technology prepares a large number of paired utterances with the same content spoken in two different speaking styles (for example, emotions) and learns conversion rules between the two styles (see, for example, Patent Document 1). The voice quality conversion technology described in Patent Document 1 can convert unemotional speech into emotional speech based on a learned model.

[0003] In the voice quality conversion technology described in Patent Document 2, conversion to the target voice is achieved by extracting feature quantities from a small number of vowels uttered in isolation.

[0004] (Prior art literature)

[0005] (Patent documents)

[0006] Patent Document 1: Japanese Patent Application Laid-Open No. 7-72900

[0007] Patent D...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/04, G10L11/00
CPC: G10L21/003, G10L21/04, G10L13/033, G10L25/15
Inventor: 釜井孝浩, 广濑良文
Owner: PANASONIC CORP