Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

System and method for identifying accent of input sound

A speech and accent technology, applied in speech analysis, speech synthesis, speech recognition, etc., can solve the problems of generating training data and insufficient accuracy.

Inactive Publication Date: 2008-06-04
NUANCE COMM INC
View PDF0 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, because accents are relative in nature, it is difficult to generate training data from data such as phonetic deviations
In fact, although automatic recognition of accents based on such speech data has been attempted (see Kikuo Emoto, Heiga Zen, Keiichi Tokuda, and Tadashi Kitamura "Accent Type Recognition for Automatic Prosodic Labeling," Proc. of Autumn Meeting of the Acoustical Society of Japan ( September, 2003) (Kikuo Emoto, Heiga Zen, Keiichi Tokuda, and Tadashi Kitamura, "Speech Type Recognition for Automatic Prosodic Labeling", Fall Meeting of the Acoustical Society of Japan (September 2003))), but the accuracy is not satisfactory enough to make The identification is put into practice

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for identifying accent of input sound
  • System and method for identifying accent of input sound
  • System and method for identifying accent of input sound

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Although the present invention is described below in the best mode for carrying out the present invention (hereinafter referred to as embodiments), the following embodiments do not limit the present invention in accordance with the scope of the claims, and the embodiments described in the embodiments All combinations of the features are not necessarily necessary for the solution of the present invention.

[0019] FIG. 1 shows the overall configuration of the recognition system 10. The recognition system 10 includes a storage unit 20 and an accent recognition unit 40. The input text 15 and the input speech 18 are input to the accent recognition unit 40, and the accent recognition unit 40 recognizes the accent of the input speech 18 thus input. The input text 15 is data for indicating the content of the input voice 18, and is, for example, data such as a file in which characters are arranged. In addition, the input voice 18 is a voice in which the input text 15 is read. This ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Disclosed are a method and a system of input speech stress recognition, with a training vocabulary, training speech data and training boundary data stored. Hereafter, after the candidates of boundary data are input, the first probabilities are calculated based on the input vocabulary, speech and boundary data. The first probabilities are the corresponding probabilities between the words of input text, each prosodic word boundary and each boundary candidate. In addition, the second probabilities also can be calculated based on the input vocabulary, speech and boundary data when the input speech has boundary of the prosodic word assigned by one of the input boundary data candidates. The second probabilities are the corresponding probabilities between the word speech of input text and input speech data. Thereafter, the optimized the boundary candidate is searched out as the output result by maximizing the products of the first and the second probabilities.

Description

Technical field [0001] The present invention relates to speech recognition technology. Specifically, the present invention relates to a technique for recognizing the accent of an input voice. Background technique [0002] In recent years, attention has been paid to speech synthesis for reading the input text using natural pronunciation without requiring accompanying information such as the reading of the text. In this speech synthesis technology, in order to produce a natural-sounding sound to the listener, it is important not only to accurately reproduce the pronunciation of a word, but also to accurately reproduce its accent. If speech can be synthesized by accurately reproducing the higher H type or lower L type pronunciation of each mora that constitutes a word, it is possible to make the resulting speech sound natural to the listener. [0003] Most of the currently used speech synthesis systems are systems constructed through statistical training of the systems. In order to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/00G10L13/08G10L15/00G10L13/06G10L13/10
CPCG10L15/04G10L13/04
Inventor 立花隆辉长野彻西村雅史仓田岳人
Owner NUANCE COMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products