System and method for identifying accent of input sound

A speech accent recognition technology, applicable to speech analysis, speech synthesis, and speech recognition, that addresses the difficulty of generating training data and the insufficient accuracy of accent recognition.

Status: Inactive
Publication Date: 2008-06-04

NUANCE COMM INC

Cites: 0; Cited by: 21

Problems solved by technology

Because accents are relative in nature, it is difficult to generate training data from data such as phonetic deviations.

Automatic recognition of accents based on such speech data has been attempted (see Kikuo Emoto, Heiga Zen, Keiichi Tokuda, and Tadashi Kitamura, "Accent Type Recognition for Automatic Prosodic Labeling," Proc. of Autumn Meeting of the Acoustical Society of Japan (September 2003)), but the accuracy is not high enough for the recognition to be put into practice.

Embodiment Construction

[0018] The present invention is described below through the best mode for carrying it out (hereinafter referred to as the embodiment). The embodiment does not limit the invention as defined by the scope of the claims, and not all combinations of the features described in the embodiment are necessarily essential to the solution provided by the invention.

[0019] FIG. 1 shows the overall configuration of the recognition system 10. The recognition system 10 includes a storage unit 20 and an accent recognition unit 40. The input text 15 and the input speech 18 are supplied to the accent recognition unit 40, which recognizes the accent of the input speech 18. The input text 15 is data indicating the content of the input speech 18, for example a file in which characters are arranged. The input speech 18 is speech in which the input text 15 is read aloud. This ...

Abstract

Disclosed are a method and a system for recognizing the accent of input speech, in which training vocabulary data, training speech data, and training boundary data are stored. After candidates for boundary data are input, first probabilities are calculated based on the input vocabulary, speech, and boundary data. Each first probability is the probability that the prosodic word boundaries of the words in the input text agree with a given boundary candidate. Second probabilities are also calculated based on the input vocabulary, speech, and boundary data, for the case in which the input speech has the prosodic word boundaries specified by one of the input boundary data candidates. Each second probability is the probability that the speech of the words in the input text agrees with the input speech data. The boundary candidate that maximizes the product of the first and second probabilities is then searched out and output as the result.
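The final selection step in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The function name `select_boundary_candidate`, the representation of a boundary candidate as a tuple of booleans, and the toy probability tables are all assumptions made for illustration. How the first and second probabilities are actually estimated from the training data is not shown here.

```python
from typing import Callable, Sequence, Tuple

# Assumption: a boundary candidate is a tuple of booleans, one per gap between
# adjacent words; True means a prosodic word boundary falls in that gap.
BoundaryCandidate = Tuple[bool, ...]


def select_boundary_candidate(
    candidates: Sequence[BoundaryCandidate],
    first_probability: Callable[[BoundaryCandidate], float],
    second_probability: Callable[[BoundaryCandidate], float],
) -> Tuple[BoundaryCandidate, float]:
    """Return the candidate that maximizes first * second, with its score."""
    if not candidates:
        raise ValueError("at least one boundary candidate is required")
    best = candidates[0]
    best_score = first_probability(best) * second_probability(best)
    for candidate in candidates[1:]:
        score = first_probability(candidate) * second_probability(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


# Toy values (hypothetical): stand-ins for probabilities that a real system
# would compute from the stored training data and the input text and speech.
toy_first = {(True, False): 0.6, (False, True): 0.3, (False, False): 0.1}
toy_second = {(True, False): 0.2, (False, True): 0.7, (False, False): 0.5}

best, score = select_boundary_candidate(
    list(toy_first), toy_first.__getitem__, toy_second.__getitem__
)
print(best)
```

With these toy tables the products are roughly 0.12, 0.21, and 0.05, so the second candidate is selected even though the first has the highest first probability on its own. A practical system scoring long texts would likely sum log-probabilities instead of multiplying raw probabilities to avoid numerical underflow; that is a common engineering choice, not something stated in the abstract.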

Description

Technical field

[0001] The present invention relates to speech recognition technology. Specifically, it relates to a technique for recognizing the accent of input speech.

Background technique

[0002] In recent years, attention has been paid to speech synthesis that reads input text aloud with natural pronunciation, without requiring accompanying information such as the reading of the text. In this speech synthesis technology, producing speech that sounds natural to the listener requires accurately reproducing not only the pronunciation of each word but also its accent. If speech can be synthesized so that the relatively high (H type) or relatively low (L type) pronunciation of each mora constituting a word is reproduced accurately, the resulting speech can sound natural to the listener.

[0003] Most speech synthesis systems currently in use are constructed through statistical training. In order to ...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L13/00, G10L13/08, G10L15/00, G10L13/06, G10L13/10

CPC: G10L15/04, G10L13/04

Inventor: 立花隆辉, 长野彻, 西村雅史, 仓田岳人

Owner: NUANCE COMM INC
