
Pitch quantization for distributed speech recognition

A differential pitch quantization technique applicable to speech recognition, speech analysis and related fields. It addresses the problems of accuracy and robustness against channel errors, with the effect of improving network performance and communication quality.

Active Publication Date: 2006-03-15
IBM CORP +1

AI Technical Summary

Problems solved by technology

Reducing the number of bits used for pitch quantization could create problems in terms of accuracy and robustness against channel errors.




Embodiment Construction

[0024] According to a preferred embodiment, the present invention advantageously overcomes the problems of the prior art by effectively reducing the number of bits used in pitch quantization, as will be discussed in detail below.

[0025] I. Overview

[0026] Figure 1 is a block diagram illustrating a network for distributed speech recognition (DSR) according to a preferred embodiment of the present invention. Figure 1 shows a web server or wireless service provider 102 operating on a network 104 that connects the server / wireless service provider 102 with clients 106 and 108. In one embodiment of the invention, Figure 1 represents a networked computer system that includes a server 102, a network 104, and client computers 106-108. In a first embodiment, the network 104 is a circuit-switched network, such as the Public Switched Telephone Network (PSTN). Alternatively, the network 104 is a packet-switched network. The packet-switched network is a Wide Area Network (WAN), such as the glob...


Abstract

A system, method and computer readable medium for quantizing pitch information of audio is disclosed. The method includes capturing audio representing a numbered frame of a plurality of numbered frames. The method further includes calculating a class of the frame, wherein a class is any one of a voiced or unvoiced class. If the frame is a voiced class, a pitch is calculated for the frame (903). If the frame is an even numbered frame and a voiced class, a codeword of first length is calculated by absolutely quantizing the frame pitch (910). If the frame is an odd numbered frame and a voiced class and a reliable frame is available, a codeword of a second length is calculated by differentially quantizing the frame pitch (905). If there is no reliable frame available, a codeword of the second length is calculated by absolutely quantizing the frame pitch.
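The decision logic in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the abstract gives only a "first length" and a "second length", so the bit counts (7 and 5), the pitch range, the log-domain uniform quantizer, and the differential range are all assumed values, and every function and constant name here is hypothetical. How a "reliable frame" is identified is not described in the abstract, so the caller simply passes its pitch in (or `None` when no reliable frame is available).

```python
import math
from typing import NamedTuple, Optional

# Illustrative values only; the abstract does not state any of them.
FIRST_LENGTH_BITS = 7               # codeword length, even frames (assumed)
SECOND_LENGTH_BITS = 5              # codeword length, odd frames (assumed)
PITCH_MIN, PITCH_MAX = 19.0, 140.0  # pitch period range in samples (assumed)
MAX_LOG_STEP = 0.2                  # |log ratio| covered differentially (assumed)


class Codeword(NamedTuple):
    mode: str   # "absolute" or "differential"
    bits: int   # codeword length in bits
    index: int  # quantizer index, 0 .. 2**bits - 1


def _uniform_index(value: float, lo: float, hi: float, bits: int) -> int:
    """Clip value to [lo, hi] and map it to one of 2**bits uniform levels."""
    levels = (1 << bits) - 1
    clipped = min(max(value, lo), hi)
    return round((clipped - lo) / (hi - lo) * levels)


def absolute_quantize(pitch: float, bits: int) -> int:
    """Quantize log(pitch) on its own, without reference to another frame."""
    return _uniform_index(
        math.log(pitch), math.log(PITCH_MIN), math.log(PITCH_MAX), bits
    )


def differential_quantize(pitch: float, reference: float, bits: int) -> int:
    """Quantize log(pitch / reference), the change relative to a reference."""
    return _uniform_index(
        math.log(pitch / reference), -MAX_LOG_STEP, MAX_LOG_STEP, bits
    )


def quantize_frame(
    frame_number: int,
    voiced: bool,
    pitch: Optional[float] = None,
    reliable_reference: Optional[float] = None,
) -> Optional[Codeword]:
    """Pick the quantization mode for one frame, following the abstract."""
    if not voiced:
        # The abstract does not say what is sent for unvoiced frames.
        return None
    if frame_number % 2 == 0:
        # Even-numbered voiced frame: absolute quantization, first length.
        index = absolute_quantize(pitch, FIRST_LENGTH_BITS)
        return Codeword("absolute", FIRST_LENGTH_BITS, index)
    if reliable_reference is not None:
        # Odd-numbered voiced frame with a reliable frame available:
        # differential quantization, second length.
        index = differential_quantize(
            pitch, reliable_reference, SECOND_LENGTH_BITS
        )
        return Codeword("differential", SECOND_LENGTH_BITS, index)
    # Odd-numbered voiced frame, no reliable frame: absolute, second length.
    index = absolute_quantize(pitch, SECOND_LENGTH_BITS)
    return Codeword("absolute", SECOND_LENGTH_BITS, index)
```

For example, `quantize_frame(4, True, 60.0)` takes the absolute branch with the longer codeword, while `quantize_frame(5, True, 62.0, reliable_reference=60.0)` takes the differential branch with the shorter one. The bit saving comes from the odd-numbered frames always using the shorter codeword.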

Description

[0001] Cross References to Related Applications

[0002] This patent application is related to co-pending and commonly owned U.S. Patent Application No. 10/360,582, Attorney Docket No. CML00872M, entitled "Class Quantization For Distributed Speech Recognition," filed on the same date as this patent application, which is hereby incorporated by reference in its entirety.

Technical Field

[0003] The present invention generally relates to the field of distributed speech recognition systems, in particular to distributed speech recognition for narrow bandwidth communication and wireless communication.

Background

[0004] With the advent of pagers and cellular phones, the wireless service industry has grown into a multi-billion dollar industry. A substantial amount of revenue for wireless service providers (WSPs) comes from subscriptions. Likewise, the ability of a WSP to run a successful network depends on the quality of service provided to subscribers on a network with li...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L11/04, G10L25/90
CPC: G10L15/30, G10L19/09, G10L19/08, G10L19/032, G10L25/90, G10L25/93, G10L2025/935
Inventor: Tenkasi V. Ramabadran, Alexander Sorin
Owner: IBM CORP