Linguistic segmentation of speech

a linguistic segmentation and speech technology, applied in the field of speech processing, can solve the problems of long time-consuming and laborious, and the difficulty of automatically transcribing and indexing speech in an intelligent and useful manner

Inactive Publication Date: 2004-02-05
BBN TECHNOLOGIES CORP
View PDF42 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

As effective as the spoken word is for communicating, archiving spoken segments in a useful and easily retrievable manner has long been a difficult proposition.
Although the act of recording audio is not difficult, automatically transcribing and indexing speech in an intelligent and useful manner can be difficult.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Linguistic segmentation of speech
  • Linguistic segmentation of speech
  • Linguistic segmentation of speech

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

refers to the accompanying drawings. The same reference numbers may be used in different drawings to identify the same or similar elements. Also, the following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims and equivalents of the claim limitations.

[0023] Linguistic segmentation of spoken audio is performed by a linguistic segmentation tool based on a transcribed version of the speech and the original speech. The linguistic segmentation tool analyzes both lexical and acoustical features of the speech in generating the linguistic segments. The lexical features include syntactic classifications of the words in the transcribed text. The acoustical features include measured pauses, speaking rate, speaker energy, and speaker pitch. Speech models based on the acoustic and lexical features are combined to achieve a final probability of a particular linguistic feature occurring.

[0024] System Overview

[0025] Linguistic s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A linguistic segmentation tool (115) includes an acoustic feature extraction component (302) and a lexical feature extraction component (311). The acoustic feature extraction component (302) extracts prosodic features from speech (e.g., pauses, pitch, energy, and rate). The lexical feature extraction component (311) extracts lexical features from a transcribed version of the speech (e.g., words, syntactic classifications of the words, and word structure). A language model is constructed based on the lexical features and an acoustic model is constructed based on the acoustic features. A statistical framework combines the outputs of the language model to generate indications of potential linguistic features.

Description

[0001] This application claims priority under 35 U.S.C. .sctn. 119 based on U.S. Provisional Application Nos. 60 / 394,064 and 60 / 394,082 filed Jul. 3, 2002 and Provisional Application No. 60 / 419,214 filed Oct. 17, 2002, the disclosures of which are incorporated herein by reference.[0003] A. Field of the Invention[0004] The present invention relates generally to speech processing and, more particularly, to linguistic segmentation of transcribed speech.[0005] B. Description of Related Art[0006] Speech has not traditionally been valued as an archival information source. As effective as the spoken word is for communicating, archiving spoken segments in a useful and easily retrievable manner has long been a difficult proposition. Although the act of recording audio is not difficult, automatically transcribing and indexing speech in an intelligent and useful manner can be difficult.[0007] Speech is typically received into a speech recognition system as a continuous stream of words. In orde...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F7/00G06F17/00G06F17/21G06F17/28G10L11/00G10L15/00G10L15/26G10L21/00
CPCG10L25/78G10L15/26Y10S707/99943H04M2201/42H04M2201/60H04M2203/305
Inventor SRIVASTAVA, AMITKUBALA, FRANCIS
Owner BBN TECHNOLOGIES CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products