Unlock instant, AI-driven research and patent intelligence for your innovation.

Thematic segmentation of speech

a speech segmentation and speech technology, applied in the field of speech processing, can solve the problems of long time-consuming and laborious, difficult automatic transcribing and indexing speech in intelligent and useful manner, and produce erroneous or non-optimal thematic segments

Inactive Publication Date: 2004-02-05
BBN TECHNOLOGIES CORP
View PDF18 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

As effective as the spoken word is for communicating, archiving spoken segments in a useful and easily retrievable manner has long been a difficult proposition.
Although the act of recording audio is not difficult, automatically transcribing and indexing speech in an intelligent and useful manner can be difficult.
A problem with this technique is that it can produce erroneous or non-optimal thematic segments.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Thematic segmentation of speech
  • Thematic segmentation of speech
  • Thematic segmentation of speech

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

refers to the accompanying drawings. The same reference numbers may be used in different drawings to identify the same or similar elements. Also, the following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims and equivalents of the claim limitations.

[0021] Thematic segmentation of spoken audio is performed by a thematic segmentation tool on a transcribed version of the audio supplemented with additional information that further describes the audio. In one implementation, the transcription is supplemented with visible linguistic structural information, such as sentence demarcations and non-visible linguistic structural information such, as phrasal boundaries, topic lists, and speaker boundaries. The result of the thematic segmentation includes hierarchical and potentially overlapping thematic segments.

System Overview

[0022] Thematic segmentation, as described herein, may be performed on one or more processing devi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A thematic segmentation tool generates indications of thematically coherent segments within a document. The thematic segmentation tool includes a transcription component, a speaker boundary detection component, a linguistic detection component, and a topic classification component. Each of these components generates input information for a thematic decision component, which generates the thematic segmentation information. Multiple thematic segments may concurrently apply to a portion of a document. Additionally, the thematic segments may be hierarchical thematic segments.

Description

[0001] This application claims priority under 35 U.S.C. .sctn. 119 based on U.S. Provisional Application Nos. 60 / 394,064 and 60 / 394,082 filed Jul. 3, 2002 and Provisional Application No. 60 / 419,214 filed Oct. 17, 2002, the disclosures of which are incorporated herein by reference.[0003] A. Field of the Invention[0004] The present invention relates generally to speech processing and, more particularly, to the segmentation of speech based on thematic classification.[0005] B. Description of Related Art[0006] Speech has not traditionally been valued as an archival information source. As effective as the spoken word is for communicating, archiving spoken segments in a useful and easily retrievable manner has long been a difficult proposition. Although the act of recording audio is not difficult, automatically transcribing and indexing speech in an intelligent and useful manner can be difficult.[0007] Speech is typically received into a speech recognition system as a continuous stream of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F7/00G06F17/00G06F17/21G06F17/28G10L11/00G10L15/00G10L15/26G10L21/00
CPCG10L25/78G10L15/26Y10S707/99943H04M2201/42H04M2201/60H04M2203/305
Inventor SRIVASTAVA, AMITKUBALA, FRANCIS
Owner BBN TECHNOLOGIES CORP