Systems and methods for automated audio transcription, translation, and transfer

a technology of automatic audio transcription and translation, applied in the field of multimedia processing, can solve the problems of limited flexibility and usefulness of present real-time applications, and many real-time audio and video applications do not permit users to edit or otherwise manipulate content, so as to reduce the cost and process delay of motion picture translation

Inactive Publication Date: 2006-08-24
SAINDON RICHARD J +1
View PDF16 Cites 86 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0008] The present invention also provides audio to text conversion with high accuracy in short periods of time. For example, the present invention provides systems and methods for accurate transcription of live events to 95-98%, and accurate transcription of any event to 100% within a few hours of event completion.
[0023] The present invention further provides systems and methods for providing translations for motion pictures, television shows, or any other serially encoded medium. For example, the present invention provides methods for the translation of audio dialogue into another language that will be represented in a form similar to subtitles. The method allows synchronization of the subtitles with the original audio. The method also provides a hardcopy or electronic translation of the dialogue in a scripted form. The systems and methods of the present invention may be used to transmit and receive synchronized audio, video, timecode, and text over a communication network. In some embodiments, the information is encrypted and decrypted to provide anti-piracy or theft of the material. Using the methods of the present invention, a dramatic reduction (e.g., 50% or more) in the time between a domestic motion picture release and foreign releases is achieved.
[0024] In some such embodiments, the present invention provides methods for providing a motion picture translation comprising, providing: motion picture audio information, a translation system that generates a text translation of the audio; and a processor that encodes text and audio information; processing the motion picture audio information with the translation system to generate a text translation of the audio; processing the text translation with the processor to generate encoded text information; processing the motion picture audio information with the processor to generate encoded audio information; and synchronizing the encoded text information and the encoded audio information. Such methods find use, for example, in reducing the cost and process delay of motion picture translations by more than 50% (e.g., 50%, 51%, . . . , 90%, . . . ).

Problems solved by technology

Present real-time applications, however, are limited in their flexibility and usefulness.
For example, many real-time audio and video application do not permit users to edit or otherwise manipulate the content.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Systems and methods for automated audio transcription, translation, and transfer
  • Systems and methods for automated audio transcription, translation, and transfer
  • Systems and methods for automated audio transcription, translation, and transfer

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0084] The present invention comprises systems and methods for providing text transcripts of multimedia events. For example, text transcripts of live or pre-recorded audio events are generated by the systems and methods of the present invention. The audio may be a component of a more complex multimedia performance, such as televised or motion picture video. Text transcripts are made available to viewers either as pure text transcripts or in conjunction with audio or video (e.g., audio or video from which the text was derived). In some preferred embodiments of the present invention (e.g., for live events), text is encoded in an information stream and streamed to a viewer along with the audio or video event. In some such embodiments, the text is configured to be viewable separate from the media display on a viewer's computer. In yet other preferred embodiments, the text is provided to the viewer in a manner that allows the viewer to manipulate the text. Such manipulations include copy...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to systems and methods for audio processing. For example, the present invention provides systems and methods for receiving live speech, converting the speech to text, and transferring the text to a user. As desired, the speech or text can be translated into one or more different languages. Systems and methods for real-time conversion and transmission of speech and text are provided.

Description

[0001] The present application is a Continuation-in-Part application of co-pending application Ser. No. 09 / 843,186, filed Apr. 4, 2001, herein incorporated by reference in its entirety.FIELD OF THE INVENTION [0002] The present invention relates to systems and methods for multimedia processing. For example, the present invention provides systems and methods for receiving spoken audio, converting the spoken audio to text, and transferring the text to a user. As desired, the speech or text can be translated into one or more different languages. Systems and methods for real-time conversion and transmission of speech and text are provided, including systems and methods for large scale processing of multimedia events. BACKGROUND OF THE INVENTION [0003] The Internet has revolutionized the way that information is delivered and business is done. In June of 1999, Nielsen / NetRatings reported that there were a total of 63.4 million active Internet users in the United States, and 105.4 million t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G10L15/26G06F17/28
CPCG06F17/28G10L15/26G06F17/289G06F40/40G06F40/58
Inventor SAINDON, RICHARD J.BRAND, STEPHEN
Owner SAINDON RICHARD J
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products