Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Language-based signaling of secondary audio

a secondary audio and language-based technology, applied in the field of reliable and accurate signaling of secondary audio channels, can solve the problems of unreliable automatic selection of proper audio channels, not always possible, and at best unreliable automatic selection of audio channels,

Inactive Publication Date: 2018-06-21
ARRIS ENTERPRISES LLC
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The present invention provides a method for detecting actual languages in digital audio streams, such as in MPEG-2 or HDMI streams. This is by monitoring the content of the audio stream and converting it to text in real-time. The method includes identifying a sequence of three letters (a trigram) in the converted text and retaining a list of the most frequent trigrams associated with various languages. By comparing the retained trigrams with a set of pre-stored entries, the method can detect the actual language being spoken in the audio stream. This technology can be applied in various areas such as language identification in voice-controlled systems or media content analysis.

Problems solved by technology

However, as the inventors hereof appreciated, this automatic switching of played audio back to a main audio elementary stream can occur upon events such as channel transitions (a change of channel), a change of program from one program to the next (e.g., at the top of the hour, etc.) Or the selected audio channel may change for other reasons, which the present inventors have appreciated may be confusing or distracting to the viewer of the HDMI sink device, e.g., a television 902.
Adding to the confusion is that a broadcaster may signal an audio elementary stream as generically containing “original audio” language, without specifying the exact language being used, thus making automatic selection of the proper audio channel unreliable at best, or at worst not always possible.
As explained above, the inventors have appreciated that use of the ISO language descriptor for automatic selection of audio channel is unreliable at best, and not always possible at worst.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Language-based signaling of secondary audio
  • Language-based signaling of secondary audio
  • Language-based signaling of secondary audio

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035]The present invention determines and corrects signaling mismatch in secondary audio based on the actual content of the audio in the secondary audio channel. Disclosed embodiments relate to use in a set-top box (or Home Network End Device “HNED”), but the principles apply equally to use within a user HDMI sink device such as a television or digital video receiver (DVR).

[0036]The invention alleviates problems in conventional use of an ISO language descriptor, particularly observed by the inventors hereof, to automatically detect the textually named language contained in an associated audio stream. Use of an ISO language descriptor has conventionally been felt to provide reliable language identification.

[0037]The inventive system and method additionally, or instead, monitors the actual audio content of the currently selected audio channel, converts the audio in real time to text, and based on the first few or so detected words determines a most probable actual language of the aud...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An actual language contained within a compressed digital A / V stream within an MPEG-2 (or HDMI) stream is detected upon program transition by monitoring the actual audio content of a currently selected audio stream within the MPEG-2 (or HDMI or MP4) stream. The monitored audio stream is converted in real time to text. A frequency of sequence of three letters (a trigram) in the converted text is generated, and a plurality of the most frequent trigrams within the converted text are retained. An actual language being spoken is detected in the digital audio stream by determining a closest match between the retained plurality of trigrams and a pre-stored entry in a list of most frequent trigrams (MFT) each pre-associated with a given respective language. The detected actual language may be compared to an ISO language descriptor received in the stream, and appended to an AC-3 audio coding descriptor.

Description

BACKGROUND OF THE INVENTION1. Field of the Invention[0001]The present invention relates to reliable and accurate signaling of a secondary audio channel received by a set-top box or HDMI sink device such as a television or DVR.2. Background of Related Art[0002]Digital TV is transmitted as a stream of MPEG-2 data known as a transport stream. Each transport stream has a data rate of up to 40 mb / s for a cable or satellite network, which is enough for seven or eight separate TV channels, or about 25 mb / s for a terrestrial network.[0003]Each transport stream includes a multiplexed set of sub-streams known as elementary streams. Each elementary stream can contain MPEG-2 encoded audio, MPEG-2 encoded video, or data encapsulated in an MPEG-2 stream. Each elementary stream has a unique 13-bit ‘packet identifier’ (PID) that identifies that stream within the transport system.[0004]Each MPEG-2 elementary stream is packetized into a packetized elementary stream (PES). Each packetized elementary s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04N21/439G10L25/78G10L15/26G10L15/02G10L15/00G10L25/51G10L19/00H04N21/81H04N21/4147H04N21/4363
CPCH04N21/4394G10L25/78G10L15/26G10L15/02G10L15/005G10L2015/022G10L19/00H04N21/8106H04N21/4147H04N21/43635H04N21/4884G10L25/51G10L19/167H04N21/440236
Inventor BHAT, DINKAR N.LEARY, PATRICK J.
Owner ARRIS ENTERPRISES LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products