Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

730 results about "Audio segment" patented technology

Gallery of videos set to an audio time line

A machine includes a processor and a memory connected to the processor. The memory stores instructions executed by the processor to receive a message with audio content and video content. Audio fingerprints within the audio content are evaluated. The audio fingerprints are matched to known audio fingerprints to establish matched audio fingerprints. A determination is made whether the matched audio fingerprints correspond to a designated gallery constructed to receive a sequence of videos set to an audio time line. The matched audio fingerprints and corresponding video content are added to the audio time line. The operations are repeated until the audio time line is populated with corresponding video content to form a completed gallery with video segments set to audio segments that constitute a complete audio time line. The completed gallery is supplied in response to a request.
Owner:SNAP INC

Gallery of videos set to an audio time line

A machine includes a processor and a memory connected to the processor. The memory stores instructions executed by the processor to receive a message with audio content and video content. Audio fingerprints within the audio content are evaluated. The audio fingerprints are matched to known audio fingerprints to establish matched audio fingerprints. A determination is made whether the matched audio fingerprints correspond to a designated gallery constructed to receive a sequence of videos set to an audio time line. The matched audio fingerprints and corresponding video content are added to the audio time line. The operations are repealed until the audio time line is populated with corresponding video content to form a completed gallery with video segments set to audio segments that constitute a complete audio time line. The completed gallery is supplied in response to a request.
Owner:SNAP INC

Encapsulated, streaming media automation and distribution system

Disclosed are systems and methods for creating and distributing programming content carried by a digital streaming media to be a plurality of remote nodes located over a large geographic area to create customized broadcast quality programming at the remote nodes. At the remote nodes, a multi-window screen display simultaneously shows different programming including national programming and local programming content. The remote nodes utilize a remote channel origination device to assemble the customized programming at the remote location that can be controlled from a central location. An encapsulated IP and IP encryption system is used to transport the digital streaming media to the appropriate remote nodes. Also disclosed is a graphical user interface (“GUI”) providing a software control interface for creating and editing shows or programs that can be aired or played on a remote display device having a multi-window display. The intuitive GUI Software provides the user the ability to easily manage and assemble a series of images, animations and transitions as a single broadcast quality program to be displayed on a remote display device. Another application software system is capable of automating the production of audio narration reports. The disclosed audio concatenation engine automates the creation of audio narration using prerecorded audio segments to minimize the requirement for live, on-air personnel to record audio narration segments.
Owner:CALLAHAN CELLULAR L L C

Method for locating an audio segment within an audio file

A method for locating an audio segment within an audio file comprising (i) providing a first transcribed text file associated with the audio file; (ii) providing a second transcribed text file associated with the audio file; (iii) receiving a user input defining a text segment corresponding to the audio segment to be located; (iv) searching for the text segment in the first transcribed text file; and (v) displaying only those occurrences of the text segment within the first transcribed text file that are also a match to occurrences of the text segment within the second transcribed text file.
Owner:KAHN JONATHAN +2

Multi-mode audio recognition and auxiliary data encoding and decoding

ActiveUS20140108020A1Improving communication over networkOptimize networkSpeech analysisData capacityFeature extraction
Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.
Owner:DIGIMARC CORP

Multi-mode audio recognition and auxiliary data encoding and decoding

ActiveUS20140142958A1Improving communication over networkOptimize networkSpeech analysisData capacityFeature extraction
Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.
Owner:DIGIMARC CORP

Downloadable and controllable music-on-hold

A method is disclosed that enables a telecommunications terminal user who is on hold during a call to determine which audio segments, such as musical compositions, are to be played, without some of the costs, disadvantages, and limitations of techniques in the prior art. The illustrative embodiment of the present invention provides a controllable music-on-hold capability that allows the user to enter commands via the user's telecommunications terminal keypad. In accordance with the illustrative embodiment of the present invention, a structure for storing computer files, referred to as an “audio segment box,” is established in a data-processing system. The audio segment box is similar to a “voice mail box” used in voice mail systems, except that the audio segment box is designated for audio segments that include musical compositions.
Owner:AVAYA INC

Picture line audio augmentation

The subject invention provides a system and / or a method that facilitates creating an authored video with audio applied to at least one image / video segment within the authored video. An audio enhancement component can apply audio to at least one image / video segment, wherein an audio segment begins with a display of the image / video segment (e.g., an instance of displaying the image or video segment within the authored video). A segment-line can be utilized to provide audio to the image / video segment(s) within the authored video, wherein the segment-line can be a sequence of image / video segments chronologically ordered based upon a start and an end of the image / video clip.
Owner:MICROSOFT TECH LICENSING LLC

System for gathering and recording real-time market survey and other data from radio listeners and television viewers utilizing telephones including wireless cell phones

InactiveUS20070107008A1Free and reduced cost advertising timeMarket predictionsAnalogue secracy/subscription systemsDemographic dataTelevision station
The invention records real-time radio and television listener data utilizing automated, interactive questions and radio and television broadcast audio segments recorded by telephone, including wireless cell phones. Telephone users are asked to hold their phone in the direction of any broadcast audio they are hearing or listening to. Streaming program audio directly from radio and television broadcasts is matched to the program audio recorded from telephone users using computer audio matching technology. When an audio match is made, recorded data will automatically populate an alpha / numeric database creating a record including fields for identifying the radio or TV station, time of recording, the phone user's 10 digit telephone number and demographic information on the listener. Demographics may be obtained prior to the call or by automated interactive questioning, during a call, with the phone user responding to questions verbally or by pushing appropriate keys on the telephone keypad.
Owner:DYBUS DONNELLY ANDREW MR

System and method for continuous media segment identification

This invention provides a means to identify unknown media programming using the audio component of said programming. The invention extracts audio information from the media received by consumer electronic devices such as smart TVs and TV set-top boxes then conveys said information to a remote server means which will in turn identify said audio information of unknown identity by way of testing against a database of known audio segment information. The system identifies unknown media programming in real-time such that time-sensitive services may be offered such as interactive television applications providing contextually related information or television advertisement substitution. Other uses include tracking media consumption among many other services.
Owner:INSCAPE DATA INC

Signal processing method and video signal processor for detecting and analyzing a pattern reflecting the semantics of the content of a signal

The video signal processor 10 includes a scene detector 16 which uses features extracted for visual segments and / or audio segments resulted from segmentation of an input stream of video data, and a criterion for measurement of similarity between visual and / or audio segment pairs, calculated for each of the features using the similarity measurement criterion, to detect two visual segments and / or audio segments whose time gap is within a predetermined temporal threshold and whose dissimilarity is less than a predetermined dissimilarity threshold and group the segments into a scene consisting of visual segments and / or audio segments reflecting the semantics of the video data content and temporally contiguous to each other.
Owner:SATURN LICENSING LLC

System and method for seamless multimedia assembly

Systems and methods are provided for seamless assembly of video / audio segments. To achieve such seamless assembly during streaming / online progressive download of media, a second segment is downloaded to a client during the presentation of a first segment. The first segment is then attached to the beginning of the second segment, where no jitter or gap results with the transition point either in the video or audio portion of the segments. Hence, the merged segments are presented as a seamless assembly of video / audio segments, where the user is “unaware” that the merged segments are the result of two separate or different segments. To effectuate such gapless assembly of segments, a gapless media file is created for encoding the video and audio segments using a gapless audio encoding scheme, such as Ogg Vorbis, where synchronized, gapless audio tags are interleaved in the video segments.
Owner:JBF INTERLUDE 2009

Controlling Spatial Audio Coding Parameters as a Function of Auditory Events

An audio encoder or encoding method receives a plurality of input channels and generates one or more audio output channels and one or more parameters describing desired spatial relationships among a plurality of audio channels that may be derived from the one or more audio output channels, by detecting changes in signal characteristics with respect to lime in one or more of the plurality of audio input channels, identifying as auditory event boundaries changes in signal characteristics with respect to lime in the one or more of the plurality of audio input channels, an audio segment between consecutive boundaries constituting an auditory event in the channel or channels, and generating all or some of the one or more parameters al least partly in response to auditory events and / or the degree of change in signal characteristics associated with the auditory event boundaries. An auditory-event-responsive audio upmixer or upmixing method is also disclosed.
Owner:DOLBY LAB LICENSING CORP

Method for inputting characters in electronic device

According to an aspect of the invention, an enhanced audible feedback solution has been invented for electronic devices using an input device facilitating navigation though a plurality of available user interface input options and confirmation of a selected input option. The electronic device is arranged to define, as a response to detecting a selection of a character on the basis of a detection of a first input to an input device of the electronic device, an audio segment specific to the character. The electronic device is arranged to output the defined audio segment via the audio output means prior to a confirmation by a second input to the input device, the second input being associated with a function adding the character as part of a character sequence entered by the user.
Owner:NOKIA TECHNOLOGLES OY

Method and system for gathering and recording real-time market survey and other data from radio listeners and television viewers utilizing telephones including wireless cell phones

InactiveUS7797186B2Free and reduced cost advertising timeMarket predictionsAnalogue secracy/subscription systemsRecording durationDigital data
The invention records real-time radio and television listener data utilizing automated, interactive questions and radio and television broadcast audio segments recorded by telephone, including wireless cell phones. Telephone users are asked to hold their phone in the direction of any broadcast audio they are hearing or listening to. Streaming program audio directly from radio and television broadcasts is matched to the program audio recorded from telephone users using computer audio matching technology. When an audio match is made, recorded data will automatically populate an alpha / numeric database creating a record including fields for identifying the radio or TV station, time of recording, the phone user's 10 digit telephone number and demographic information on the listener. Demographics may be obtained prior to the call or by automated interactive questioning, during a call, with the phone user responding to questions verbally or by pushing appropriate keys on the telephone keypad.
Owner:DYBUS DONNELLY ANDREW MR

System and method for detecting and storing important information

Provided is an improved method for recording audio notes for easier later retrieval. The system monitors audio input and recommends recording of an extended audio segment based on detection of audio triggers. If the user accepts the recommendation, the use is provided with the opportunity to record a segment name. Segment names are recorded with links to the extended audio segment. Later review of segment names eases retrieval of extended audio segment with desired content.
Owner:IBM CORP

System and Method for Providing Audio for a Requested Note Using a Render Cache

A method for providing audio data corresponding to a requested musical note is disclosed, the method comprising: (a) providing a render cache having a plurality of cache entries, each of the cache entries corresponding to a different note; (b) receiving a request for a first note from a client; (c) identifying a first cache entry corresponding to the first note; (d) determining that a first audio segment corresponding to the first cache entry is not available; (e) identifying a second audio segment corresponding to a near-hit cache entry in the render cache; and (f) processing the second audio segment into a third audio segment that is substantially similar to the first audio segment.
Owner:MUSIC MASTERMIND

Graphical user interface for determining speech recognition accuracy

A solution for determining the accuracy of a speech recognition system. A first graphical user interface (GUI) is provided for selecting a transaction log. The transaction log has at least one entry that specifies a speech recognition text result. A second GUI is also provided for selecting at least one audio segment corresponding to the entry. The second GUI includes an activatable icon for initiating transcription of the audio segment through a reference speech recognition engine to generate a second text result.
Owner:NUANCE COMM INC

Video summarization apparatus and method

A video summarization apparatus stores, in memory, video data including video and audio, and metadata items corresponding to video segments included in the video data respectively, each of metadata items including keyword and characteristic information of content of corresponding video segment, selects metadata items including specified keyword from metadata items, to obtain selected metadata items, extracts, from video data, video segment corresponding to selected metadata items, to obtain selected video segments, generates summarized video data by connecting extracted video segments, detects audio breakpoints included in video data, to obtain audio segments segmented by audio breakpoints, extracts from video data, audio segments corresponding to extracted video segments as audio narrations, and modifies ending time of video segment in summarized video data so that ending time of video segment in summarized video data coincides with or is later than ending time of corresponding audio segment of extracted audio segments.
Owner:KK TOSHIBA

Media Delivery System and a Portable Communications Module for Audio and Remote Control of Interactive Toys or Devices

A portable communications module (600) of FIG. 6 has an input (621) coupled to receive an incident signal. The input (621) split out a first audio channel containing a context audio track from the incident signal and directs the first audio channel along a first audio output path for selective audio output from a speaker (634, 620) either internally within or external to the module. The input (621)also directs a second audio channel in the incident signal to an RF audio transmitter chain for broadcast, the second audio channel comprising a composite audio signal from a plurality of audio tracks, each audio track embedded with a unique activation code that is present for substantially an entire duration of audio activity in each audio segment of each track. The input (621) is further arranged to apply a tone encoded signal in the incident signal to at least a tone decoder (640) in a data transmitter chain distinct from the audio transmitter chain (642). A microcontroller (650), responsive to recovered data, is arranged to translate the recovered data into a control signal related to functional control of remote equipment (102). And a data transmit chain (660) operates to modulate the control signal onto a carrier for broadcast to the remote equipment to effect its operational control. The tone encoded data is effectively filtered within the portable communications module to an extent that it is not amplified within the first audio output path and is not processed by the RF audio transmitter chain.
Owner:REGLER

System and method for providing matched multimedia video content

A system for providing content to client computing devices. The system is configured to receive an audio feed that includes audio segments. Each audio segment includes either regular audio content or preemptory audio content. The system may determine whether each audio segment includes regular or preemptory audio content. For each audio segment determined to include preemptory audio content, the system may direct the client computing devices to preempt, with the preemptory audio content, any current content being presented by the client computing devices. For each audio segment determined to include regular audio content, the system may identify the regular audio content, match multimedia video content with the identified regular audio content, and direct the matched multimedia video content to the client computing devices for presentation thereby to users.
Owner:VADIO

System and method for diarization of speech, automated generation of transcripts, and automatic information extraction

A client device retrieves a diarization model. The diarization model has been trained to determine whether there is a change of one speaker to another speaker within an audio sequence. The client device receives enrollment data from each speaker of a group of speakers who are participating in an audio conference. The client device obtains an audio segment from a recording of the audio conference. The client device identifies one or more speakers for the audio segment by applying the diarization model to a combination of the enrollment data and the audio segment.
Owner:ONU TECH INC

System and method for fingerprinting datasets

Systems and methods for the matching of datasets, such as input audio segments, with known datasets in a database are disclosed. In an illustrative embodiment, the use of the presently disclosed systems and methods is described in conjunction with recognizing known network message recordings encountered during an outbound telephone call. The methodologies include creation of a ternary fingerprint bitmap to make the comparison process more efficient. Also disclosed are automated methodologies for creating the database of known datasets from a larger collection of datasets.
Owner:GENESYS TELECOMMUNICATIONS LABORATORIES INC

Integrated voice mail and email system

A method for managing text messages, includes transcript of voice mail media mail (voice mail or text message) messages. The media mail messages can be stored in a client device or a media mail server. A media mail message is a text-based message or a text transcription of at least part of an audio segment comprising a voice message or a conversation. The method comprising the steps of receiving an audio signal input from a user, the audio signal input including a command indicating a task, and performing the task according to the command. The tasks include: copying, deleting, replying, forwarding a message or saving a message to a folder, creating a new folder in the client device or in the media mail server, renaming, moving or deleting a folder in the client device or in the media mail server, searching for a term in a media mail message, and searching for a media mail message containing a keyword.
Owner:NOKIA CORP

Automatic volume and dynamic range adjustment for mobile audio devices

A mobile audio device (for example, a cellular telephone, personal digital audio player, or MP3 player) performs Audio Dynamic Range Control (ADRC) and Automatic Volume Control (AVC) to increase the volume of sound emitted from a speaker of the mobile audio device so that faint passages of the audio will be more audible. This amplification of faint passages occurs without overly amplifying other louder passages, and without substantial distortion due to clipping. Multi-Microphone Active Noise Cancellation (MMANC) functionality is, for example, used to remove background noise from audio information picked up on microphones of the mobile audio device. The noise-canceled audio may then be communicated from the device. The MMANC functionality generates a noise reference signal as an intermediate signal. The intermediate signal is conditioned and then used as a reference by the AVC process. The gain applied during the AVC process is a function of the noise reference signal.
Owner:QUALCOMM INC

Video summarization using audio and visual cues

A method for producing an audio-visual slideshow for a video sequence having an audio soundtrack and a corresponding video track including a time sequence of image frames, comprising: segmenting the audio soundtrack into a plurality of audio segments; subdividing the audio segments into a sequence of audio frames; determining a corresponding audio classification for each audio frame; automatically selecting a subset of the audio segments responsive to the audio classification for the corresponding audio frames; for each of the selected audio segments automatically analyzing the corresponding image frames to select one or more key image frames; merging the selected audio segments to form an audio summary; forming an audio-visual slideshow by combining the selected key frames with the audio summary, wherein the selected key frames are displayed synchronously with their corresponding audio segment; and storing the audio-visual slideshow in a processor-accessible storage memory.
Owner:KODAK ALARIS INC

Method and apparatus for interactivity with broadcast media

A method and apparatus for interactivity with broadcast media is provided. The method includes capturing (404) a plurality of audio segments from a plurality of broadcasts. The plurality of broadcasts correspond to a plurality of broadcast channels. Further, the method includes receiving (406) an audio clip from an electronic device (200). The method also includes identifying (408) a broadcast channel corresponding to the audio clip based on the plurality of audio segments and the audio clip.
Owner:MOTOROLA MOBILITY LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products