Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

19017 results about "Speech recognition" patented technology

Speech recognition is a interdisciplinary subfield of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields.

System and methods for recognizing sound and music signals in high noise and distortion

A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.
Owner:APPLE INC

Method and apparatus for automatically recognizing input audio and/or video streams

A method and system for the automatic identification of audio, video, multimedia, and / or data recordings based on immutable characteristics of these works. The invention does not require the insertion of identifying codes or signals into the recording. This allows the system to be used to identify existing recordings that have not been through a coding process at the time that they were generated. Instead, each work to be recognized is “played” into the system where it is subjected to an automatic signal analysis process that locates salient features and computes a statistical representation of these properties. These features are then stored as patterns for later recognition of live input signal streams. A different set of features is derived for each audio or video work to be identified and stored. During real-time monitoring of a signal stream, a similar automatic signal analysis process is carried out, and many features are computed for comparison with the patterns stored in a large feature database. For each particular pattern stored in the database, only the relevant characteristics are compared with the real-time feature set. Preferably, during analysis and generation of reference patterns, data are extracted from all time intervals of a recording. This allows a work to be recognized from a single sample taken from any part of the recording.
Owner:ICEBERG IND

Personalized audio system and method

A personalized audio system and method that overcomes many of the broadcast-type disadvantages associated with conventional radio stations. According to one embodiment, the personalized audio system includes the following: (1) a user interface that enables a user of the personalized audio system to specify a profile for a personalized audio channel, (2) a sound recording library comprising a plurality of sound recordings, (3) a playlist generator that (a) selects a plurality of sound recording identifiers from a set of sound recording identifiers, wherein each of the plurality of sound recording identifiers identifies a sound recording that matches the profile and that is stored in the library, and that (b) creates a playlist that lists the plurality of sound recording identifiers in a particular order, and (4) a sound recording reproducing device for reproducing the plurality of identified sound recordings according to the particular order in which the sound recording identifiers are listed in the playlist so that the user can listen to the sound recordings. Advantageously, the personalized audio system does not provide the user with a way to determine the plurality of sound recording identifiers prior to the reproducing means reproducing the plurality of sound recordings, and the personalized audio system does not provide the user with a way to directly control which sound recording identifiers in the set are selected by the playlist generator to be included in the plurality of sound recording identifiers.
Owner:MUSIC CHOICE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products