293 results about "Speech translation" patented technology

Speech translation is the process by which conversational spoken phrases are instantly translated and spoken aloud in a second language. This differs from phrase translation, in which the system translates only a fixed, finite set of phrases that have been manually entered into it. Speech translation technology enables speakers of different languages to communicate, and is therefore of great value for science, cross-cultural exchange, and global business. Some systems also accept typed input: the user types a phrase and presses a button to hear it pronounced in the target language.

Speech-to-speech translation system with user-modifiable paraphrasing grammars

The present invention discloses a speech-to-speech translation device which allows one or more users to input a spoken utterance in one language, translates the utterance into one or more second languages, and outputs the translation in speech form. Additionally, the device allows for translation in both directions, recognizing inputs in the one or more second languages and translating them back into the first language. The device recognizes and translates utterances in a limited domain, as in a phrase book translation system, so the translation accuracy is essentially 100%. By limiting the domain, the system increases the accuracy of the speech recognition component and thus the accuracy of the overall system. However, unlike other phrase book systems, the device also allows wide variations and paraphrasing in the input, so that the user is much more likely to find the desired phrase in the stored list of phrases. The device paraphrases the input to a basic canonical form and performs the translation on that canonical form, ignoring the non-essential variations in the surface form of the input. The device can provide visual and/or auditory feedback to confirm the recognized input, which makes the system usable by non-bilingual users with absolute confidence.
Owner:EHSANI FARZAD +2
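
The canonical-form idea in the abstract above can be illustrated with a minimal sketch. This is not the patented method: the phrase book, the regular-expression paraphrase patterns, and the function names `canonicalize` and `translate` are all hypothetical, and the sketch works on text that a speech recognizer is assumed to have already produced.

```python
import re

# Hypothetical phrase book: canonical form -> stored translations per language.
PHRASE_BOOK = {
    "where is the bathroom": {
        "es": "¿Dónde está el baño?",
        "fr": "Où sont les toilettes ?",
    },
    "how much does this cost": {
        "es": "¿Cuánto cuesta esto?",
        "fr": "Combien ça coûte ?",
    },
}

# Hypothetical paraphrase patterns; each maps surface variations to one canonical form.
PARAPHRASES = [
    (re.compile(r"\b(where is|where's|where can i find) the (bathroom|restroom|toilet)\b"),
     "where is the bathroom"),
    (re.compile(r"\b(how much (is|does) (this|it)( cost)?|what does this cost)\b"),
     "how much does this cost"),
]


def canonicalize(utterance):
    """Reduce a recognized utterance to a canonical phrase, or None if out of domain."""
    # Lowercase, drop punctuation (keeping apostrophes), and collapse whitespace.
    text = re.sub(r"[^\w\s']", " ", utterance.lower())
    text = re.sub(r"\s+", " ", text).strip()
    for pattern, canonical in PARAPHRASES:
        if pattern.search(text):
            return canonical
    return None


def translate(utterance, target_lang):
    """Translate via the canonical form; returns None for unsupported input."""
    canonical = canonicalize(utterance)
    if canonical is None:
        return None
    return PHRASE_BOOK[canonical].get(target_lang)
```

Because translation is a lookup on the canonical form, every accepted paraphrase yields the same stored translation, while out-of-domain input is rejected rather than translated badly.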

Audio renderings for expressing non-audio nuances

Methods, systems, computer program products, and methods of doing business by adapting audio renderings of non-audio messages (for example, e-mail messages that are processed by a text-to-speech translator) to reflect various nuances of the non-audio information. Audio cues are provided for this purpose, which are sounds that are “mixed” in with the audio rendering as a separate (background) audio stream. Audio cues may reflect information such as the topical structure of a text file, or changes in paragraphs. Or, audio cues may be used to signal nuances such as changes in the color or font of the source text. Audio cues may also be advantageously used to reflect information about the translation process with which the audio rendering of a text file was created, such as using varying background tones to convey the degree of certainty in the accuracy of translating text to audio using a text-to-speech translation system, or of translating audio to text using a voice recognition system, or of translating between languages, and so forth. Stylesheets, such as those encoded in the Extensible Stylesheet Language (“XSL”), may optionally be used to customize the audio cues. For example, a user-specific stylesheet customization may be performed to override system-wide default audio cues for a particular user, enabling her to hear a different background sound for messages on a particular topic than other users will hear.
Owner:CERENCE OPERATING CO

Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access

A real-time networked telephony or computer system has a feature complex and/or applications that offer a class of features to a subscriber, including call information, and permits the subscriber to manage incoming and existing calls through available features accessed using spoken utterances. A speech processing unit coupled to the system interprets a subscriber's spoken utterances without requiring the subscriber to train the system to recognize his or her voice. The interpretation of spoken utterances is enabled by a system state database that is maintained at the speech processing unit and comprises a database of the possible system states, including possible call flows for a call, and an associated database comprising the context-specific grammar that a subscriber may recite at respective points in the call flow. The speech processing unit may also convert message signals from the network to speech, which is read to the subscriber using a text-to-speech translator. The network can identify the subscriber's voice or the language used, and thereafter recognizes all further commands using the specific grammar for that language and performs text-to-speech conversion in the identified language. Results of transactions can be transmitted to update grammars, profiles, templates, and so forth.
Owner:SOUND VIEW INNOVATIONS
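
The pairing of system states with context-specific grammars described in the abstract above can be sketched as a small lookup table. This is an illustration under assumptions, not the patented design: the state names, command phrases, and the `interpret` function are hypothetical, and the input is assumed to be text already produced by a recognizer.

```python
# Hypothetical call-flow table: state -> {accepted phrase: next state}.
# Each inner dict plays the role of the context-specific grammar for that state.
CALL_FLOW = {
    "idle": {"check messages": "listening_to_messages"},
    "incoming_call": {"answer": "in_call", "send to voicemail": "idle"},
    "in_call": {"hold": "on_hold", "hang up": "idle"},
    "on_hold": {"resume": "in_call", "hang up": "idle"},
    "listening_to_messages": {"delete": "listening_to_messages", "stop": "idle"},
}


def interpret(state, utterance):
    """Match an utterance against the grammar of the current state only.

    Returns (next_state, accepted). Phrases valid in other states are rejected,
    which keeps the set of candidate commands small at every point in the call.
    """
    grammar = CALL_FLOW[state]
    phrase = utterance.strip().lower()
    if phrase in grammar:
        return grammar[phrase], True
    return state, False
```

For example, "hang up" is accepted while a call is in progress but rejected in the idle state, because the idle grammar does not contain it.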