36 results about "Received spoken" patented technology

Media center controller system and method

A system and methods for a media center controller. The system and methods include a computing device having a user dialog manager to process commands and input for controlling one or more controlled devices of the media center. The system and methods include the capability to receive and respond to commands and input from a variety of sources, including spoken commands from a user, for remotely controlling one or more electronic devices and to perform, in response to the input received from the handheld device, speech recognition processing, voice over Internet Protocol communications, instant messaging, electronic mail messaging, or control of one or more controlled devices. The system and methods may also include a user interaction device capable of receiving spoken user input and transferring the spoken input to the computing device.
Owner:ONE VOICE TECHNOLOGIES

System and method for immigration tracking and intelligence

A client identity verification system and method that includes receiving a spoken voice sample from a client, identifying a match for the client, verifying client information, changing parameters in a conditional database, and communicating alerts based on the results. The method also includes matching the received spoken voice sample against a stored voiceprint and voiceprint template of random phrases. The method further includes verifying the phone number of the client.
Owner:CARTER JOHN A +2

Head-mounted text display system and method for the hearing impaired

The head-mounted text display system for the hearing impaired is a speech-to-text system, in which spoken words are converted into a visual textual display and displayed to the user in passages containing a selected number of words. The system includes a head-mounted visual display, such as eyeglass-type dual liquid crystal displays or the like, and a controller. The controller includes an audio receiver, such as a microphone or the like, for receiving spoken language and converting the spoken language into electrical signals. The controller further includes a speech-to-text module for converting the electrical signals representative of the spoken language to a textual data signal representative of individual words. A transmitter associated with the controller transmits the textual data signal to a receiver associated with the head-mounted display. The textual data is then displayed to the user in passages containing a selected number of individual words.
Owner:GHULMAN MAHMOUD M

System and method for optimizing speech recognition in a vehicle

A system is provided for controlling personalized settings in a vehicle. The system includes a microphone for receiving spoken commands from a person in the vehicle, a location recognizer for identifying the location of the speaker, and an identity recognizer for identifying the identity of the speaker. The system also includes a speech recognizer for recognizing the received spoken commands. The system further includes a controller for processing the identified location, identity and commands of the speaker. The controller controls one or more feature settings based on the identified location, identified identity and recognized spoken commands of the speaker. The system also optimizes the beamforming microphone array used in the vehicle.
Owner:DELPHI TECH INC

Automated directory assistance system utilizing a priori advisor for predicting the most likely requested locality

The invention relates to an automated directory assistance system that utilizes an a priori advisor for predicting the most likely requested locality. The automated directory assistance system includes a speech recognition dictionary containing a plurality of orthographies, each orthography corresponding to a locality name in which a subscriber whose telephone number is sought by the user of the automated directory assistance system may be residing. Upon reception of the spoken utterance, the system performs a first-pass search that scores the orthographies in the speech recognition dictionary on the basis of acoustic characteristics, each orthography having a certain likelihood of being a match to the spoken utterance. The orthographies are then weighted on the basis of information indicative of the geographical location of the user. A final re-scoring operation may then be performed on the top N candidates in the weighted list. The system improves recognition accuracy by combining the acoustic match search with a probabilistic bias derived from statistical information on calling patterns in the population.
Owner:RPX CLEARINGHOUSE
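
The weighting step in the directory assistance abstract above can be illustrated with a short sketch. It is not the patented method: it assumes the first-pass search yields a log-likelihood per locality name and that the a priori advisor supplies a probability per locality given the caller's location. The function name `top_candidates`, the parameters `prior_weight` and `floor`, and the sample data are all hypothetical.

```python
import math

def top_candidates(acoustic_scores, locality_prior, top_n=3,
                   prior_weight=1.0, floor=1e-6):
    """Combine first-pass acoustic log-scores with a location-based prior.

    acoustic_scores: locality name -> acoustic log-likelihood (assumed input).
    locality_prior:  locality name -> probability that a caller from this
                     location asks for that locality (assumed input).
    Returns the top_n locality names, best first, for a later re-scoring pass.
    """
    weighted = {}
    for name, score in acoustic_scores.items():
        # Floor the prior so unseen localities are penalized, not excluded.
        prior = max(locality_prior.get(name, 0.0), floor)
        weighted[name] = score + prior_weight * math.log(prior)
    return sorted(weighted, key=weighted.get, reverse=True)[:top_n]

# Hypothetical example: the acoustically best match is rare for this caller.
scores = {"Springfield": -10.0, "Springdale": -9.5, "Spring Hill": -14.0}
prior = {"Springfield": 0.6, "Springdale": 0.05, "Spring Hill": 0.35}
shortlist = top_candidates(scores, prior, top_n=2)
```

With these made-up numbers the prior moves "Springfield" ahead of the acoustically closer "Springdale", which is the kind of bias toward likely calling patterns the abstract describes.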

Data Encoding and Retrieval System and Method

A system platform, software and hardware equipment and components, and methodologies are provided for generating, organizing, storing and retrieving medical records using voice recognition in combination with unique codes assigned to data elements, and include a microprocessor and memory, such as a non-transient computer readable medium, having stored thereon a database including vocabulary terms. A speech recognition interface receives spoken language. A display generates an output according to vocabulary terms uniquely associated with the spoken language. Data stored in the database can include records organized into specific modules having specified vocabulary terms synced with each module and unique computer code to key vocabulary terms in the database. Using an associated unique code can cause a specific data field to open on the display when a specific spoken word or phrase is recognized by the speech recognition interface.
Owner:KOZIOL JEFFREY E

Voice spelling in an audio-only interface

A voice spelling method can include the steps of: in an audio-only interface, receiving a plurality of audio signals representative of spoken characters, the plurality of spoken characters specifying a string; and, through the audio-only interface, providing audible feedback in between each received spoken character. Additionally, the method can include the steps of: through the audio-only interface, audibly playing back each spoken character; accepting a voice selection of one of the played back characters, the selection denoting a disputed character; identifying a replacement character; and, replacing the disputed character with the identified replacement character in the specified string.
Owner:NUANCE COMM INC

System and Method for Correcting Speech

A method and device for correcting mispronunciations of a user, the method comprising the following steps: providing a database comprising a plurality of records, each of which comprises at least a textual and a vocal representation of a specific word; training a speech recognition module to recognize spoken utterances of said user comprising the words represented by said records; generating word models for each recognized spoken word; associating each word model with a respective database record; after training said speech recognition module with sufficient words, receiving a spoken utterance from said user; extracting a sequence of words from said spoken utterance and generating a word model for each extracted word; comparing said word models to the word models associated with said database records; constructing an audible output comprising vocal representations obtained from records whose word models matched word models generated for said extracted words, wherein said word models comprise features extracted from data of the words spoken by said user.
Owner:RECHLIS GADI

Voice recognition updates via remote broadcast signal

A method is provided for remotely and dynamically updating voice recognition commands available for controlling a device in a vehicle comprising the steps of: (a) receiving a broadcast signal comprising voice recognition data; (b) filtering the received broadcast signal by separating the voice recognition data from a remainder of the broadcast signal; (c) updating a database containing previously stored voice recognition data with the received voice recognition data; (d) receiving a spoken command from an input device; (e) determining whether the received spoken command matches the voice recognition data stored in the database; and (f) generating a recognized voice command based at least in part on matching the received spoken command with the voice recognition data stored in the database.
Owner:HONDA MOTOR CO LTD

Method and apparatus for specifying a user's preferred spoken language for network communication services

A new telecommunication service provides users with the ability to specify and receive spoken-language services in a preferred language that differs from a preferred written or text language. One or more user (e.g., a subscribing entity) preferences for spoken language as well as written language are specified and stored, e.g., in a subscriber database. An application server uses the configured spoken language preference(s) to select the language used for voice announcements, voice prompts, and other voice applications provided in one or more telecommunication services. For example, the server populates the existing SIP Accept-Language header and/or a new SIP P-Media-Language field with the spoken language preference(s) to allow the other servers, e.g., in a home and destination domain, to provide spoken/voice service as well as text-based service to the user according to the user's language preference(s).
Owner:TELEFON AB LM ERICSSON (PUBL)

Voice Recognition Dialing For Alphabetic Phone Numbers

Systems, methods and media for determining a phone number from a spoken alphabetic phone number are disclosed. Embodiments may include a method for determining a phone number that includes receiving spoken alphanumeric content from a user, the spoken alphanumeric content having one or more alphabetic characters, such as letters, numbers or words. The spoken alphanumeric content may include termination words or separation words in addition to alphabetic characters. The method may also include parsing the received spoken alphanumeric content to determine equivalent numbers for alphabetic characters in the alphanumeric content, such as by parsing received spoken letters, numbers and/or words to determine their equivalent numbers. The method may also include determining the phone number based on the received spoken alphanumeric content and the determined equivalent numbers. Further embodiments may include dialing the determined phone number after determining the phone number.
Owner:IBM CORP
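
The parsing step in the alphabetic phone number abstract above can be sketched as follows. This is an illustration, not the patented implementation: it assumes speech recognition has already turned the utterance into text tokens, it uses the standard telephone keypad letter groups, and the separation and termination words ("dash", "space", "done", "end") are hypothetical choices.

```python
# Standard telephone keypad letter groups.
KEYPAD = {
    "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
    "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
}
LETTER_TO_DIGIT = {
    letter: digit for digit, letters in KEYPAD.items() for letter in letters
}
SPOKEN_DIGITS = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
}
SEPARATION_WORDS = {"dash", "space"}   # hypothetical separation words
TERMINATION_WORDS = {"done", "end"}    # hypothetical termination words

def phone_number_from_tokens(tokens):
    """Map recognized tokens (letters, digits, words) to a digit string."""
    digits = []
    for token in tokens:
        word = token.strip().lower()
        if word in TERMINATION_WORDS:
            break                      # stop parsing at a termination word
        if word in SEPARATION_WORDS:
            continue                   # separators carry no digits
        if word in SPOKEN_DIGITS:
            digits.append(SPOKEN_DIGITS[word])
        elif word.isdigit():
            digits.append(word)
        else:
            # Treat anything else as a letter or word spelled on the keypad.
            for ch in word.upper():
                if ch not in LETTER_TO_DIGIT:
                    raise ValueError(f"cannot map {ch!r} to a keypad digit")
                digits.append(LETTER_TO_DIGIT[ch])
    return "".join(digits)

number = phone_number_from_tokens(["one", "800", "dash", "FLOWERS", "done"])
```

Here `number` is the digit string that a dialer would receive; the final dialing step mentioned in the abstract is left out.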

Spoken language recognition and correction system

The invention provides a spoken language recognition and correction system. In the system, multichannel enhancement and noise reduction are performed on the received spoken language voice signals; combined characteristic parameters are then extracted from the analog signals after enhancement and noise reduction; the combined characteristic parameters are converted into optimized characteristic parameters through self-adaptive conversion and parameter conversion; and finally the optimized characteristic parameters are matched with the standard spoken language data in a standard library so that spoken language correction output is completed. A series of signal conversions is performed on the spoken language input of students, so that students who may have a nonstandard accent can follow the system to learn and correct their spoken language through the conversion and matching functions of the system.
Owner:NANYANG INST OF TECH

Chinese spoken language semantic comprehension method and system

Pending | CN110516253A | Reduce demand | Training time will not skyrocket | Special data processing applications | Spoken language | User input
The embodiment of the invention provides a Chinese spoken language semantic comprehension method. The method comprises the steps of: obtaining a generalized label-free text sequence training set, and performing forward prediction and reverse prediction on the training set in sequence to train a character-level and a word-level bidirectional language model; receiving spoken language voice audio input by a user, and carrying out sequence word segmentation to obtain character sequences and word sequences; decoding the character sequence and the word sequence by using the character-level bidirectional language model and the word-level bidirectional language model respectively to obtain character-level hidden layer vectors and word-level hidden layer vectors; performing vector alignment on the hidden layer vectors of the character sequence and the word sequence to obtain a hidden layer vector of the spoken language voice audio for input to the semantic comprehension model; and inputting the hidden layer vector of the spoken language voice audio into a semantic comprehension model, and determining the semantics of the spoken language voice audio. The embodiment of the invention further provides a Chinese spoken language semantic comprehension system. The embodiment of the invention has good generalization ability, combines word and character sequences, and improves the performance of Chinese semantic comprehension.
Owner:AISPEECH CO LTD

Method and terminal for processing session ability information

Inactive | CN1984132A | Rich application development | Comprehensive description | Transmission | Received spoken
This invention discloses a method by which a terminal processes received spoken ability information. The method includes: A. the terminal receives spoken information that carries spoken ability information; B. the terminal selects appropriate spoken ability information based on the received spoken ability information and constructs a response message. The invention also discloses a terminal that can process spoken ability information. With this method and terminal, appropriate spoken ability information can be selected from multiple items of spoken ability information and the spoken information they belong to, and a negotiated conversation can be established.
Owner:HUAWEI TECH CO LTD

Message exchange and method for distributing messages in telephone networks

A voice-controlled message exchange apparatus and method for receiving spoken messages from a plurality of subscribers in a public switched telephone network via the telephone network. The voice-controlled message exchange is used for storing the received messages together with an identification of the subscriber who has transmitted the message, transmitting these messages to one or more subscribers or groups of subscribers in the public switched telephone network, and accepting and storing replies from subscribers to whom the messages were transmitted, whereby the subscribers not only give voice-controlled messages and replies, but can also draw up and administer lists with identifications of subscribers and groups of subscribers by voice control.
Owner:SWISSCOM

Methods and apparatus for conducting internet protocol telephony communications

IP telephony communications are conducted by sending both audio data produced by a CODEC that represents received spoken audio input, and a textual representation of the spoken audio input. A receiving device utilizes the textual representation of the spoken audio input to help recreate the spoken audio input when a portion of the CODEC data is missing. The textual representation can be generated by a speech-to-text function. Alternatively, the textual representation can be a notation of extracted phonemes.
Owner:VONAGE BUSINESS

Responsive communication system

A spoken communication system includes a plurality of domestic devices and a server. Each of the devices is responsive to spoken communication to communicate that spoken communication to other devices, and to receive spoken communications received by other communication devices. The server is in digital communication with domestic devices to communicate that spoken communication among the registered devices.
Owner:VOCAL POWER HOUSE SYST LLC

Reduced training for dialog systems using a database

Techniques are described for training and executing a machine learning model using data derived from a database. A dialog system uses data from the database to generate related training data for natural language understanding applications. The generated training data is then used to train a machine learning model. This enables the dialog system to leverage a large amount of available data to speed up the training process as compared to conventional labeling techniques. The dialog system uses the trained machine learning model to identify a named entity from a received spoken utterance and generate and output a speech response based upon the identified named entity.
Owner:ORACLE INT CORP

English auxiliary teaching system based on social interaction and data processing method

The invention provides an English auxiliary teaching system based on social interaction and a data processing method. A test question bank of spoken English test question data related to classroom teaching information is constructed through a teacher terminal; the student terminal receives spoken language test answer data input by the student user; the server is used for receiving the spoken language test answer data transmitted by other student terminals, transmitting the received spoken language test answer data to the teacher terminal, receiving the test score and evaluation information returned by the teacher terminal, making evaluation information for the received spoken language test answer data transmitted by other student terminals, and pushing the made evaluation information to other student terminals and / or the teacher terminal. Therefore, test answers and scoring results can be sent by other student terminals between the teacher terminals and the student terminals, the purpose of joint learning can be achieved, and a good learning effect is achieved.
Owner:SHENZHEN TCL NEW-TECH CO LTD

Methods and apparatus for conducting internet protocol telephony communication

IP telephony communications are conducted by sending both data produced by a CODEC that represents received spoken audio input, and a textual representation of the spoken audio input. A receiving device utilizes the textual representation of the spoken audio input to help recreate the spoken audio input when a portion of the CODEC data is missing. The textual representation can be generated by a speech-to-text function. Alternatively, the textual representation can be a notation of extracted phonemes.
Owner:VONAGE BUSINESS

High-accuracy semantic comprehension identification method based on word slot sequence model

Pending | CN112149429A | Address flexibility | Solve the problem that is not easy to configure | Natural language data processing | Speech recognition | Spoken language | Sequence model
The invention discloses a high-accuracy semantic comprehension identification method based on a word slot sequence model, and the method comprises the following steps: formulating a plurality of word slot matching rules, extracting key information from a received spoken language word sequence by using the plurality of word slot matching rules, and finally carrying out semantic comprehension and dialogue intention identification by using the extracted key information. According to the method, the system has a semantic comprehension capability, a voice interaction function can be further provided, the problems that a rule template model is inflexible and not easy to configure are solved, and the problem that a neural network model needs a large amount of training data is avoided.
Owner:成都小美伴旅信息技术有限公司
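
The three steps in the word slot abstract above (formulate matching rules, extract key information, identify the intention) can be sketched with regular expressions. The abstract does not say how its rules are expressed, so the regex form, the slot names, the intent labels, and the sample sentence below are all assumptions made for illustration.

```python
import re

# Hypothetical word slot matching rules: slot name -> pattern with a "value" group.
SLOT_RULES = {
    "destination": re.compile(r"\bto (?P<value>[A-Z][a-z]+)"),
    "date": re.compile(r"\b(?P<value>today|tomorrow)\b"),
}

def extract_slots(text):
    """Apply every rule to the recognized word sequence and collect matches."""
    slots = {}
    for name, pattern in SLOT_RULES.items():
        match = pattern.search(text)
        if match:
            slots[name] = match.group("value")
    return slots

def identify_intent(slots):
    """Pick a dialogue intent from the slots that were filled (toy logic)."""
    if {"destination", "date"} <= slots.keys():
        return "book_trip"
    if "destination" in slots:
        return "ask_about_destination"
    return "unknown"

slots = extract_slots("I want to fly to Chengdu tomorrow")
intent = identify_intent(slots)
```

Because the rules are plain data, adding a slot means adding one entry to `SLOT_RULES` rather than collecting training examples, which is the trade-off the abstract points to.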

Synchronous communication using voice and text

A computing device is described that accepts a telephone call, initiated by a caller, from another device. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface to the other device as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with the telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.
Owner:GOOGLE LLC

Spoken language video synthesis task distribution method and system

The invention relates to a spoken language video synthesis task distribution method and system, and the method comprises the steps: generating a target spoken language video synthesis task in response to a received spoken language video synthesis request sent by a client; and adding the target spoken language video synthesis task to a target task queue based on a preset shunting configuration rule, wherein the target task queue is a local task queue or a cloud task queue, a spoken language video synthesis task in the local task queue is processed by a local synthesis server, and a spoken language video synthesis task in the cloud task queue is processed by a cloud synthesis server. The computing function of the cloud synthesis server for synthesizing the spoken language video is an elastic computing function, so that computing resources are dynamically used according to the number of spoken language video synthesis tasks to be processed by the cloud synthesis server, and the problem of resource waste in related technologies is solved.
Owner:北京鼎事兴教育咨询有限公司
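
The task distribution described in the video synthesis abstract above can be sketched as a router that places each new task on a local or a cloud queue. The abstract does not specify its "preset shunting configuration rule", so the rule used here (fill the local queue up to a fixed depth, then overflow to the cloud) is an assumption, as are the names `TaskRouter`, `local_capacity`, and `submit`.

```python
from collections import deque
from dataclasses import dataclass, field

@dataclass
class TaskRouter:
    """Routes synthesis tasks to a local or cloud queue (illustrative only)."""
    local_capacity: int = 2                      # assumed shunting threshold
    local_queue: deque = field(default_factory=deque)
    cloud_queue: deque = field(default_factory=deque)

    def submit(self, request_id):
        """Create a task for a client request and enqueue it; return the target."""
        task = {"id": request_id, "kind": "spoken_video_synthesis"}
        if len(self.local_queue) < self.local_capacity:
            self.local_queue.append(task)        # handled by the local server
            return "local"
        self.cloud_queue.append(task)            # handled by the cloud server
        return "cloud"

router = TaskRouter(local_capacity=2)
targets = [router.submit(request_id) for request_id in range(4)]
```

In this sketch the cloud queue only grows when the local server is saturated, so elastic cloud capacity would be consumed in proportion to the overflow rather than provisioned up front.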