Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

421 results about "Spoken dialog" patented technology

Spoken dialog systems. A spoken dialog system is a computer system able to converse with a human with voice. It has two essential components that do not exist in a written text dialog system: a speech recognizer and a text-to-speech module (written text dialog systems usually use other input systems provided by an OS).

Process for automatic control of one or more devices by voice commands or by real-time voice dialog and apparatus for carrying out this process

A speech dialog system wherein a process for automatic control of devices by speech dialog is used applying methods of speech input, speech signal processing and speech recognition, syntatical-grammatical postediting as well as dialog, executive sequencing and interface control, and which is characterized in that syntax and command structures are set during real-time dialog operation; preprocessing, recognition and dialog control are designed for operation in a noise-encumbered environment; no user training is required for recognition of general commands; training of individual users is necessary for recognition of special commands; the input of commands is done in linked form, the number of words used to form a command for speech input being variable; a real-time processing and execution of the speech dialog is established; and the speech input and output is done in the hands-free mode.
Owner:NUANCE COMM INC

Advanced voice and data operations in a mobile data communication device

A system and method for integrating voice and data operations into a single mobile device capable of simultaneously performing data and voice actions. The mobile device working in a network capable of exchanging both cell phone calls and data items to the mobile device. By wearing an earphone or an ear-bud device the user is capable of dealing with voice conversations while working with data centric information related to the current caller. By providing a data-centric device with voice capabilities there is a new range of features that allow incoming data events to trigger outgoing voice events.
Owner:MALIKIE INNOVATIONS LTD

Knowledge-based flexible natural speech dialogue system

A knowledge-based natural speech dialogue system includes: (i) a knowledge support system, (ii) a flexible dialogue control system, and (iii) a context information system. Flexibilities of the conversation structure, inherent in mixed-initiative mode for dealing with complex user request, are managed because the knowledge structures involved are represented by additional, powerful knowledge representation tools, and because the context information is retained by more specific data structures, which covers larger temporal scopes by the logic of the conversation, rather than by a fixed locality of the grammar flow. This system provides a simple yet reliable method to compensate for these factors to enable more powerful conversation engines with mixed-initiative capabilities.
Owner:VOICE ENABLING SYST TECH

Audio recall during voice conversations

Conversation information is provided to a conversation participant. Audio data is received a client device associated with a conversation participant. The client device maintains a buffer of received audio data, for instance audio data received in a previous interval of time. A recall request is received by the client device for information associated with the buffered audio data. The buffered audio data is analyzed responsive to the recall request to identify the requested information. The identified information is then presented at the client device.
Owner:CISCO TECH INC

Method and system for presenting dynamic commercial content to clients interacting with a voice extensible markup language system

A system for selecting a voice dialog, which may be an advertisement or information message, from a pool of voice dialogs and for causing the selected voice dialog to be utilized by a voice application for presentation to a caller during an automated voice interactive session includes a voice-enabled interaction interface hosting the voice application; and, a sever monitoring the voice-enabled interaction interface for selecting the voice dialog and for serving at least identification and location of the dialog to be presented to the caller via the voice application.
Owner:APPTERA

WiFi phone system

A voice communication system over WiFi 802.11 network includes: WiFi phone devices, subscription servers and wireless access points. Through voice and data gateways the system can transfer both voice and data streams through WiFi; Phone devices with both wired and WiFi wireless interfaces can automatically switches between wired network and wireless network through WiFi access points. Subscription server maintains the current status of each device every time it changes location. Voice packets are prioritized over regular data packets and dispatched to different network interface by smart packet dispatcher. Quality of service for voice conversation over WiFi is achieved by various methods: piggy-tail method and ACK enhancements and other improvements to reduce delay and latency of real time voice packets.
Owner:XU YONGYONG

Statistical Voice Dialog System and Method

ActiveUS20150228275A1Reduce decreaseDisambiguates the belief statesSpeech recognitionState variableAlgorithm
A method for processing a voice command using a statistical dialog model determines a belief state as a probability distribution over states organized in a hierarchy with a parent-child relationship of nodes representing the states. The belief state includes the hierarchy of state variables defining probabilities of each state to correspond to the voice command and a probability of a state of a child node in the hierarchy is conditioned on a probability of a state of a corresponding parent node. A system action is selected based on the belief state.
Owner:MITSUBISHI ELECTRIC RES LAB INC

Advanced voice and data operations in a mobile data communication device

A system and method for integrating voice and data operations into a single mobile device capable of simultaneously performing data and voice actions. The mobile device working in a network capable of exchanging both cell phone calls and data items to the mobile device. By wearing an earphone or an ear-bud device the user is capable of dealing with voice conversations while working with data centric information related to the current caller. By providing a data-centric device with voice capabilities there is a new range of features that allow incoming data events to trigger outgoing voice events.
Owner:MALIKIE INNOVATIONS LTD

Apparatus and method for post-processing dialogue error in speech dialogue system using multilevel verification

Provided are an apparatus and method for post-processing a dialogue error in a speech dialogue system using multilevel verification, in which both of a user's current utterance and a whole dialogue flow are taken into account through the multilevel verification including speech recognition results analysis, linguistic analysis, discourse analysis and dialogue analysis. As a result, various errors that may occur in the speech dialogue system are detected, and error post-processing appropriate to a detected error type is performed, so that speech recognition errors may be reduced.
Owner:ELECTRONICS & TELECOMM RES INST

Voice control method and mobile terminal apparatus

A voice control method and a mobile terminal apparatus are provided. The mobile terminal apparatus includes a voice receiving module, a voice outputting module, a voice wake-up module and a language recognition module. When the voice wake-up module determined that a first voice signal matches to identification information, the voice receiving module is turned on. When the voice receiving module receives a second voice signal after the first voice signal, the language recognition module parses the second voice signal and obtains a voice recognition result. When the voice recognition result includes an executing request, the language recognition module executes a responding operation, and the voice receiving module is turned off from receiving a third voice signal. When the voice recognition result does not include the executing request, the language recognition module executes a speech conversation mode.
Owner:VIA TECH INC

Multiple handset cordless telephone including a ring signal/call routing module

A multiple handset cordless telephone includes a Caller ID / handset association table to identify a desired intended handset to receive an incoming call before the call is answered. When a match is made as between received call related information and a pre-stored entry in the Caller ID / handset association table, one or more intended handsets are provided with the subsequent ring signals and / or the subsequent voice conversation. If no match is found the cordless telephone operates otherwise in conventional fashion, i.e., all handsets ring in response to the incoming call, and any or all of the handsets may join the conversation at any time. The entries in the Caller ID / handset association table may be input manually using a keypad at the base unit or at any one of the handsets, or the received call related information with respect to a current, established incoming call may be stored as an entry in the Caller ID / handset association table and associated with the currently used handset or handsets upon activation of a unique code or dedicated button.
Owner:WSOU INVESTMENTS LLC +1

Method and apparatus for optimizing a spoken dialog between a person and a machine

A network comprises a number of speech-enabled sites maintaining a number of voice pages. A central server on the network executes a voice browser which provides users with access to the sites using voice-activated hyperlinks. The server also maintains and brokers information associated with the users based on spoken dialogs between the users and the sites. In response to a user accessing a given ASR site, information about that user is provided by the server for use by that ASR site. The information is used by the ASR site to optimize a spoken dialog between the user and the ASR site by reducing the amount of information the user is required to provide during the dialog. Information about the user can thereby be shared between separate speech enabled sites, in a manner which is transparent to the user, in order to expedite the user's interaction with those sites.
Owner:NUANCE COMM INC

Voice control method and mobile terminal device

Provided are a mobile terminal device and a voice control method. The mobile terminal device comprises a voice receiving unit, a voice output unit, a voice wake-up module and a language comprehension module. When the voice wake-up module judges that a first voice signal accords with identifying information, the mobile terminal device starts the voice receiving unit. When the voice receiving unit receives a second voice signal after receiving the first voice signal, the language comprehension module analyzes the second voice signal to obtain a voice recognition result. When the voice recognition result includes an executable request message, the language comprehension module executes an answering operation, the mobile terminal device is closed, and the voice receiving unit receives a third voice signal. When the voice recognition result does not include the executable request message, the language comprehension module executes a voice conversation mode.
Owner:VIA TECH INC

Tool for graphically defining dialog flows and for establishing operational links between speech applications and hypermedia content in an interactive voice response environment

InactiveUS7143042B1Quickly and easily createSpeech recognitionDrag and dropApplication software
A computer-implemented graphical design tool allows a developer to graphically author a dialog flow for use in a voice response system and to graphically create an operational link between a hypermedia page and a speech object. The hypermedia page may be a Web site, and the speech object may define a spoken dialog interaction between a person and a machine. Using a drag-and-drop interface, the developer can graphically define a dialog as a sequence of speech objects. The developer can also create a link between a property of any speech object and any field of a Web page, to voice-enable the Web page, or to enable a speech application to access Web site data.
Owner:NUANCE COMM INC

Method and system for retrieving information

System and method for receiving information in a communication terminal during a voice conversation session with a remote communication terminal. After initiating the voice conversation between a first and a second communication terminal, audio signals of the voice conversation are passed to a speech recognition engine to identify a keyword from the voice conversation. The identified keywords are then used for locating and retrieving information related to the keyword, and the retrieved information is presented on the display of at least one of the first and second communication terminals.
Owner:SONY ERICSSON MOBILE COMM AB

Method and system for providing voice communication over data networks

A method and system are disclosed in which an Internet subscriber can establish real-time voice conversations over the Internet. By performing a call negotiation scheme to identify and locate the other calling party, an Internet telephone can establish a voice communication channel over the Internet to a telephony or computer device.
Owner:CISCO TECH INC

Terminal device for voice-directed work and information exchange

A device is comprised of processing circuitry operable for providing a speech interface to facilitate a speech dialog with a user or generates commands for a user. An RFID reader, operably coupled with the processing circuitry, is operable for reading data from an RFID tag. In one aspect, the reading occurs in the context of a speech dialog. In another aspect, the data is used to generate speech commands. In another aspect, information is stored to an RFID tag during the speech dialog.
Owner:VOCOLLECT

Smart vehicle manuals and maintenance tracking system

An emotive advisory system for use by one or more occupants of an automotive vehicle includes a computer configured to receive input indicative of an operating state of the vehicle. The computer determines at least one of a need to provide owner's manual (or maintenance) information to an occupant based on the operating state of the vehicle and a request to provide owner's manual (or maintenance) information to the occupant. The computer generates (i) data representing an avatar having an appearance and (ii) data representing a spoken statement for the avatar. The spoken statement provides owner's manual (or maintenance) information to the occupant in spoken dialog based on at least one of the need and the request. The data representing the avatar is output for visual display, and the data representing the statement for the avatar is output for audio play.
Owner:FORD GLOBAL TECH LLC

Conversion of calls from an ad hoc communication network

An improved system and procedure for allowing participants within a group conversation on a wireless communication network to at least temporarily break away from a group conversation and hold a side bar conversation. The side bar conversation may comprise several different types, including a public conversation (much like the original group conversation) or may constitute a private conversation (much like a standard phone call) or private text messaging or data exchange. A group conversation user wishing to hold the side bar conversation uses his user interface to alert other group conversation user(s) with whom he would like to hold the side bar conversation. Either the initiating user or the recipient user(s) can recommend the type of side bar conversation to be had using their user interfaces. If the side bar conversation is a voice conversation, the original group conversation can still be audible in the back ground, but can be reduced in volume relative to the side bar conversation or squelched in and out when the side bar conversation is active. If desired, side bar participants can rejoin the original group conversation by selecting such an option from their user interfaces.
Owner:MOTOROLA SOLUTIONS INC

Speech dialogue system for controlling an electronic device

A speech dialogue system (SDS) is provided for controlling an electronic device through speech control. A method for controlling an electronic device with a speech dialogue system is provided, using a speech dialogue system with a speech recognition unit. The speech recognition unit includes a control command determining unit, which is activated by a keyword, for determining a control command to control the electronic device. The method comprises the steps of receiving an acoustic input, spoken by a user of the SDS, containing keyword information and control command information; determining a keyword corresponding to the keyword information; activating the command determining unit, and determining a control command corresponding to the control command information. A speech dialogue system for use in a vehicle is also provided, comprising a speech recognition unit including a control command determining unit which is activated by a keyword spoken by a user of the SDS. The speech recognition unit is configured to activate the command determining unit for determining a control command upon receipt of an acoustic input comprising both keyword information and control command information. A computer program product is also provided, including one or more computer readable media having computer executable instructions for performing the method for controlling an electronic device via speech control.
Owner:NUANCE COMM INC

Emotive advisory system including time agent

An emotive advisory system for use by one or more occupants of an automotive vehicle includes a computer. The computer is configured to receive input indicative of an operating state of the vehicle. The computer determines at least one of a need to provide time information to an occupant based on the operating state of the vehicle and an occupant request to provide time information to the occupant. The computer generates data representing an avatar having an appearance, and data representing a spoken statement for the avatar. The spoken statement provides time information to the occupant in spoken dialog based on at least one of the need and the request. The computer outputs data representing the avatar for visual display, and data representing the statement for the avatar for audio play.
Owner:FORD GLOBAL TECH LLC

Advanced voice and data operations in a mobile data communication device

A system and method for integrating voice and data operations into a single mobile device capable of simultaneously performing data and voice actions. The mobile device working in a network capable of exchanging both cell phone calls and data items to the mobile device. By wearing an earphone or an ear-bud device the user is capable of dealing with voice conversations while working with data centric information related to the current caller. By providing a data-centric device with voice capabilities there is a new range of features that allow incoming data events to trigger outgoing voice events.
Owner:MALIKIE INNOVATIONS LTD

Speech recognition based on interactive information retrieval scheme using dialogue control to reduce user stress

In the disclosed speech recognition based interactive information retrieval scheme, the recognition target words in the speech recognition database are divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time and that have relatively higher importance levels based on statistical information, and the other non-prioritized recognition target words. Then, the speech recognition processing for the speech input with respect to the prioritized recognition target words is carried out at higher priority, and a confirmation process is carried out when the recognition result satisfies a prescribed condition for judging that the retrieval key can be determined only by a confirmation process with the user. On the other hand, a related information query to request the user to enter another speech input for a related information of the retrieval key is carried out when the recognition result does not satisfy the prescribed condition, and the original recognition result is adjusted according to the recognition result for another speech input. In this way, the retrieval key determination is realized through natural speech dialogues with the user.
Owner:NIPPON TELEGRAPH & TELEPHONE CORP

Speech dialog control based on signal pre-processing

A speech dialog system interfaces a user to a computer. The system includes a signal pre-processor that processes a speech input to generate an enhanced signal and an analysis signal. A speech recognition unit may generate a recognition result based on the enhanced signal. A control unit may manage an output unit or an external device based on the information within the analysis signal.
Owner:CERENCE OPERATING CO

System and method of automating a spoken dialogue service

ActiveUS8566102B1Improve the level ofQuickly and easily synchronizesSpeech recognitionWeb siteSpoken language
A system and method of generating and operating a spoken dialog service for a web-site are disclosed. The system parses web-site data and organizes the web-site data in a task knowledge data bank. The system receives text associated with a user query; processes the received text in a spoken language understanding (SLU) module, the SLU module using the web-site data from the task knowledge data bank; generates a ranked list of relevant responses to the user query; generates a hierarchical tree using the web-site data and the ranked list of relevant responses to the user query, generates a response to the user query using the hierarchical tree; and presents the response to the user.
Owner:NUANCE COMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products