Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

71 results about "Voice browser" patented technology

A voice browser is a software application that presents an interactive voice user interface to the user in a manner analogous to the functioning of a web browser interpreting Hypertext Markup Language (HTML). Dialog documents interpreted by voice browser are often encoded in standards-based markup languages, such as Voice Dialog Extensible Markup Language (VoiceXML), a standard by the World Wide Web Consortium.

Dialog recognition and control in a voice browser

A voice browser dialog enabler for multimodal dialog uses a multimodal markup document with fields have markup-based forms associated with each field and defining fragments. A voice browser driver resides on a communication device and provides the fragments and identifiers that identify the fragments. A voice browser implementation resides on a remote voice server and receives the fragments from the driver and downloads a plurality of speech grammars. Input speech is matched against those speech grammars associated with the corresponding identifiers received in a recognition request from the voice browser driver.
Owner:GOOGLE TECH HLDG LLC

Dynamic switching between local and remote speech rendering

A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.
Owner:NUANCE COMM INC

Voice browser with weighting of browser-level grammar to enhance usability

A computer system in the form of a voice command platform includes a voice browser and voice-based applications. The voice browser has global-level grammar elements and the voice applications have application-level grammar and grammar elements. A programming feature is provided by which developers of the voice applications can programmably weigh or weight global-level grammar elements relative to the application-level grammar or grammar elements. As a consequence of the weighting, a speech recognition engine for the voice browser is more likely to accurately recognize voice input from a user. The weighting can be applied on the application as a whole, or at any given state in the application. Also, the weighting can be made to the global level grammar elements as a group, or on an individual basis.
Owner:SPRINT SPECTRUM LLC

Method and apparatus for coupling a visual browser to a voice browser

A method and apparatus for concurrently accessing network-based electronic content in a Voice Browser and a Visual Browser can include the steps of retrieving a network-based document formatted for display in the Visual Browser; identifying in the retrieved document a reference to the Voice Browser, the reference specifying electronic content formatted for audible presentation in the Voice Browser; and, transmitting the reference to the Voice Browser. The Voice Browser can retrieve the specified electronic content and audibly present the electronic content. Concurrently, the Visual Browser can visually present the network-based document formatted for visual presentation in the Visual Browser. Likewise, the method of the invention can include the steps of retrieving a network-based document formatted for audible presentation in the Voice Browser; identifying in the retrieved document a reference to the Visual Browser, the reference specifying electronic content formatted for visual presentation in the Visual Browser; and, transmitting the reference to the Visual Browser. The Visual Browser can retrieve the specified electronic content and visually present the specified electronic content. Concurrently, the Voice Browser can audibly present the network-based document formatted for audible presentation in the Voice Browser.
Owner:NUANCE COMM INC

Voice browser implemented as a distributable component

A system for implementing voice services can include at least one virtual machine, such as a Java 2 Enterprise Edition (J2EE) virtual machine. The virtual machine can include a bean container for handling software beans, such as Enterprise Java Beans. The bean container can include a voice browser bean. The voice browser bean can include a VoiceXML browser.
Owner:NUANCE COMM INC

Method and apparatus for optimizing a spoken dialog between a person and a machine

A network comprises a number of speech-enabled sites maintaining a number of voice pages. A central server on the network executes a voice browser which provides users with access to the sites using voice-activated hyperlinks. The server also maintains and brokers information associated with the users based on spoken dialogs between the users and the sites. In response to a user accessing a given ASR site, information about that user is provided by the server for use by that ASR site. The information is used by the ASR site to optimize a spoken dialog between the user and the ASR site by reducing the amount of information the user is required to provide during the dialog. Information about the user can thereby be shared between separate speech enabled sites, in a manner which is transparent to the user, in order to expedite the user's interaction with those sites.
Owner:NUANCE COMM INC

Multi-modal information retrieval system

Multi-modal system which enables documents to be interpreted as either or both of voice browser-based documents and / or visual browser-based documents for a thin client such as a portable telephone. Special techniques and additions are added into the document to enable the document to be converted between voice markup and visual markup languages. In addition, the document can be simultaneously viewed in both of the voice markup and the visual markup languages. Special techniques are used to allow keeping track of the browsing position within this document.
Owner:V ENABLE

Method, system, and apparatus for a voice markup language interpreter and voice browser

The present invention can include a method of allocating an interpreter module within an application program. The application program can create one or more interpreter module instances. The method also can include updating a property descriptor of the interpreter module instance and directing the interpreter module instance to allocate speech and audio resources. Content then can be loaded into the interpreter module instance and run.
Owner:NUANCE COMM INC

Wireless multimodal voice browser for wireline-based IPTV services

In accordance with one aspect of the present invention, a method selects a program from a library of programs. A user selection is determined based upon a voice command, and the program is presented at a display device in accordance with the voice command. In accordance with another aspect of the present invention, a system selects a program from a library of programs. The system includes a processor that determines a user selection based upon a voice command, and also includes a display device that presents the program in accordance with the voice command. In accordance with yet another embodiment of the present invention, a computer-readable medium contains a set of instructions that when executed by a processor cause the processor to determine a user selection based upon a voice command and to command a display device to present the program, in accordance with the voice command.
Owner:NUANCE COMM INC

Universal voice browser framework

A system and method for providing voice communications over a network, including a media server having a voice rendering platform for supporting voice-based user interactions. The voice rendering platform provides prompting, information collection and validation, and audio recording and transcoding. A separate application server is responsible for performing call control actions. During a user dialog, control is passed to the media server for rendering and receiving data, and passed back to the application server for call control actions. The media server includes a script execution environment for rendering the voice components of the user dialogs. The application server is an execution platform for applications written in a procedural programming language. Voice communications over a network are provided without combining call control and voice rendering functionality into a single, script execution platform.
Owner:DIALOGIC INC

Wireless multimodal voice browser for wireline-based IPTV services

In accordance with one aspect of the present invention, a method selects a program from a library of programs. A user selection is determined based upon a voice command, and the program is presented at a display device in accordance with the voice command. In accordance with another aspect of the present invention, a system selects a program from a library of programs. The system includes a processor that determines a user selection based upon a voice command, and also includes a display device that presents the program in accordance with the voice command. In accordance with yet another embodiment of the present invention, a computer-readable medium contains a set of instructions that when executed by a processor cause the processor to determine a user selection based upon a voice command and to command a display device to present the program, in accordance with the voice command.
Owner:MICROSOFT TECH LICENSING LLC

Robust voice browser system and voice activated device controller

The present invention relates to a system for acquiring information from sources on a network, such as the Internet. A voice browsing system maintains a database containing a list of information sources, such as web sites, connected to a network. Each of the information sources is assigned a rank number which is listed in the database along with the record for the information source. In response to a speech command received from a user, a network interface system accesses the information source with the highest rank number in order to retrieve information requested by the user.
Owner:PARUS HLDG

Synchronization among plural browsers

A technique for synchronizing a visual browser and a voice browser. A visual browser is used to navigate through visual content, such as WML pages. During the navigation, the visual browser creates a historical record of events that have occurred during the navigation. The voice browser uses this historical record to navigate the content in the same manner as occurred on the visual browser, thereby synchronizing to a state equivalent to that of the visual browser. The creation of the historical record may be performed by using a script to trap events, where the script contains code that records the trapped events. The synchronization technique may be used with a multi-modal application that permits the mode of input / output (I / O) to be changed between visual and voice browsers. When the mode is changed from visual to voice, the record of events captured by the visual browser is provided to the voice browser, thereby allowing the I / O mode to change seamlessly from visual to voice. Likewise, the voice browser captures events which may be provided to the visual browser when the I / O mode is changed from voice to visual.
Owner:GULA CONSULTING LLC

System and media for simplifying web contents, and method thereof

A technique for the simplification of Web pages in order to access necessary information rapidly, when displaying or outputting Web pages using a small screen device or a voice browser. A method for simplifying Web contents includes the steps of acquiring a target page subject to simplification, acquiring adjoining pages that adjoin the target page and performing a difference operation to delete objects that are common among the target page and the adjoining pages from the target page to generate a simplified page.
Owner:TWITTER INC

System and method for multi-modal authentication using speaker verification

A system authenticates an electronic transaction between a first user-operated device and a computer. The computer is configured to conduct electronic transactions. The system includes a voice browser and a session correlator. The voice browser is configured to receive and process user-spoken information from a second user-operated device, where the voice browser is programmed to compare a user-spoken transaction identifier to a computer generated transaction identifier, and to compare a user-spoken verification identifier to a voice print of the user. The session correlator is coupled to the voice browser and is configured to transmit an authentication message to the computer if the user-spoken transaction identifier matches the computer transaction identifier, and if the user-spoken verification identifier matches the voice print.
Owner:RAKUTEN GRP INC

Idiom handling in voice service systems

The grammar of the speech input to a voice service system is normally specified by the voice service system. However, this can produce problems in respect of idioms, such as dates, which are expressed different ways by different users. To facilitate the handling of idioms, a user is permitted to specify their own idiom grammar which is then used by the voice service system to interpret idioms in speech input from that user. Typically, the normal grammar of speech input is specified by grammar tags used to mark up a voice page script interpreted by a voice page browser; in this case, it will generally be the voice browser that is responsible for employing the user-specified idiom grammar to interpret the corresponding idiom in the speech input by the user. The user-specified grammar can be pre-specified directly to the voice browser by the user or fetched by the browser from a remote location on the fly.
Owner:REGIONAL RESOURCES LTD

Voice extensible markup language-based announcements for use with intelligent network services

An advanced intelligent network (AIN) telephony service provided in a public switched telephone network (PSTN) is controlled based on instructions from a subscriber, browsing a packet switched data network using a voice browser. A voice extensible markup language (VXML) platform is connected to a web server and a service control point (SCP). The web server stores announcements that can be easily created and modified by a subscriber. When the VXML platform receives an announcement ID from the SCP, the VXML platform determines a location where the announcement is stored based on the ID, and then plays the remotely stored announcement. Thus, the announcements can be easily changed without affecting the SCP and other AIN network elements.
Owner:SBC KNOWLEDGE VENTURES LP

IMS-based interactive media system and method

An interactive media system for an IP Multimedia Subsystem (IMS) architecture includes a media resource function processor (MRFP) and a media resource function controller (MRFC). The MRFP includes media resources, such as automatic speech recognition (ASR) engines and text-to-speech (TTS) engines, and a voice browser. A caller places a call to a telephone number to reach an interactive media service, and the MRFC establishes a real-time packet media session between the media gateway and the MRFP. The voice browser queries a location server to determine the location of an interactive application associated with the called telephone number and then queries an application server corresponding to the location to obtain the interactive application as a voice extensible markup language (VXML) document. The browser interprets the VXML document and responsively invokes the media resources, via the MRFC, to provide the interactive media service to the caller.
Owner:SPRINT SPECTRUM LLC

Barge-in capabilities of a voice browser

A method for barging-in a call can include the step of establishing a component as a communication intermediary between at least one speech engine and a telephone gateway. The component can include at least one input media port and at least one output media port, where the input media port and the output media port can interface with selective ones of the speech engines. A barge-in command can be received while an automated prompt is being audibly presented. The signals for the prompt can be conveyed to the component via the output media port. Once the barge-in command has been received, the output media port can be closed to immediately stop the prompt from being presented. A halt command can be conveyed to a prompt generating source so that the source stops generating the prompt.
Owner:NUANCE COMM INC

Scalable call management system

A scalable call management system. The system can include at least one voice server hosting one or more voice browsers, the voice server having a single communications port through which voice call requests can be processed by the voice browsers, each voice browser having a port alias through which call requests can be processed. The system also can include a call processing gateway linking telephony endpoints in a public switched telephone network (PSTN) to the voice server. Finally, the system can include a translation table mapping port aliases to respective voice browsers.
Owner:IBM CORP +1

Enhanced go-back feature system and method for use in a voice portal

A system, method and computer program product are provided for transitioning among states during use of a voice browser. Initially, a script is received at a voice browser from a web server utilizing a network. Next, the script is executed utilizing the voice browser. A plurality of states is then automatically tracked during the execution of the script utilizing the voice browser. Upon receiving a request from a user during the execution of the script to transition to a previous state, the voice browser automatically transitions to the previous state of the script.
Owner:NUANCE COMM INC

Voice extensible markup language-based announcements for use with intelligent network services

An advanced intelligent network (AIN) telephony service provided in a public switched telephone network (PSTN) is controlled based on instructions from a subscriber, browsing a packet switched data network using a voice browser. A voice extensible markup language (VXML) platform is connected to a web server and a service control point (SCP). The web server stores announcements that can be easily created and modified by a subscriber. When the VXML platform receives an announcement ID from the SCP, the VXML platform determines a location where the announcement is stored based on the ID, and then plays the remotely stored announcement. Thus, the announcements can be easily changed without affecting the SCP and other AIN network elements.
Owner:SBC KNOWLEDGE VENTURES LP

Information retrieval system including voice browser and data conversion server

A method for retrieving content from one or more remote information sources is disclosed herein. The inventive method contemplates transmitting a user request over a communication link to a voice browser operative in accordance with a voice-based protocol. In response, a browsing request identifying a remote information source corresponding to the user request is generated. Content formatted in accordance with a predefined protocol is then retrieved from the remote information source in accordance with the browsing request. The retrieved content is converted into a file of information formatted in compliance with the voice-based protocol. A response is provided to the user request on the basis of the file of converted information.
Owner:V ENABLE

Integrating a voice browser into a web 2.0 environment

The present invention discloses a system and method for integrating a voice browser into a Web 2.0 environment. For example, a system is disclosed which includes at least a Web 2.0 server, a voice browser, and a server-side speech processing system. The Web 2.0 server can serve Web 2.0 content comprising at least one speech-enabled application. The served Web 2.0 content can include voice markup. The voice browser can render the Web 2.0 content received from the Web 2.0 server which includes rendering the voice markup. The server-side speech processing system can handle speech processing operations for the speech-enabled application. Communications with the server-side speech processing system occur via a set of RESTful commands, such as an HTTP GET command, an HTTP POST command, an HTTP PUT command, and an HTTP DELETE command.
Owner:IBM CORP

Providing user customization of web 2.0 applications

The present invention discloses a solution for user customization of Web 2.0 applications. In the solution, a browser can establish a connection with a Web 2.0 server. The browser can convey user identification information to the Web 2.0 server and request a Web 2.0 application. Based on the conveyed identifying information, a user can be identified. The Web 2.0 server can then query a server-side data store for customizations associated with the identified user and requested application. When customizations exist in the data store, the customizations can be applied to the Web 2.0 application. The customized Web 2.0 application can then be conveyed to the browser. The browser can be a Web browser or a voice browser depending upon implementation specifics.
Owner:IBM CORP

Voice extensible markup language-based web interface with intelligent network services

An advanced intelligent network (AIN) telephony service provided in a public switched telephone network (PSTN) is controlled based on instructions from a subscriber, browsing a packet switched data network using a voice browser. The packet switched data network may include the Internet. The subscriber is connected to the voice browser through the PSTN using, for example, a GR-1129 audio channel set up between the voice browser and a switch in the PSTN. The browsing is based on a verbal instruction received from the subscriber through the PSTN. A web server in the packet switched data network is connected to the voice browser and call routing data is retrieved based on the verbal instruction. At least one call is routed through the PSTN based on the call routing data retrieved from the web server.
Owner:SBC KNOWLEDGE VENTURES LP

Automated confirmation and disambiguation modules in voice applications

A method for providing a voice application includes executing control flow logic modeling a dialog flow with a user via a voice browser. The control flow logic produces a disambiguation requirement. A disambiguation module is initiated and a set of at least two candidates and partitioning criteria is sent from the control flow logic to the module. Attributes of the candidates are analyzed to determine a partitioning score for each attribute indicative of ability to distinguish between candidates based on the partitioning criteria. The attributes are sorted based on the partitioning scores. The user is queried based on a top-sorted attribute and results of the query are used to reduce the set of candidates. The steps of analyzing, sorting, and querying are repeated until the set of candidates is reduced to a single candidate. The single candidate is returned to the control flow logic for continued execution.
Owner:IBM CORP

Content interpolating web proxy server

A content interpolating web proxy server is configured in a computer network for processing retrieved web content so as to place it in a format suitable for presentation on a particular client device such as, e.g., a computer, personal digital assistant (PDA), wireless telephone or voice browser-equipped device. The server processes a client request generated by a client device to determine a particular client type associated with the client device, retrieves web content identified in the client request, retrieves one or more augmentation files associated with the web content and the particular client type, and alters the retrieved web content in accordance with the one or more augmentation files. The altered web content is then delivered to the client device. The one or more augmentation files may be co-located with the web content at a site remote from the proxy server, such that the content owner need not own, maintain or otherwise control the proxy server.
Owner:SOUND VIEW INNOVATIONS

Method and apparatus for coupling a visual browser to a voice browser

A method and apparatus for concurrently accessing network-based electronic content in a Voice Browser and a Visual Browser can include the steps of retrieving a network-based document formatted for display in the Visual Browser; identifying in the retrieved document a reference to the Voice Browser, the reference specifying electronic content formatted for audible presentation in the Voice Browser; and, transmitting the reference to the Voice Browser. The Voice Browser can retrieve the specified electronic content and audibly present the electronic content. Concurrently, the Visual Browser can visually present the network-based document formatted for visual presentation in the Visual Browser. Likewise, the method of the invention can include the steps of retrieving a network-based document formatted for audible presentation in the Voice Browser; identifying in the retrieved document a reference to the Visual Browser, the reference specifying electronic content formatted for visual presentation in the Visual Browser; and, transmitting the reference to the Visual Browser. The Visual Browser can retrieve the specified electronic content and visually present the specified electronic content. Concurrently, the Voice Browser can audibly present the network-based document formatted for audible presentation in the Voice Browser.
Owner:NUANCE COMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products