Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

30results about "Terminals with audio html browser" patented technology

Dialog recognition and control in a voice browser

A voice browser dialog enabler for multimodal dialog uses a multimodal markup document with fields have markup-based forms associated with each field and defining fragments. A voice browser driver resides on a communication device and provides the fragments and identifiers that identify the fragments. A voice browser implementation resides on a remote voice server and receives the fragments from the driver and downloads a plurality of speech grammars. Input speech is matched against those speech grammars associated with the corresponding identifiers received in a recognition request from the voice browser driver.
Owner:GOOGLE TECH HLDG LLC

Enabling Dynamic VoiceXML In An X+ V Page Of A Multimodal Application

Enabling dynamic VoiceXML in an X+V page of a multimodal application implemented with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a VoiceXML interpreter, including representing by the multimodal browser an XML element of a VoiceXML dialog of the X+V page as an ECMAScript object, the XML element comprising XML content; storing by the multimodal browser the XML content of the XML element in an attribute of the ECMAScript object; and accessing the XML content of the XML element in the attribute of the ECMAScript object from an ECMAScript script in the X+V page.
Owner:NUANCE COMM INC

Servers for web enabled speech recognition

A markup language for execution on a client device in a client / server system includes instructions to unify at least one of recognition-related events, GUI events and telephony events on non-display, voice input based client device and a multimodal based client for a web server interacting with each of the client devices. A recognition server for receiving data indicative of inputted data provided to a client device and an indication of a grammar to use for recognition is also provided.
Owner:MICROSOFT TECH LICENSING LLC

Invoking Tapered Prompts In A Multimodal Application

Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.
Owner:NUANCE COMM INC

VOIP barge-in support for half-duplex DSR client on a full-duplex network

Providing VOIP barge-in support for a half-duplex DSR client on a full-duplex network by buffering, in a half-duplex DSR client, input audio from the full-duplex network; playing, through the half-duplex DSR client, the buffered input audio; pausing, during voice activity on the half-duplex DSR client, the playing of the buffered input audio; sending, during voice activity on the half-duplex DSR client, speech for recognition through the full-duplex network to a voice server; receiving in the half-duplex DSR client through the full-duplex network from the voice server notification of speech recognition, the notification bearing a time stamp; and, responsive to receiving the notification, resuming the playing of the buffered input audio, including playing only buffered VOIP audio data bearing time stamps later than the time stamp of the recognition notification.
Owner:NUANCE COMM INC

Web enabled recognition architecture

A server / client system for processing data includes a network having a web server with information accessible remotely. A client device includes a microphone and a rendering component such as a speaker or display. The client device is configured to obtain the information from the web server and record input data associated with fields contained in the information. The client device is adapted to send the input data to a remote location with an indication of a grammar to use for recognition. A recognition server receives the input data and the indication of the grammar. The recognition server returns data indicative of what was recognized to at least one of the client and the web server.
Owner:MICROSOFT TECH LICENSING LLC

Personal voice-based information retrieval system

The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
Owner:WEBLEY SYST

Personal and remote article-on-demand system

A communication system (10) includes a remote personal communication device (14) that has an input device 68, 70, 72) and a data transceiver (62). The input device (68, 70, 72) receives a text article request command from a user. The data transceiver (62) transmits text article request signals in response to the text article request command and receives text article signals from an article source (12) in response to the text article request signals. A text-to-speech converter (46, 66) converts the text article signals into speech signals. An audio system (74) is coupled to the text-to-speech converter (46, 66) and audibly transmits the speech signals to the user.
Owner:E3 INFOSYST

Apparatus and method for contacting a customer support line on customer's behalf and having a customer support representative contact the customer

A method is provided in an application server configured for responding to hypertext transport protocol (HTTP) requests. The method includes storing, in response to a first HTTP request, an XML document that specifies for a user, a call number of a second party. The stored XML document is retrieved based on a second HTTP request by the user. A first HTML document is generated based on the retrieved XML document. The first HTML document has instructions including the call number for accessing the second party. A second HTML document is generated based on a prescribed input received from the second party. The second HTML document has instructions for connecting the second party with the user. Hence, a user may speak with a called party without ever having to remain on hold.
Owner:CISCO TECH INC

World Wide Telecom Web Voice Applications

A framework for creating a voice application in a world wide telecom web (WWTW) is provided. The techniques include using a pre-defined schema to create one or more voice application templates, using the one or more voice application templates to generate a first version of the voice application, using the first version of the voice application and a library of one or more components to generate a deployable version of the voice application and deploying the deployable version of the voice application onto a run-time execution engine.
Owner:IBM CORP

Device and method for the creation of a voice browser functionality

Services and performance characteristics are more and more frequently defined according to standard descriptions and formats, which is the case also for announcement services and dialogue services required especially in network services, for example. The associated descriptions are also provided in a standard form, e.g. by means of VoiceXML. When a service is introduced in the network, said descriptions are inserted into the network nodes, application, and / or media server. A browser functionality which reads and interprets the VoiceXML pages is required for processing the VoiceXML description on a media server platform such that the necessary basic functions of the media server can be allocated to the desired service and can be controlled. Prior art uses media servers comprising a single VoiceXML browser. The problem with such commercial products lies in the resulting suboptimality in terms of resource utilization and expenses, which is the case particularly for simple applications. The invention resolves said problem by providing a plurality of browser functionalities. In the inventive multistep method, the optimal browser functionalities are determined step by step by evaluating signaling data and descriptive data as required.
Owner:NOKIA SIEMENS NETWORKS GMBH & CO KG

Automated conversation assistance

Methods, apparatuses, systems, and computer-readable media for providing automated conversation assistance are presented. According to one or more aspects, a computing device may obtain user profile information associated with a user of the computing device, the user profile information including a list of one or more words that have previously been detected in one or more previously captured speeches associated with the user. Subsequently, the computing device may select, based on the user profile information, one or more words from a captured speech for inclusion in a search query. Then, the computing device may generate the search query based on the selected one or more words.
Owner:QUALCOMM INC

Invoking tapered prompts in a multimodal application

Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.
Owner:NUANCE COMM INC

World wide telecom web voice applications

A framework for creating a voice application in a world wide telecom web (WWTW) is provided. The techniques include using a pre-defined schema to create one or more voice application templates, using the one or more voice application templates to generate a first version of the voice application, using the first version of the voice application and a library of one or more components to generate a deployable version of the voice application and deploying the deployable version of the voice application onto a run-time execution engine.
Owner:IBM CORP

System for dispatching information packets and method therefor

A system (20) for simplex dispatch of an information packet (22) utilizing a telecommunication network (24) is provided. The system (20) includes an origination unit (26), a server (42), and a destination unit (28). The origination unit (26) is configured to generate an origination packet (50) containing a voice frame (54), and to transmit the origination packet (50) utilizing a wireless non-circuit-switching service of network (24). The origination unit (26) and the server (42) are coupled through an origination cell site (36) of the network (24). The server (42) is configured to receive the origination packet (50), to convert the origination packet (50) to a destination packet (52) containing a voice frame (54) and / or a text frame (56), and to transmit the destination packet (52). The server (42) and the destination unit (28) are coupled through a destination cell site (46) of the network (24). The destination unit (28) is configured to receive the destination packet (52) utilizing a non-circuit-switching service of the network (24), and to present the contents of the destination packet (52) to a recipient (174).
Owner:RED HAT

Method and device for sending and receiving browser voice, and voice talkback system

The embodiment of the invention provides a method and a device for sending and receiving browser voice, and a voice talkback system. The method for sending the browser voice comprises the following steps that: through the audio sampling interface of a browser, collecting first speech data; obtaining a first voice sampling parameter corresponding to voice data which can be processed by target equipment; on the basis of the first voice sampling parameter, sampling the first voice data to obtain target voice data; and sending the target voice data to target equipment. When the embodiment of the invention is applied, a browser can realize a voice talkback function under a situation that no plugins are used.
Owner:HANGZHOU HIKVISION DIGITAL TECH

Personal Voice-Based Information Retrieval System

The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
Owner:WEBLEY SYST

System for dispatching information packets and method therefor

A system (20) for simplex dispatch of an information packet (22) utilizing a telecommunication network (24) is provided. The system (20) includes an origination unit (26), a server (42), and a destination unit (28). The origination unit (26) is configured to generate an origination packet (50) containing a voice frame (54), and to transmit the origination packet (50) utilizing a wireless non-circuit-switching service of network (24). The origination unit (26) and the server (42) are coupled through an origination cell site (36) of the network (24). The server (42) is configured to receive the origination packet (50), to convert the origination packet (50) to a destination packet (52) containing a voice frame (54) and / or a text frame (56), and to transmit the destination packet (52). The server (42) and the destination unit (28) are coupled through a destination cell site (46) of the network (24). The destination unit (28) is configured to receive the destination packet (52) utilizing a non-circuit-switching service of the network (24), and to present the contents of the destination packet (52) to a recipient (174).
Owner:RED HAT

Identifying system structure of WEB invocation

A server / client system for processing data includes a network having a web server with information accessible remotely. A client device includes a microphone and a rendering component such as a speaker or display. The client device is configured to obtain the information from the web server and record input data associated with fields contained in the information. The client device is adapted to send the input data to a remote location with an indication of a grammar to use for recognition. A recognition server receives the input data and the indication of the grammar. The recognition server returns data indicative of what was recognized to at least one of the client and the web server.
Owner:MICROSOFT TECH LICENSING LLC

Invoking tapered prompts in a multimodal application

Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.
Owner:MICROSOFT TECH LICENSING LLC

Exchange apparatus and method of selecting voice storing area

According to one embodiment, an exchange apparatus comprising a first storing unit which includes a plurality of voice storing areas, a read unit which reads, from the first storing unit, information indicating one of the voice storing areas corresponding to a request to record a message, upon receiving the request from an information terminal, if the information terminal has a web function, a second storing unit which provides the information, indicating the one voice storing area and read by the read unit, with link information used for accessing the one voice storing area, to generate a web page and store the generated web page, and a transmission unit which generates URL information used to access the web page stored in the second storing unit, and transmit the web page to the information terminal.
Owner:KK TOSHIBA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products