Patents

Literature

Patsnap Eureka AI that helps you search prior art, draft patents, and assess FTO risks, powered by patent and scientific literature data.

3560 results about "Speech input" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

Speech input is one of the most innovative browser technologies to appear in recent months. It’s easy to implement and there are several obvious uses: assistive dictation for those with impaired mobility. an alternative input option for mobile phones and tablets, and. any environment where a keyboard or mouse is impractical.

Distributed voice user interface

InactiveUS6408272B1Low costSmall sizeSpeech recognitionRemote systemSpeech input

A distributed voice user interface system includes a local device which receives speech input issued from a user. Such speech input may specify a command or a request by the user. The local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself. If not, the local device initiates communication with a remote system for further processing of the speech input.

Distributed voice user interface

Distributed voice user interface

Distributed voice user interface

Owner:INTELLECTUAL VENTURES I LLC

Intelligent automated assistant for TV user interactions

ActiveUS20150382047A1Television system detailsVideo data queryingInteraction systemsDisplay device

Systems and processes are disclosed for controlling television user interactions using a virtual assistant. A virtual assistant can interact with a television set-top box to control content shown on a television. Speech input for the virtual assistant can be received from a device with a microphone. User intent can be determined from the speech input, and the virtual assistant can execute tasks according to the user's intent, including causing playback of media on the television. Virtual assistant interactions can be shown on the television in interfaces that expand or contract to occupy a minimal amount of space while conveying desired information. Multiple devices associated with multiple displays can be used to determine user intent from speech input as well as to convey information to users. In some examples, virtual assistant query suggestions can be provided to the user based on media content shown on a display.

Intelligent automated assistant for TV user interactions

Intelligent automated assistant for TV user interactions

Intelligent automated assistant for TV user interactions

Owner:APPLE INC

Speech interface system and method for control and interaction with applications on a computing system

ActiveUS8165886B1Reduce ambiguityImprove integritySound input/outputSpeech recognitionSpeech inputApplication software

A speech processing system which exploits statistical modeling and formal logic to receive and process speech input, which may represent data to be received, such as dictation, or commands to be processed by an operating system, application or process. A command dictionary and dynamic grammars are used in processing speech input to identify, disambiguate and extract commands. The logical processing scheme ensures that putative commands are complete and unambiguous before processing. Context sensitivity may be employed to differentiate data and commands. A multi faceted graphic user interface may be provided for interaction with a user to speech enable interaction with applications and processes that do not necessarily have native support for speech input.

Speech interface system and method for control and interaction with applications on a computing system

Speech interface system and method for control and interaction with applications on a computing system

Speech interface system and method for control and interaction with applications on a computing system

Owner:SAMSUNG ELECTRONICS CO LTD

Multimodal natural language query system and architecture for processing voice and proximity-based queries

InactiveUS7376645B2Data processing applicationsDigital data information retrievalDatabase querySpeech input

The present invention provides a wireless natural language query system, architecture, and method for processing multimodally-originated queries, including voice and proximity-based queries. The natural language query system includes a Web-enabled device including a speech input module for receiving a voice-based query in natural language form from a user and a location / proximity module for receiving location / proximity information from a location / proximity device. The natural language query system also includes a speech conversion module for converting the voice-based query in natural language form to text in natural language form and a natural language processing module for converting the text in natural language form to text in searchable form. The natural language query system further includes a semantic engine module for converting the text in searchable form to a formal database query and a database-look-up module for using the formal database query to obtain a result related to the voice-based query in natural language form from a database.

Multimodal natural language query system and architecture for processing voice and proximity-based queries

Multimodal natural language query system and architecture for processing voice and proximity-based queries

Multimodal natural language query system and architecture for processing voice and proximity-based queries

Owner:PORTAL COMM LLC

Automated database assistance using a telephone for a speech based or text based multimedia communication mode

InactiveUS6996531B2Digital data information retrievalAutomatic call-answering/message-recording/conversation-recordingAutomated databaseSpeech identification

An interface for remote human input for reading a database, the interface including an automatic voice question unit for eliciting speech input, a speech recognition unit for recognizing human speech input, and a data recognition unit for recognizing remote data input. The interface is associated with a database to search the database using the recognized input. A typical application is as an automated directory enquiry service.

Automated database assistance using a telephone for a speech based or text based multimedia communication mode

Automated database assistance using a telephone for a speech based or text based multimedia communication mode

Automated database assistance using a telephone for a speech based or text based multimedia communication mode

Owner:AMAZON TECH INC

Systems and methods for hands-free notification summaries

ActiveUS20140195252A1Adjusting operationAdapt to the environmentNatural language data processingSpeech recognitionHands freeSpeech input

A method includes outputting an alert corresponding to an information item. In some implementations, the alert is a sound. In some implementations, the alert is ambiguous (e.g., the sound indicates several possible information items). The method further includes receiving a speech input after outputting the alert. The method further includes determining whether the speech input includes a request for information about the alert. The method further includes, in response to determining that the speech input includes a request for information about the alert, providing a first speech output including information about the alert.

Systems and methods for hands-free notification summaries

Systems and methods for hands-free notification summaries

Systems and methods for hands-free notification summaries

Owner:APPLE INC

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

InactiveUS7873654B2Semantic analysisDigital data processing detailsDatabase querySpeech input

The present invention provides a natural language query system and method for processing and analyzing multimodally-originated queries, including voice and proximity-based queries. The natural language query system includes a Web-enabled device including a speech input module for receiving a voice-based query in natural language form from a user and a location / proximity module for receiving location / proximity information from a location / proximity device. The query system also includes a speech conversion module for converting the voice-based query in natural language form to text in natural language form and a natural language processing module for converting the text in natural language form to text in searchable form. The query system further includes a semantic engine module for converting the text in searchable form to a formal database query and a database-look-up module for using the formal database query to obtain a result related to the voice-based query in natural language form from a database.

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

Owner:PORTAL COMM LLC

Method for processing the output of a speech recognizer

ActiveUS8219407B1Improve integrityReduce ambiguityEngine fuctionsBlade accessoriesUser inputSpeech identification

A system and method for processing speech input comprising a speech recognizer and a logical command processor which facilitates additional processing of speech input beyond the speech recognizer level. A speech recognizer receives input from a user, and when a command is identified in the speech input, if the command meets conditions that require additional processing, a representation of the speech input s stored for subsequent processing. A logical command processor performs additional processing of command input by analyzing the command and its elements, determining which elements are required for successful processing the command and which elements are present and lacking. The user is prompted to supply missing information, and subsequent user input is added to the command structure until the command input is aborted or the command structure reaches sufficient completeness to enable execution of the command. Thereby, speech input of complex commands in natural language in a system running a plurality of applications and processes is made possible.

Method for processing the output of a speech recognizer

Method for processing the output of a speech recognizer

Method for processing the output of a speech recognizer

Owner:GREAT NORTHERN RES

Interface with Gaze Detection and Voice Input

ActiveUS20120295708A1Dashboard fitting arrangementsInstrument arrangements/adaptationsSpeech inputHuman–computer interaction

Methods, computer programs, and systems for interfacing a user with a computer program, utilizing gaze detection and voice recognition, are provided. One method includes an operation for determining if a gaze of a user is directed towards a target associated with the computer program. The computer program is set to operate in a first state when the gaze is determined to be on the target, and set to operate in a second state when the gaze is determined to be away from the target. When operating in the first state, the computer program processes voice commands from the user, and, when operating in the second state, the computer program omits processing of voice commands.

Interface with Gaze Detection and Voice Input

Interface with Gaze Detection and Voice Input

Interface with Gaze Detection and Voice Input

Owner:SONY COMPUTER ENTERTAINMENT INC

Prioritizing Selection Criteria by Automated Assistant

ActiveUS20130111348A1Improve user interactionEffectively engageNatural language translationSemantic analysisSelection criterionSpeech input

Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A user request is received, the user request including at least a speech input received from a user. The user request including the speech input is processed to obtain a representation of user intent for identifying items of a selection domain based on at least one selection criterion. A prompt is provided to the user, the prompt presenting two or more properties relevant to items of the selection domain and requesting the user to specify relative importance between the two or more properties. A listing of search results is provided to the user, where the listing of search results has been obtained based on the at least one selection criterion and the relative importance provided by the user.

Prioritizing Selection Criteria by Automated Assistant

Prioritizing Selection Criteria by Automated Assistant

Prioritizing Selection Criteria by Automated Assistant

Owner:APPLE INC

Process for automatic control of one or more devices by voice commands or by real-time voice dialog and apparatus for carrying out this process

InactiveUS6839670B1Reduce spendingSpeech recognitionElectric/fluid circuitAutomatic controlSpeech identification

A speech dialog system wherein a process for automatic control of devices by speech dialog is used applying methods of speech input, speech signal processing and speech recognition, syntatical-grammatical postediting as well as dialog, executive sequencing and interface control, and which is characterized in that syntax and command structures are set during real-time dialog operation; preprocessing, recognition and dialog control are designed for operation in a noise-encumbered environment; no user training is required for recognition of general commands; training of individual users is necessary for recognition of special commands; the input of commands is done in linked form, the number of words used to form a command for speech input being variable; a real-time processing and execution of the speech dialog is established; and the speech input and output is done in the hands-free mode.

Process for automatic control of one or more devices by voice commands or by real-time voice dialog and apparatus for carrying out this process

Process for automatic control of one or more devices by voice commands or by real-time voice dialog and apparatus for carrying out this process

Process for automatic control of one or more devices by voice commands or by real-time voice dialog and apparatus for carrying out this process

Owner:NUANCE COMM INC

Interface for a Virtual Digital Assistant

ActiveUS20140040748A1Sound input/outputSpeech recognitionComputer graphics (images)Speech input

The digital assistant displays a digital assistant object in an object region of a display screen. The digital assistant then obtains at least one information item based on a speech input from a user. Upon determining that the at least one information item can be displayed in its entirety in the display region of the display screen, the digital assistant displays the at least one information item in the display region, where the display region and the object region are not visually distinguishable from one another. Upon determining that the at least one information item cannot be displayed in its entirety in the display region of the video display screen, the digital assistant displays a portion of the at least one information item in the display region, where the display region and the object region are visually distinguishable from one another.

Interface for a Virtual Digital Assistant

Interface for a Virtual Digital Assistant

Interface for a Virtual Digital Assistant

Owner:APPLE INC

System and method of a list commands utility for a speech recognition command system

InactiveUS20100169098A1Facilitates a phrase modeEasy to installSpeech recognitionSpecial data processing applicationsCommand and controlCommand system

In embodiments of the present invention, a system and computer-implemented method for enabling a user to interact with a mobile device using a voice command may include the steps of defining a structured grammar for generating a global voice command, defining a global voice command of the structured grammar, wherein the global voice command enables access to an object of the mobile device using a single command, and mapping at least one function of the object to the global voice command, wherein upon receiving voice input from the user of the mobile device, the object recognizes the global voice command and controls the function.

System and method of a list commands utility for a speech recognition command system

System and method of a list commands utility for a speech recognition command system

System and method of a list commands utility for a speech recognition command system

Owner:PATCH KIMBERLY C

Techniques for disambiguating speech input using multimodal interfaces

ActiveUS7684985B2Improve speech recognition processSpeech recognitionSpecial data processing applicationsCombined useSpeech input

A technique is disclosed for disambiguating speech input for multimodal systems by using a combination of speech and visual I / O interfaces. When the user's speech input is not recognized with sufficiently high confidence, a the user is presented with a set of possible matches using a visual display and / or speech output. The user then selects the intended input from the list of matches via one or more available input mechanisms (e.g., stylus, buttons, keyboard, mouse, or speech input). These techniques involve the combined use of speech and visual interfaces to correctly identify user's speech input. The techniques disclosed herein may be utilized in computer devices such as PDAs, cellphones, desktop and laptop computers, tablet PCs, etc.

Techniques for disambiguating speech input using multimodal interfaces

Techniques for disambiguating speech input using multimodal interfaces

Techniques for disambiguating speech input using multimodal interfaces

Owner:WALOOMBA TECH

Method for processing speech signal features for streaming transport

InactiveUS7376556B2Flexibly and optimally distributedImprove accuracyNatural language translationData processing applicationsNetwork onClient server systems

Speech signal information is formatted, processed and transported in accordance with a format adapted for TCP / IP protocols used on the Internet and other communications networks. NULL characters are used for indicating the end of a voice segment. The method is useful for distributed speech recognition systems such as a client-server system, typically implemented on an intranet or over the Internet based on user queries at his / her computer, a PDA, or a workstation using a speech input interface.

Method for processing speech signal features for streaming transport

Method for processing speech signal features for streaming transport

Method for processing speech signal features for streaming transport

Owner:NUANCE COMM INC

Disambiguation Based on Active Input Elicitation by Intelligent Automated Assistant

ActiveUS20130110515A1Improve user interactionEffectively engageNatural language translationSemantic analysisUser inputSpeech input

Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A user request is received, the user request including at least a speech input received from a user. Two or more alternative interpretations of user intent are obtained based on the received user request. One or more commonalities and one or more differences among the two or more alternative interpretations of user intent are identified. A response is provided to the user, the response presenting at least one of the identified differences and eliciting additional user input to choose among the two or more alternative interpretations of user intent based on the at least one difference.

Disambiguation Based on Active Input Elicitation by Intelligent Automated Assistant

Disambiguation Based on Active Input Elicitation by Intelligent Automated Assistant

Disambiguation Based on Active Input Elicitation by Intelligent Automated Assistant

Owner:APPLE INC

Digital assistant providing whispered speech

ActiveUS20170358301A1Speech recognitionSpeech synthesisSpeech inputSpeech sound

Systems and processes for detecting and / or providing a whispered speech response are provided. In one example process, speech is received from a user, and based on the speech input, determined that a whispered speech response is to be provided. Upon determining that a whispered speech response is to be provided, the whispered speech response is generated and provided to the user.

Digital assistant providing whispered speech

Digital assistant providing whispered speech

Digital assistant providing whispered speech

Owner:APPLE INC

Replying to text messages via automated voice search techniques

ActiveUS20100145694A1Limiting distractionDistractionSpeech recognitionExact matchDistraction

An automated “Voice Search Message Service” provides a voice-based user interface for generating text messages from an arbitrary speech input. Specifically, the Voice Search Message Service provides a voice-search information retrieval process that evaluates user speech inputs to select one or more probabilistic matches from a database of pre-defined or user-defined text messages. These probabilistic matches are also optionally sorted in terms of relevancy. A single text message from the probabilistic matches is then selected and automatically transmitted to one or more intended recipients. Optionally, one or more of the probabilistic matches are presented to the user for confirmation or selection prior to transmission. Correction or recovery of speech recognition errors avoided since the probabilistic matches are intended to paraphrase the user speech input rather than exactly reproduce that speech, though exact matches are possible. Consequently, potential distractions to the user are significantly reduced relative to conventional speech recognition techniques.

Replying to text messages via automated voice search techniques

Replying to text messages via automated voice search techniques

Replying to text messages via automated voice search techniques

Owner:MICROSOFT TECH LICENSING LLC

Handwriting and voice input with automatic correction

InactiveUS7319957B2Speech recognitionCharacter recognitionHandwritingProcess systems

A hybrid approach to improve handwriting recognition and voice recognition in data process systems is disclosed. In one embodiment, a front end is used to recognize strokes, characters and / or phonemes. The front end returns candidates with relative or absolute probabilities of matching to the input. Based on linguistic characteristics of the language, e.g. alphabetical or ideographic language for the words being entered, e.g. frequency of words and phrases being used, likely part of speech of the word entered, the morphology of the language, or the context in which the word is entered), a back end combines the candidates determined by the front end from inputs for words to match with known words and the probabilities of the use of such words in the current context.

Handwriting and voice input with automatic correction

Handwriting and voice input with automatic correction

Handwriting and voice input with automatic correction

Owner:TEGIC COMM

Context aware service provision method and apparatus of user device

ActiveUS20140082501A1Devices with voice recognitionSubstation equipmentContext-aware servicesUser device

A context aware service provision method and apparatus for recognizing the user context and executing an action corresponding to the user context according to a rule defined by the user and feeding back the execution result to the user interactively are provided. The method for providing a context-aware service includes receiving a user input, the user input being at least one of a text input and a speech input, identifying a rule including a condition and an action corresponding to the condition based on the received user input, activating the rule to detect a context which corresponds to the condition of the rule, and executing, when the context is detected, the action corresponding to the condition.

Context aware service provision method and apparatus of user device

Context aware service provision method and apparatus of user device

Context aware service provision method and apparatus of user device

Owner:SAMSUNG ELECTRONICS CO LTD

Method and apparatus for searching for music based on speech recognition

InactiveUS20080249770A1Metadata audio data retrievalSpeech recognitionPersonalizationAcoustic model

Provided is a method and apparatus for searching music based on speech recognition. By calculating search scores with respect to a speech input using an acoustic model, calculating preferences in music using a user preference model, reflecting the preferences in the search scores, and extracting a music list according to the search scores in which the preferences are reflected, a personal expression of a search result using speech recognition can be achieved, and an error or imperfection of a speech recognition result can be compensated for.

Method and apparatus for searching for music based on speech recognition

Method and apparatus for searching for music based on speech recognition

Method and apparatus for searching for music based on speech recognition

Owner:SAMSUNG ELECTRONICS CO LTD

Method and apparatus for improving the transcription accuracy of speech recognition software

ActiveUS7805299B2Improve accuracyEasy to identifySpeech recognitionDigital dataSpeech identification

A virtual vocabulary database is provided for use with a with a particular user database as part of a speech recognition system. Vocabulary elements within the virtual database are imported from the user database and are tagged to include numerical data corresponding to the historical use of the vocabulary element within the user database. For each speech input, potential vocabulary element matches from the speech recognition system are provided to the virtual database software which creates virtual sub-vocabularies from the criteria according to predefined criteria templates. The software then applies vocabulary element weighting adjustments according to the virtual sub-vocabulary weightings and applies the adjustment to the default weighting provided by the speech recognition system. The modified weightings are returned with the associated vocabulary elements to the speech engine for selection of an appropriate match to the input speech.

Method and apparatus for improving the transcription accuracy of speech recognition software

Method and apparatus for improving the transcription accuracy of speech recognition software

Method and apparatus for improving the transcription accuracy of speech recognition software

Owner:COIFMAN ROBERT E

Method and apparatus for media rendering services using gesture and/or voice control

InactiveUS20130063369A1Speech recognitionInput/output processes for data processingUser inputApplication software

An approach for providing media rendering services using touch input and voice input. An apparatus invokes a media application and presents media content at the apparatus. The apparatus monitors for touch input and / or voice input to execute a function to apply the media content. The apparatus receives user input as a sequence of user actions, wherein each of the user actions is provided via the touch input or the voice input. The touch input or the voice input is received without presentation of an input prompt that overlays or alters the media content

Method and apparatus for media rendering services using gesture and/or voice control

Method and apparatus for media rendering services using gesture and/or voice control

Method and apparatus for media rendering services using gesture and/or voice control

Owner:VERIZON PATENT & LICENSING INC

Signal adaptation for higher band coding in a codec utilizing band split coding

InactiveUS20050004793A1Speech analysisPattern perceptionSpeech input

The present invention describes a novel methodology for adjusting a bandwidth extension algorithm by adapting one or more of enhancing perception parameters (e.g., a signal level, a signal energy and/or a gain) of a high-band encoded signal based on the characteristics of the input signal and an encoding performance in a low band with a codec utilizing audio-band-split coding by separate encoders and decoders for each audio band. The adaptation is based on the low-band coding algorithm. It can be at least two types of such an algorithm: e.g., an algebraic code excitation linear prediction (ACELP) algorithm for a speech-like input signal and a transform algorithm of a non-speech-like input signal, such that when the ACELP coding is selected, the corresponding enhancing perception parameter is gradually tuned down and when the encoding algorithm is changed to the transform coding, the corresponding enhancing perception parameter is gradually tuned up.

Signal adaptation for higher band coding in a codec utilizing band split coding

Signal adaptation for higher band coding in a codec utilizing band split coding

Signal adaptation for higher band coding in a codec utilizing band split coding

Owner:NOKIA CORP

Communication system with handset for distributed processing

InactiveUS6125284ACordless telephonesSpecial service for subscribersThird partyCommunications system

A communication system comprising at least one mobile handheld telephone handset adapted to communicate via a wireless telephony medium with a telephone network handling system. The handset comprises input devices to receive input from a user and produce signals dependent thereupon, an onboard processor to adapt speech input to produce a voice transmission signal as part of a telephone conversation with a third party; and an antenna to transmit the voice transmission signal via the wireless telephony medium. The telephone network handling system comprises a receiver to receive the voice transmission signal, and means to forward the voice signal to a third party. The handset further comprises a first processor to carry out a first processing step on selected input signals and produce data dependent thereupon which preserves predetermined information necessary to carry out a remote second processing step, an onboard processor to adapt the data according to a conventional wireless telephony protocol to produce a transmission signal, and an antenna to transmit the transmission signal via the wireless telephony medium to the telephone network handling system. The system further comprises a remote processor adapted to receive and adapt the transmission signal from the telephone network handling system to regenerate the data, and to carry out a second processing step on the data and produce an output dependent thereupon.

Communication system with handset for distributed processing

Communication system with handset for distributed processing

Communication system with handset for distributed processing

Owner:CABLE & WIRELESS PLC

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

ActiveUS20110093271A1Inaccurate and imprecise and unreliable and trainingData processing applicationsSemantic analysisDatabase querySpeech input

The present disclosure provides a natural language query system and method for processing and analyzing multimodally-originated queries, including voice and proximity-based queries. The natural language query system includes a Web-enabled device including a speech input module for receiving a voice-based query in natural language form from a user and a location / proximity module for receiving location / proximity information from a location / proximity device. The query system also includes a speech conversion module for converting the voice-based query in natural language form to text in natural language form and a natural language processing module for converting the text in natural language form to text in searchable form. The query system further includes a semantic engine module for converting the text in searchable form to a formal database query and a database-look-up module for using the formal database query to obtain a result related to the voice-based query in natural language form from a database.

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

Multimodal natural language query system for processing and analyzing voice and proximity-based queries

Owner:PORTAL COMM LLC

Recognition architecture for generating Asian characters

ActiveUS20080270118A1Easy to determineEasy inputNatural language data processingSpeech recognitionSpeech inputLanguage speech

Architecture for correcting incorrect recognition results in an Asian language speech recognition system. A spelling mode can be launched in response to receiving speech input, the spelling mode for correcting incorrect spelling of the recognition results or generating new words. Correction can be obtained using speech and / or manual selection and entry. The architecture facilitates correction in a single pass, rather than multiples times as in conventional systems. Words corrected using the spelling mode are corrected as a unit and treated as a word. The spelling mode applies to languages of at least the Asian continent, such as Simplified Chinese, Traditional Chinese, and / or other Asian languages such as Japanese.

Recognition architecture for generating Asian characters

Recognition architecture for generating Asian characters

Recognition architecture for generating Asian characters

Owner:MICROSOFT TECH LICENSING LLC

Method and system for providing alternatives for text derived from stochastic input sources

InactiveUS7546529B2Easy to editImprove editing efficiencyNatural language data processingSound input/outputSpeech identificationSpeech input

A computer-implemented method for providing a candidate list of alternatives for a text selection containing text from multiple input sources, each of which can be stochastic (such as a speech recognition unit, handwriting recognition unit, or input method editor) or non-stochastic (such as a keyboard and mouse). A text component of the text selection may be the result of data processed through a series of stochastic input sources, such as speech input that is converted to text by a speech recognition unit before being used as input into an input method editor. To determine alternatives for the text selection, a stochastic input combiner parses the text selection into text components from different input sources. For each stochastic text component, the combiner retrieves a stochastic model containing alternatives for the text component. If the stochastic text component is the result of a series of stochastic input sources, the combiner derives a stochastic model that accurately reflects the probabilities of the results of the entire series. The combiner creates a list of alternatives for the text selection by combining the stochastic models retrieved. The combiner may revise the list of alternatives by applying natural language principles to the text selection as a whole. The list of alternatives for the text selection is then presented to the user. If the user chooses one of the alternatives, then the word processor replaces the text selection with the chosen candidate.

Method and system for providing alternatives for text derived from stochastic input sources

Method and system for providing alternatives for text derived from stochastic input sources

Method and system for providing alternatives for text derived from stochastic input sources

Owner:MICROSOFT TECH LICENSING LLC

Voice Recognition Device and Method, and Program

InactiveUS20080052073A1Short timeSpeech recognitionWord selectionSpeech input

A speech recognition system in which a user may correct a recognition error resulting from speech recognition more efficiently and easily. Speech recognition means compares a plurality of words inputted from speech input means with a plurality of words stored in dictionary means, respectively, and determines a most-competitive word candidate. Word correction means has a word correction function of correcting the words constituting a word sequence displayed on a screen. Competitive word display commanding means selects one or more competitive words having competitive probabilities close to the competitive probability of the most-competitive word candidate and displays the one or more competitive words adjacent to the most-competitive word candidate. Competitive word selection means selects an appropriate correction word from the one or more competitive words. Word replacement commanding means causes one of the most-competitive word candidate to be replaced with the correction word selected by the competitive word selection means.

Voice Recognition Device and Method, and Program

Voice Recognition Device and Method, and Program

Voice Recognition Device and Method, and Program

Owner:NAT INST OF ADVANCED IND SCI & TECH

Method for correcting a speech response and natural language dialogue system

ActiveUS20140188477A1Easy to useSpeech recognitionSpeech inputSpeech sound

A natural language dialogue system and a method capable of correcting a speech response are provided. The method includes following steps. A first speech input is received. At least one keyword included in the first speech input is parsed to obtain a candidate list having at least one report answers. One of the report answers is selected from the candidate list as a first report answer, and a first speech response is output according to the first report answer. A second speech input is received and parsed to determine whether the first report answer is correct. If the first report answer is incorrect, another report answer other than the first report answer is selected from the candidate list as a second report answer. According to the second report answer, a second speech response is output.

Method for correcting a speech response and natural language dialogue system

Method for correcting a speech response and natural language dialogue system

Method for correcting a speech response and natural language dialogue system

Owner:VIA TECH INC

Popular searches

Voice user interface Set top box Cable television Multiple device User intent Microphone Media content Ambiguity Graphics User interface