Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

2781 results about "Natural language processing" patented technology

Natural language processing (NLP) is a subfield of linguistics, computer science, information engineering, and artificial intelligence concerned with the interactions between computers and human (natural) languages, in particular how to program computers to process and analyze large amounts of natural language data.

Method and apparatus for textual exploration discovery

The present invention relates to a method and system for textual exploration and discovery. More specifically, the method and system provide a text-driven and grammar based tool for textual exploration and textual navigation. The facilities for textual exploration and textual navigation are based on a system of index entries that are connected to the underlying text segments from which the index entries are derived. Text units with particular grammatical, semantic, and / or pragmatic features constitute bundles of sentences or text zones.
Owner:BRIT HELLE AARSKOG +1

Virtual Keyboard Systems with Automatic Correction

There is disclosed an enhanced text entry system which uses word-level analysis to correct inaccuracies automatically in user keystroke entries on reduced-size or virtual keyboards. A method and system are defined which determine one or more alternate textual interpretations of each sequence of inputs detected within a designated auto-correcting region. The actual interaction locations for the keystrokes may occur outside the boundaries of the specific keyboard key regions associated with the actual characters of the word interpretations proposed or offered for selection, where the distance from each interaction location to each corresponding intended character may in general increase with the expected frequency of the intended word in the language or in a particular context. Likewise, in a virtual keyboard system, the keys actuated may differ from the keys actually associated with the letters of the word interpretations. Each such sequence corresponds to a complete word, and the user can easily select the intended word from among the generated interpretations. Additionally, when the system cannot identify a sufficient number of likely word interpretation candidates of the same length as the input sequence, candidates are identified whose initial letters correspond to a likely interpretation of the input sequence.
Owner:TEGIC COMM

Method and system for interactive conversational dialogue for cognitively overloaded device users

A system and method to interactively converse with a cognitively overloaded user of a device, includes maintaining a knowledge base of information regarding the device and a domain, organizing the information in at least one of a relational manner and an ontological manner, receiving speech from the user, converting the speech into a word sequence, recognizing a partial proper name in the word sequence, identifying meaning structures from the word sequence using a model of the domain information, adjusting a boundary of the partial proper names to enhance an accuracy of the meaning structures, interpreting the meaning structures in a context of the conversation with the cognitively overloaded user using the knowledge base, selecting a content for a response to the cognitively overloaded user, generating the response based on the selected content, the context of the conversation, and grammatical rules, and synthesizing speech wave forms for the response.
Owner:ROBERT BOSCH CORP +1

Linguistic user interface

A system for retrieval of text includes a processor which identifies grammar rules associated with text fragments of a text string that is retrieved from an associated storage medium, and retrieves text strings from the storage medium which satisfy the grammar rules. A display displays retrieved text strings. A user input device in communication with the processor enables a user to select text fragments of the displayed text strings for generating a query. The processor identifies grammar rules associated with the user-selected text fragments and retrieves text strings from the storage medium which satisfy the grammar rules.
Owner:MAJANDRO LLC

Method and system for generating spelling suggestions

A computer implemented method of suggesting replacement words for words of a string. In the method, an input string of input words is received. The input words are then matched to subject words of a candidate table. Next, candidate replacement words and scores from the candidate table corresponding to the matched subject words are extracted. Each score is indicative of a probability that the input word should be replaced with the corresponding candidate replacement word. Finally, replacement of the input words with their corresponding candidate replacement words is selectively suggested based on the scores for the replacement words. Another aspect of the present invention is directed to a spell checking system that is configured to implement the method.
Owner:MICROSOFT TECH LICENSING LLC

Knowledge-based flexible natural speech dialogue system

A knowledge-based natural speech dialogue system includes: (i) a knowledge support system, (ii) a flexible dialogue control system, and (iii) a context information system. Flexibilities of the conversation structure, inherent in mixed-initiative mode for dealing with complex user request, are managed because the knowledge structures involved are represented by additional, powerful knowledge representation tools, and because the context information is retained by more specific data structures, which covers larger temporal scopes by the logic of the conversation, rather than by a fixed locality of the grammar flow. This system provides a simple yet reliable method to compensate for these factors to enable more powerful conversation engines with mixed-initiative capabilities.
Owner:VOICE ENABLING SYST TECH

Modifying electronic documents with recognized content or other associated data

Systems and methods enhance editing capabilities associated with a wide variety of different types of electronic documents. Such systems and methods may include a processor that maintains an electronic document having a first portion (e.g., an individual word, character, character string, or the like) provided by a recognizer (e.g., by handwriting or speech recognition software), and they may provide access to potential alternative characters, words, or character strings generated by the recognizer during various user editing functions and operations. Other data associated with the first portion of the document also may be stored and made available to the user during various functions and operations. This invention further relates to computer-readable media including instructions for performing various methods and / or operating various systems for editing electronic documents, including systems and methods like those described above.
Owner:MICROSOFT TECH LICENSING LLC

System, Method, Apparatus and Computer Program Product for Providing Dynamic Vocabulary Prediction for Speech Recognition

InactiveUS20080154600A1Improve speech processingAccurate word recognitionSpeech recognitionNatural language processingConfidence measures
An apparatus for providing dynamic vocabulary prediction for setting up a speech recognition network of resource constrained portable devices may include a recognition network element. The recognition network element may be configured to determine a confidence measure for each candidate recognized word for a current word to be recognized. The recognition network element may also be configured to select a subset of candidate recognized words as selected candidate words based on the confidence measure of each one of the candidate recognized words, and determine a recognition network for a next word to be recognized, the recognition network including likely follower words for each of the selected candidate words, e.g. using language model and highly frequently used words.
Owner:NOKIA CORP

Systems and methods for text to speech synthesis

Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
Owner:APPLE INC

Method and system for building and contracting a linguistic dictionary

A method for building and contracting a linguistic dictionary, the linguistic dictionary comprising a list of surface forms and a list of normalized forms, each normalized form being associated with a surface form, the method comprising the steps of: comparing each character of a surface form with each character of the surface form's normalized form; in response to the comparing step, determining an edit operation for each character compared; and generating a transform code from the set of the edit operations in order to transform the surface form to its normalized form.
Owner:IBM CORP

Supporting web-query expansion efficiently using multi-granularity indexing and query processing

A method and apparatus for efficient query expansion using reduced size indices and for progressive query processing. Queries are expanded conceptually, using semantically similar and syntactically related words to those specified by the user in the query to reduce the chances of missing relevant documents. The notion of a multi-granularity information and processing structure is used to support efficient query expansion, which involves an indexing phase, a query processing and a ranking phase. In the indexing phase, semantically similar words are grouped into a concept which results in a substantial index size reduction due to the coarser granularity of semantic concepts. During query processing, the words in a query are mapped into their corresponding semantic concepts and syntactic extensions, resulting in a logical expansion of the original query. Additionally, the processing overhead is avoided. The initial query words can then be used to rank the documents in the answer set on the basis of exact, semantic and syntactic matches and also to perform progressive query processing.
Owner:NEC CORP

Systems and methods for speech preprocessing in text to speech synthesis

Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
Owner:APPLE INC

Multilingual translation memory, translation method, and translation program

A translation memory adapted to sentence-based translation includes a first retrieval dictionary of a first language having a first dictionary TRIE structure and a first index TRIE structure, and a second retrieval dictionary of a second language having a second dictionary TRIE structure and a second index TRIE structure. The first and second dictionary TRIE structures respectively store sub character strings on respective paths from roots to multiple nodes thereof. The first and second index TRIE structure respectively store index information to specify the sub character strings stored in the first and second dictionary TRIE structures. The index information stored in the first retrieval dictionary includes identification information that identifies the index information to specify the sub character strings in the second retrieval dictionary that are translations of those in the first retrieval dictionary.
Owner:FUJIFILM BUSINESS INNOVATION CORP

Text pre-processing for text-to-speech generation

A system and method are provided for improved speech synthesis, wherein text data is pre-processed according to updated grammar rules or a selected group of grammar rules. In one embodiment, the TTS system comprises a first memory adapted to store a text information database, a second memory adapted to store grammar rules, and a receiver adapted to receive update data regarding the grammar rules. The system also includes a TTS engine adapted to retrieve at least one text entry from the text information database, pre-process the at least one text entry by applying the updated grammar rules to the at least one text entry, and generate speech based at least in part on the least one pre-processed text entry.
Owner:HONDA MOTOR CO LTD

Method and apparatus for correction of spelling errors in text composition

Method and apparatus for correction of spelling errors in text composition is provided. Correctly spelled words may be determined for an incorrectly spelled word in accordance with a common error approach. Corrections to the incorrectly word are applied using common typographical and spelling errors. Resulting words which are correctly spelled may be offered as replacements or automatically used to replace the incorrectly word. Corrections may be applied in response to a frequency of the error in text composition to generate correct words that are more likely to be the word intended to be composed. The specific order of application may be configurable to meet a user's preference or a learned behavior for a user.
Owner:BLACKBERRY LTD

Natural input recognition tool

A recognition tool according to various examples of the invention intelligently recognizes natural input before it is passed to a destination or target application. More particularly, the recognition tool according to various examples of the invention provides better formatting for text recognized from natural input, based upon the context in which the text is being inserted into a target application. The recognition tool also provides various tools for correcting inaccurately recognized text. The recognition tool may allow a user to select only a part of an inaccurate text, and then identify alternate text candidates based upon the selected portion of the inaccurate text. Further, when the user selects text containing multiple words for correction, the recognition tool provides cross combinations of alternate text candidates for the user's selection. Still further, if the user replaces inaccurate text by submitting a new natural input object, the recognition tool ensures that the text recognized from the new natural input object is different from the inaccurate text been replaced. The recognition tool additionally affects the recognition experience after recognized text has been provided to the target application. The recognition tool provides the target application with the original natural input object for the recognized text, along with the alternate text candidates for that original natural input object. Thus, the target application can use the alternate text candidates to correct inaccurately recognized text. Further, a user can insert the original natural input object for recognized text within the target application.
Owner:MICROSOFT TECH LICENSING LLC

Identifying language attributes through probabilistic analysis

A system and method for identifying language attributes through probabilistic analysis is described. A set of language classes and a plurality of training documents are defined, Each language class identifies a language and a character set encoding. Occurrences of one or more document properties within each training document are evaluated. For each language class, a probability for the document properties set conditioned on the occurrence of the language class is calculated. Byte occurrences within each training document are evaluated. For each language class, a probability for the byte occurrences conditioned on the occurrence of the language class is calculated.
Owner:GOOGLE LLC

Machine translation techniques

Machine translation decoding is accomplished by receiving as input a text segment in a source language to be translated into a target language, generating an initial translation as a current target language translation, applying one or more modification operators to the current target language translation to generate one or more modified target language translations, determining whether one or more of the modified target language translations represents an improved translation in comparison with the current target language translation, setting a modified target language translation as the current target language translation, and repeating these steps until occurrence of a termination condition. Automatically generating a tree (e.g., either a syntactic tree or a discourse tree) can be accomplished by receiving as input a tree corresponding to a source language text segment, and applying one or more decision rules to the received input to generate a tree corresponding to a target language text segment.
Owner:UNIV OF SOUTHERN CALIFORNIA

Method for Spell-Checking Location-Bound Words Within a Document

The present invention uses spatial dictionaries (termed locationaries), for meaningfully checking and correcting the spelling of text on Geographic Information System (GIS) maps or other computer documents containing spatially, or coordinate-, bound text. Locationaries keep track of correctly spelled words and the geographic areas within which they are considered correctly spelled. In a preferred embodiment, locationaries are feature classes, ‘map layers’ of properly-spelled words or attributed features (objects). The invention selects such features from locationaries based on spatial criteria relative to the textual objects to spell-check. It then exports the attribute values to dictionaries that can be used by conventional spell checking engines. Once all errors and suggestions are found with such an engine and within user-specified criteria, a dialog is shown that contains a row per potentially-misspelled word found. From that dialog, the user can edit the word in the original textual object of the map.
Owner:EDGETECH AMERICA

Speech Recognition Dialog Management

InactiveUS20090018829A1Increase flexibilityEasy and faster and more productive to interactSpeech recognitionNatural language processingDialog management
Described is a speech recognition dialog management system that allows more open-ended conversations between virtual agents and people than are possible using just agent-directed dialogs. The system uses both novel dialog context switching and learning algorithms based on spoken interactions with people. The context switching is performed through processing multiple dialog goals in a last-in-first-out (LIFO) pattern. The recognition accuracy for these new flexible conversations is improved through automated learning from processing errors and addition of new grammars.
Owner:METAPHOR SOLUTIONS

Systems and methods for spell correction of non-roman characters and words

Systems and methods to process and correct spelling errors for non-Roman based words such as in Chinese, Japanese, and Korean languages using a rule-based classifier and a hidden Markov model are disclosed. The method generally includes converting an input entry in a first language such as Chinese to at least one intermediate entry in an intermediate representation, such as pinyin, different from the first language, converting the intermediate entry to at least one possible alternative spelling or form of the input in the first language, and determining that the input entry is either a correct or questionable input entry when a match between the input entry and all possible alternative spellings to the input entry is or is not located, respectively. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.
Owner:GOOGLE LLC

Speech recognition semantic classification training

An automated method is described for developing an automated speech input semantic classification system such as a call routing system. A set of semantic classifications is defined for classification of input speech utterances, where each semantic classification represents a specific semantic classification of the speech input. The semantic classification system is trained from training data having little or no in-domain manually transcribed training data, and then operated to assign input speech utterances to the defined semantic classifications. Adaptation training data based on input speech utterances is collected with manually assigned semantic labels. When the adaptation training data satisfies a pre-determined adaptation criteria, the semantic classification system is automatically retrained based on the adaptation training data.
Owner:NUANCE COMM INC

Method and apparatus for detecting incorrectly translated text in a document

A system that detects incorrectly translated text in a document. During operation, the system determines a target language for the document. The system then identifies one or more terms in the document that are invalid terms for the language. Next, the system analyzes correlations between these invalid terms, and, based on these correlations, determines whether one or more of the invalid terms comprise incorrectly translated text.
Owner:GOOGLE LLC

Hybrid Speech Synthesizer, Method and Use

Disclosed are novel embodiments of a speech synthesizer and speech synthesis method for generating human-like speech wherein a speech signal can be generated by concatenation from phonemes stored in a phoneme database. Wavelet transforms and interpolation between frames can be employed to effect smooth morphological fusion of adjacent phonemes in the output signal. The phonemes may have one prosody or set of prosody characteristics and one or more alternative prosodies may be created by applying prosody modification parameters to the phonemes from a differential prosody database. Preferred embodiments can provide fast, resource-efficient speech synthesis with an appealing musical or rhythmic output in a desired prosody style such as reportorial or human interest. The invention includes computer-determining a suitable prosody to apply to a portion of the text by reference to the determined semantic meaning of another portion of the text and applying the detennined prosody to the text by modification of the digitized phonemes. In this manner, prosodization can effectively be automated.
Owner:LESSAC TECH INC

Method and System for Building and Contracting a Linguistic Dictionary

A method for building and contracting a linguistic dictionary, the linguistic dictionary comprising a list of surface forms and a list of normalized forms, each normalized form being associated with a surface form, the method comprising the steps of: comparing each character of a surface form with each character of the surface form's normalized form; in response to the comparing step, determining an edit operation for each character compared; and generating a transform code from the set of the edit operations in order to transform the surface form to its normalized form.
Owner:IBM CORP

Phonetic-based text input method

A method for transliterating languages in a computer system is disclosed. The method includes using a phonetic mapping engine to map languages to phonetic strings and vice versa. A user unfamiliar with a keyboard layout for a particular language may type phonetic strings using a known keyboard layout of a language known to the user. The typed input is captured by a keyboard hook, and based on a predefined phonetic mapping scheme, can be converted to the desired language.
Owner:MICROSOFT TECH LICENSING LLC

Incremental ideographic character input method

A method for incremental recognition of ideographic handwriting comprises in order the steps of: (1) entering in a natural stroke order at least one stroke of an ideographic character from a computer entry tablet; (2) providing the at least one stroke to an incremental character recognizer, which produces a hypothesis list of at least one candidate character; (3) displaying a hypothesis list of candidate characters containing the at least one stroke; (4) selecting a correct character from among the candidate characters on the hypothesis list if it a correct character appears thereon; (5) entering in natural stroke order at least one additional stroke of the ideographic character from the computer entry tablet if no candidate character is a correct character; (6) providing the additional stroke(s) to the incremental character recognizer, which produces an updated hypothesis list; (7) displaying the updated hypothesis list of candidate characters containing every stroke; (8) selecting a correct character from among the candidate characters on the updated hypothesis list if it a correct character appears thereon; and (9) repeating steps (5) through (8) until a correct character is selected from the updated hypothesis list.
Owner:SYNAPTICS INC

Ultralink text analysis tool

The invention is a computer implemented method that allows a user to search a text for specific words (e.g. words found in a user-defined library), visually identify any such words found in the text, and have at their fingertips a utility for further exploration of each identified word. The utility, sometimes herein called an ultralink utility, allows a wide variety of options for further exploration of the identified word. The user thereby saves time in explorations; in addition the utility can assist in directing the user to inquiries which may be important for the user.
Owner:CRONENBERGER DANIEL +7
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products