Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

95 results about "Lexical database" patented technology

A lexical database is a lexical resource which has an associated software environment database which permits access to its contents. The database may be custom-designed for the lexical information or a general-purpose database into which lexical information has been entered.

Method and apparatus for improving the transcription accuracy of speech recognition software

A virtual vocabulary database is provided for use with a with a particular user database as part of a speech recognition system. Vocabulary elements within the virtual database are imported from the user database and are tagged to include numerical data corresponding to the historical use of the vocabulary element within the user database. For each speech input, potential vocabulary element matches from the speech recognition system are provided to the virtual database software which creates virtual sub-vocabularies from the criteria according to predefined criteria templates. The software then applies vocabulary element weighting adjustments according to the virtual sub-vocabulary weightings and applies the adjustment to the default weighting provided by the speech recognition system. The modified weightings are returned with the associated vocabulary elements to the speech engine for selection of an appropriate match to the input speech.
Owner:COIFMAN ROBERT E

Natural language processor

A computer program product for controlling the computer's processor to perform responsive actions a natural language input has: (1) vocabulary, phrase and concept databases of words, phrase and concepts, respectively, that can be recognized in the inputted communication, wherein each of these database elements is representable by a designated semantic symbol, (2) means for searching the inputted communication to identify the words in the communication that are contained within the vocabulary database, (3) means for expressing the communication in terms of the word semantic symbols that correspond to each of the words identified in the inputted communication, (4) means for searching the communication when expressed in terms of its corresponding word semantic symbols so as to identify the phrases in the communication that are contained within the phrase database, (5) means for expressing the communication in terms of the phrase semantic symbols that correspond to each of the phrases identified in the communication, (6) means for searching the communication when expressed in terms of its corresponding phrase semantic symbols so as to identify the concepts in the communication that are contained within the concept database, and (7) means for expressing the communication in terms of the concept semantic symbols that correspond to each of the concepts identified in the inputted communication, wherein these concept semantic symbols are recognizable by the processor and can cause the processor to take action responsive to the inputted communication.
Owner:SONUM TECH

Method and apparatus for improving the transcription accuracy of speech recognition software

The present invention involves the dynamic loading and unloading of relatively small text-string vocabularies within a speech recognition system. In one embodiment, sub-databases of high likelihood text strings are created and prioritized such that those text strings are made available within definable portions of computer-transcribed dictations as a first-pass vocabulary for text matches. Failing a match within the first-pass vocabulary, the voice recognition software attempts to match the speech input to text strings within a more general vocabulary. In another embodiment, the first-pass text string vocabularies are organized and prioritized and loaded in relation to specific fields within an electronic form, specific users of the system and / or other general context-based, interrelationships of the data that provide a higher probability of text string matches then those otherwise provided by commercially available speech recognition systems and their general vocabulary databases.
Owner:COIFMAN ROBERT E +1

Method and apparatus for performing speech recognition utilizing a supplementary lexicon of frequently used orthographies

The invention relates to a method and an apparatus for recognising speech, more particularly to a speech recognition system and method utilising a speech recognition dictionary supplemented by a lexicon containing frequently occurring word sequences (orthographies). In typical speech recognition systems, the process of speech recognition consists of scanning the vocabulary database or dictionary by using a fast match algorithm to find the top N candidates that potentially match the input speech. In a second pass the N candidates are re-scored using more precise likelihood computations. The novel method comprises the introduction of a step in the search stage that consists of forcing the insertion in the list of N candidates entries selected from a lexicon containing frequently used orthographies to increase the probability of occurrence of certain text combinations.
Owner:RPX CLEARINGHOUSE

Method and system for naming a cluster of words and phrases

The present invention provides a method, system and computer program for naming a cluster, or a hierarchy of clusters, of words and phrases that have been extracted from a set of documents. The invention takes these clusters as the input and generates appropriate labels for the clusters using a lexical database. Naming involves first finding out all possible word senses for all the words in the cluster, using the lexical database; and then augmenting each word sense with words that are semantically similar to that word sense to form respective definition vectors. Thereafter, word sense disambiguation is done to find out the most relevant sense for each word. Definition vectors are clustered into groups. Each group represents a concept. These concepts are thereafter ranked based on their support. Finally, a pre-specified number of words and phrases from the definition vectors of the dominant concepts are selected as labels, based on their generality in the lexical database. Therefore, the labels may not necessarily consist of the original words in the cluster. A hierarchy of clusters is named in a recursive fashion starting from leaf clusters. Dominant concepts in child clusters are propagated into their parent to reduce the labeling complexity of parent clusters.
Owner:MICRO FOCUS LLC

Text similarity, acceptation similarity calculating method and system and application system

The invention discloses a calculating method of text similarity degree and vocabulary meaning similarity degree and system and application system, which comprises the following steps: basing on vocabulary data bank; proceeding initialize; calculating; getting initial vocabulary meaning similarity degree among vocabulary in the vocabulary data bank; basing on the initial vocabulary meaning similarity degree; calculating initial semantic similarity degree among text; iterating semantic similarity degree among each text and vocabulary meaning similarity degree among vocabulary till constriction; constructuring final vocabulary meaning similar matrix with final vocabulary similarity degree; transforming the text vocabulary frequency vector of the initial text to the new text vocabulary text vocabulary frequency vector; calculating text similarity degree in the text collection. This invention can improve related property of current text especially about short text.
Owner:蒙圣光 +1

Method and system for naming a cluster of words and phrases

The present invention provides a method, system and computer program for naming a cluster, or a hierarchy of clusters, of words and phrases that have been extracted from a set of documents. The invention takes these clusters as the input and generates appropriate labels for the clusters using a lexical database. Naming involves first finding out all possible word senses for all the words in the cluster, using the lexical database; and then augmenting each word sense with words that are semantically similar to that word sense to form respective definition vectors. Thereafter, word sense disambiguation is done to find out the most relevant sense for each word. Definition vectors are clustered into groups. Each group represents a concept. These concepts are thereafter ranked based on their support. Finally, a pre-specified number of words and phrases from the definition vectors of the dominant concepts are selected as labels, based on their generality in the lexical database. Therefore, the labels may not necessarily consist of the original words in the cluster. A hierarchy of clusters is named in a recursive fashion starting from leaf clusters. Dominant concepts in child clusters are propagated into their parent to reduce the labeling complexity of parent clusters.
Owner:MICRO FOCUS LLC

Multiple layer information object repository

Techniques for relating data stored in one or more storage systems for an enterprise include managing information chunks in one or more storage systems. Each chunk comprises a unit of data for storage and retrieval operations. The techniques also include managing a vocabulary database. The vocabulary database includes data structures describing atomic concepts among names in an enterprise-specific vocabulary and data structures describing relationships among the atomic concepts. Content in a document is arranged based at least in part on data in the vocabulary database. The content is based at least in part on an information object or “chunk” in the storage system. Thus, content originally unrelated and authored over time by many different persons and organizations can be related using the business vocabulary concepts and relationships in the vocabulary database.
Owner:CISCO TECH INC

Multi-stage pattern reduction for natural language processing

A computer program product for controlling the computer's processor to perform responsive actions a natural language input has: (1) vocabulary, phrase and concept databases of words, phrase and concepts, respectively, that can be recognized in the inputted communication, wherein each of these database elements is representable by a designated semantic symbol, (2) means for searching the inputted communication to identify the words in the communication that are contained within the vocabulary database, (3) means for expressing the communication in terms of the word semantic symbols that correspond to each of the words identified in the inputted communication, (4) means for searching the communication when expressed in terms of its corresponding word semantic symbols so as to identify the phrases in the communication that are contained within the phrase database, (5) means for expressing the communication in terms of the phrase semantic symbols that correspond to each of the phrases identified in the communication, (6) means for searching the communication when expressed in terms of its corresponding phrase semantic symbols so as to identify the concepts in the communication that are contained within the concept database, and (7) means for expressing the communication in terms of the concept semantic symbols that correspond to each of the concepts identified in the inputted communication, wherein these concept semantic symbols are recognizable by the processor and can cause the processor to take action responsive to the inputted communication.
Owner:SONUM TECH

Multi-platform visual pronunciation dictionary

The multi-platform visual pronunciation dictionary is capable of cross-referencing words and phrases between a user's native language and a foreign language by presenting to the user a correct translation and pronunciation in a recorded video presentation by a native speaker of the foreign language. Monolinguistic cross-referencing may also be provided. The dictionary provides a user interface and lexical database designed to enable the learner to visualize and hear the target language. An electronic dictionary is provided and includes an interface with a visual display capable of playing high quality recordings showing a model speaker's face speaking the lexical item. The visual pronunciation dictionary has a plurality of high-quality synchronized video and sound recordings of a plurality of lexical items in a language spoken by a native speaker that is stored in a database and accessible by a user interface device. A dedicated SD-video-capable electronic dictionary may also be provided.
Owner:ANNAZ FAWAZ Y +1

Method and apparatus for improving the transcription accuracy of speech recognition software

A virtual vocabulary database is provided for use with a with a particular user database as part of a speech recognition system. Vocabulary elements within the virtual database are imported from the user database and are tagged to include numerical data corresponding to the historical use of the vocabulary element within the user database. For each speech input, potential vocabulary element matches from the speech recognition system are provided to the virtual database software which creates virtual sub-vocabularies from the criteria according to predefined criteria templates. The software then applies vocabulary element weighting adjustments according to the virtual sub-vocabulary weightings and applies the adjustment to the default weighting provided by the speech recognition system. The modified weightings are returned with the associated vocabulary elements to the speech engine for selection of an appropriate match to the input speech.
Owner:COIFMAN ROBERT E

Method and apparatus for improving the transcription accuracy of speech recognition software

A virtual vocabulary database is provided for use with a with a particular user database as part of a speech recognition system. Vocabulary elements within the virtual database are imported from the user database and are tagged to include numerical data corresponding to the historical use of the vocabulary element within the user database. For each speech input, potential vocabulary element matches from the speech recognition system are provided to the virtual database software which creates virtual sub-vocabularies from the criteria according to predefined criteria templates. The software then applies vocabulary element weighting adjustments according to the virtual sub-vocabulary weightings and applies the adjustment to the default weighting provided by the speech recognition system. The modified weightings are returned with the associated vocabulary elements to the speech engine for selection of an appropriate match to the input speech.
Owner:COIFMAN ROBERT E

Computer assisted hand language communication method under special session context

The invention discloses a computer assisted hand language communication method under special session context, which comprises the following steps: setting motion freedom range of each joint of a bone model through building the virtual bone model and special session context vocabulary database, and building the communication response corresponding relation of each joint motion of a virtual human and special vocabulary; identifying hand language information by picture processing and biological specificity through acquiring the hand language information sent by a dumb person to obtain the meaning of the acquired hand language information, converting the hand language sent by the dumb person into characters or language for informing an ordinary person, otherwise, automatically matching the corresponding answer speech after obtaining the character information input by the ordinary person or obtaining the meaning of the hand language sent by the identified dumb person, and processing the character information or answer speech and informing the dumb person by the hand language conversion, and realizing the session communication between the ordinary person and the dumb person under special occasion. The method can realize the communication between the ordinary person and the dumb person under special occasion.
Owner:XIAN UNIV OF TECH

Speech recognition for recognizing speaker-independent, continuous speech

A speech recognition method and apparatus are provided for converting a voice stream into a digital voice stream representation. A method for performing speech recognition on a voice stream according to a first method embodiment includes the steps of determining one or more candidate transnemes in the voice stream, mapping the one or more candidate transnemes to a transneme table to convert the one or more candidate transnemes to one or more found transnemes, and mapping the one or more found transnemes to a transneme-to-vocabulary database to convert the one or more found transnemes to one or more speech units.
Owner:NURV CENT TECH

System for generating data from social media messages for the real-time evaluation of publicly traded assets

A system for generating data from social media messages for the real-time evaluation of publicly traded assets includes an ingest component for ingesting the social media messages and a filter module eliminating expressions not considered useful language from social media messages and configuring input social media message into useful formats to form filtered social media messages. The system also includes a language processor processing the filtered social media messages based upon lexical databases to form filter and processed social media messages. The system further includes a sentiment calculator applying rules to the filtered and processed social media messages so as to compute a representation of sentiment values associated with the social media messages. A graphical user interface displaying the sentiment values is also provided.
Owner:KUBERA LOGIC LLC

Vocabulary generation system

Increasingly, conversational systems are used in coaching or supportive contexts, either in an embodied form (e.g., as an avatar in an app or website) or just in a speech-driven for (e.g. Siri). There is a need to keep such systems interesting and appealing over time in order to prevent the user from reducing use of the system or abandoning the system all together. The present system is configured to learn new expressions from user utterances and use them based on their predicted utility during interactions with the user. The present system includes components configured for learning new vocabulary and selecting vocabulary for generating new utterances from the system. This way, the system continually expands its vocabulary database with expressions familiar to and / or used by the user and will be able to engage the user with new utterances so that the user does not lose interest in the system.
Owner:KONINKLJIJKE PHILIPS NV

Voice signal repairing method and mobile terminal

The invention provides a voice signal repairing method and a mobile terminal, relating to the technical field of mobile terminals. The method comprises the following steps: when detecting that a received original voice signal has interruption, converting a continuous part of the original voice signal into reference characters; according to the reference characters, determining missing characters corresponding to a missing part of the original voice signal from a stored vocabulary database; converting the missing characters into a compensation voice signal; and inserting the compensation voice signal into the position of the missing part of the original voice signal, and playing the original voice signal inserted into the compensation voice signal. Therefore, the problem that after a voice signal is repaired through an existing method, the semantics expressed by the voice signal is still incomplete is solved, and thus the conversation quality is improved.
Owner:VIVO MOBILE COMM CO LTD

Speech recognition optimizing system aiming at locale language use preference and method thereof

InactiveCN101329868AImprove recognition rateSpeech recognition optimization and precisionSpeech recognitionThe InternetSpeech identification
The invention provides a speech recognition optimization system aiming at the using preference of regional languages, which comprises a lexicon establishment and classification module, a grammar model initialization module, a lexical database, a grammar weight calculation and grammar model generation module, a lexicon application recording module and a telephone speech recognition system. The lexicon establishment and classification module, the grammar model initialization module, the lexical database, the grammar weight calculation and grammar model generation module and the lexicon application recording module are arranged in a computer; the lexicon establishment and classification module, the grammar model initialization module, the telephone speech recognition system, the lexicon application recording module, the lexical database, the grammar weight calculation and grammar model generation module are connected in sequence and the telephone speech recognition system is connected with a speech input and output device through PSTN or the Internet. The system of the invention can effectively break through the bottle neck of algorithm optimization and is more suitable for the requirements of the application of industry.
Owner:林超

Method and apparatus for constructing new chinese words by voice input

A method and apparatus for constructing new Chinese words by voice input is disclosed. The invention provides a method of adding new words to a speech recognition system, for example, a speaker-independent Chinese speech recognition system, for updating its vocabulary database. In the invention, voice signals indicating a description of Chinese characters / syllables are input sequentially, and feature parameters are derived from the voice signals. The feature parameters are compared with a description constraint unit to determine corresponding characters or syllables. The characters or syllables are stored in a storage unit. After confirmation by users, the characters or syllables are combined into a new word.
Owner:DELTA ELECTRONICS INC

Question and answer database expansion apparatus and question and answer database expansion method

A question and answer database expansion apparatus includes: a question and answer database in which questions and answers corresponding to the questions are registered in association with each other, a first speech recognition unit which carries out speech recognition for an input sound signal by using a language model based on the question and answer database, and outputs a first speech recognition result as the recognition result, a second speech recognition unit which carries out speech recognition for the input sound signal by using a language model based on a large vocabulary database, and outputs a second speech recognition result as the recognition result, and a question detection unit which detects an unregistered utterance, which is not registered in the question and answer database, from the input sound based on the first speech recognition result and the second speech recognition result, and outputs the detected unregistered utterance.
Owner:HONDA MOTOR CO LTD

Semantically weighted searching in a governed corpus of terms

A method and system for conducting semantically weighted searches in a governed corpus of terms is provided. A search expression having a plurality of terms for performing a search in the governed corpus of terms is received. The governed corpus of terms comprises a plurality of corpus expressions each comprising a plurality of terms, each term within the governed corpus of terms being associated precisely with a single concept within a lexical database. At least one concept of the lexical database is assigned to each term in the search expression based on a syntactical analysis. A semantic similarity is calculated between pairs of concepts of the search expression and one of the corpus expressions. A total semantic similarity is calculated between the search expression and the one of the corpus expressions by aggregating the semantic similarities of the pairs of concepts based on an order of significance of the terms.
Owner:SAP AG

System and method for providing an interactive voice recognition system

A framework is described for providing a service to a customer via a Interactive Voice Recognition system (IVR) using natural language expressions. The expressions are evaluated using rules-based programming rules. Evaluated expressions determine an eligibility of a business service to be offered to a customer. Interaction with the customer comprises selecting a semantically correct natural language expression from an appropriate vocabulary database.
Owner:SBC KNOWLEDGE VENTURES LP

Statistical spell checker

Methods, systems, and computer media implement a statistical spell checker for extracting suggested spell-check candidates for a query containing an unrecognized word. Vocabulary statistics are maintained, including recording a plurality of adjacent word sequences found in a document corpus. When a user query is received that contains a word not in the vocabulary database, i.e., an unrecognized word, the vocabulary statistics are consulted to find word sequences containing the same preceding word and / or succeeding word. The found word sequences may be returned in order based upon the conditional probability that given the recognized preceding and / or succeeding word(s), the unrecognized word is meant to be the suggested spell-checked word.
Owner:CIMPRESS SCHWEIZ

A vocabulary database construction method and the corresponding hunting and comparison method for voice identification system

InactiveCN101217035AWith polyphonic word recognition functionClose to pronunciation habitsSpeech recognitionSpecial data processing applicationsAcoustic modelOperand
The invention relates to a method of building a character stock in a speech recognition system and a searching and comparing method thereof to solve the problem of calculating the same character repeatedly and reducing the whole operand. The method comprises the following steps of: 1) providing the data of polyphonic characters; 2) typing in the data; 3) building an acoustic model; 4) storing the data and the corresponding acoustic model thereof into the character stock. The character stock in the invention has the function of polyphonic characters recognition, so that the speech recognition system is closer to the pronunciation habit of the average users with human elements, therefore, enabling the users to follow the conventional pronunciation and to receive the correct recognition result.
Owner:WUDI SCI & TECH (XIAN) CO LTD

Alphanumeric message composing method using telephone keypad

An interactive method for composing an alphanumeric message by a caller using a telephone keypad includes storing (215) a lexical database (135) from which unigram probabilities, forward conditional probabilities, and backward conditional probabilities for a plurality of words can be recovered; storing a received sequence of key codes (405) representing a sequence in which keys on a telephone style keypad are keyed; generating a word trellis including candidate words (415) derived from the sequence and the lexical database; determining a most likely phrase (420) from the candidate words, the unigram probabilities, forward conditional probabilities, and backward conditional probabilities; generating a most likely message (425) from the most likely phrase and presenting the most likely message to the caller; and confirming that the most likely message is the alphanumeric message (430).
Owner:GOOGLE TECH HLDG LLC

Full-text retrieval device and method for interactive electronic technical manual of shipping equipment

The present invention discloses a full-text retrieval device for an interactive electronic technical manual of shipping equipment. The full-text retrieval device comprises a common source database, a specialized vocabulary extraction module, an abbreviation extraction module, a first segmentation module, a technical information term database, an equipment part name database, an abbreviation database, a general vocabulary database, a retrieval record database, a user retrieval command communication module, a retrieval module, a second segmentation module, an index database and an index module. Element label characteristics and document content in data module documents are composited, query is carried out by utilization of specialized vocabularies, weight of the specialized vocabularies in documents and retrieval keywords is increased, so that the system can carry out query in certain semantic levels, returned retrieved results are closer to retrieval intention of users, and therefore high recall rate and accuracy of the retrieval system are ensured.
Owner:NAVAL UNIV OF ENG PLA

System and method for searching word through word meaning based on computer network

InactiveCN101488130AAchieve precise matchingAvoid search resultsSpecial data processing applicationsWeb siteSearch words
The invention provides a word meaning searching system based on a computer network and a method thereof. The system comprises a server end which is provided with a database and a client end. The database comprises a Chinese character database comprising simplified Chinese characters and complex Chinese characters, a Chinese attribute database, a lexical database and a lexical attribute database. The client end is provided with a word meaning searching condition input device comprising a lexical meaning input frame and a natural classification selection frame. The server end is provided with a searching device for word meaning searching, a working platform for the processing of database information, a publishing platform for publishing database information by a website to users, a monitoring working platform, a management platform of the publishing platform and a sequencing selection module and a screening selection module for sequencing and screening primary searching results. The method comprises the following steps: users on the client end enter the server end, display a word searching condition selection frame on a page, input searching conditions and sequence and screen the primary searching results. With the invention adopted, words under the condition of appointed meaning and natural classification can be obtained.
Owner:文小凡 +1

Succession chinese character input method, electronic product for use in succession chinese character input method

A succession Chinese character input method comprising the steps of: (a) inputting a Chinese character, and (b) fetching succession Chinese characters relative to the inputted Chinese character from a database of vocabulary of common succession Chinese characters and/or a database of vocabulary of self-edited succession Chinese characters, (c) and/or displaying fetched succession Chinese characters on a display screen, (d) selecting the desired succession Chinese character from the display screen for input. The database of vocabulary of common succession Chinese characters is prepared by collecting common succession Chinese characters subject to the characteristic that there is a successive relationship between every two concatenate Chinese characters in a Chinese sentence. The database of vocabulary of self-edited succession Chinese characters is prepared by collecting the inputted succession Chinese character which is automatically stored in the database of vocabulary of self-edited succession Chinese characters for further selection when no succession Chinese character was obtained from the database of vocabulary of common succession Chinese characters after input of one Chinese character. Whenever a Chinese character is inputted, the recently selected succession Chinese character of the same inputted Chinese character will be displayed on the first place. So as to increase the chance of inputting the Chinese characters by selecting Chinese characters from the display screen directly and to achieve a simple, convenient and fast input effect.
Owner:CHENG YU CHIH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products