495 results about "Synonym" patented technology

A synonym is a word or phrase that means exactly or nearly the same as another lexeme (word or phrase) in the same language. Words that are synonyms are said to be synonymous, and the state of being a synonym is called synonymy. For example, the words begin, start, commence, and initiate are all synonyms of one another. Words are typically synonymous in one particular sense: for example, long and extended in the context long time or extended time are synonymous, but long cannot be used in the phrase extended family. Synonyms with exactly the same meaning share a seme or denotational sememe, whereas those with inexactly similar meanings share a broader denotational or connotational sememe and thus overlap within a semantic field. The former are sometimes called cognitive synonyms and the latter, near-synonyms, plesionyms or poecilonyms.

Context vector generation and retrieval

Inactive | US7251637B1 | Reduce search time | Rapid positioning | Digital computer details | Biological neural network models | Co-occurrence | Document preparation

A system and method for generating context vectors for use in storage and retrieval of documents and other information items. Context vectors represent conceptual relationships among information items by quantitative means. A neural network operates on a training corpus of records to develop relationship-based context vectors based on word proximity and co-importance using a technique of “windowed co-occurrence”. Relationships among context vectors are deterministic, so that a context vector set has one logical solution, although it may have a plurality of physical solutions. No human knowledge, thesaurus, synonym list, knowledge base, or conceptual hierarchy, is required. Summary vectors of records may be clustered to reduce searching time, by forming a tree of clustered nodes. Once the context vectors are determined, records may be retrieved using a query interface that allows a user to specify content terms, Boolean terms, and / or document feedback. The present invention further facilitates visualization of textual information by translating context vectors into visual and graphical representations. Thus, a user can explore visual representations of meaning, and can apply human visual pattern recognition skills to document searches.
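The "windowed co-occurrence" idea in this abstract can be illustrated with a minimal counting sketch. It is not the patented neural-network training procedure: it only counts which words appear within a fixed window of each other, and the function name, window size, and toy corpus are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def windowed_cooccurrence(tokens, window=2):
    """Count, for each word, the words seen within `window` positions of it."""
    counts = defaultdict(Counter)
    for i, word in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is not its own neighbour
                counts[word][tokens[j]] += 1
    return counts


# Hypothetical toy corpus; each Counter acts as a sparse context vector.
corpus = "the quick car passed the fast automobile".split()
vectors = windowed_cooccurrence(corpus, window=1)
print(vectors["the"])
```

Words that occur in similar windows end up with similar count vectors, which is the intuition behind comparing context vectors without a thesaurus or synonym list.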

Owner:FAIR ISAAC & CO INC

Determining query term synonyms within query context

Active | US7636714B1 | Improve user query | Improved synonym selection | Data processing applications | Digital data information retrieval | Search terms | Synonym

A method is applied to search terms for determining synonyms or other replacement terms used in an information retrieval system. User queries are first sorted by user identity and session. For each user query, a plurality of pseudo-queries is determined, each pseudo-query derived from a user query by replacing a phrase of the user query with a token. For each phrase, at least one candidate synonym is determined. The candidate synonym is a term that was used within a user query in place of the phrase, and in the context of a pseudo-query. The strength or quality of candidate synonyms is evaluated. Validated synonyms may be either suggested to the user or automatically added to user search strings.
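The pseudo-query step can be sketched as follows. This is a simplified illustration, not the patented method: phrases are limited to single words, the placeholder token `<*>` and all function names are assumptions, and the strength evaluation of candidates is omitted.

```python
from collections import defaultdict
from itertools import combinations

TOKEN = "<*>"  # hypothetical placeholder that stands in for the replaced phrase


def pseudo_queries(query):
    """Yield (pseudo_query, replaced_word) pairs, replacing one word at a time."""
    words = query.split()
    for i, word in enumerate(words):
        yield " ".join(words[:i] + [TOKEN] + words[i + 1:]), word


def candidate_synonyms(session_queries):
    """Pair up words that fill the same pseudo-query slot within one session."""
    contexts = defaultdict(set)
    for query in session_queries:
        for pseudo, word in pseudo_queries(query):
            contexts[pseudo].add(word)
    pairs = set()
    for words in contexts.values():
        pairs.update(combinations(sorted(words), 2))
    return pairs


# Hypothetical queries issued by one user in one session.
session = ["cheap car rental", "cheap auto rental"]
print(candidate_synonyms(session))
```

Both queries reduce to the pseudo-query `cheap <*> rental`, so "auto" and "car" become a candidate pair that a later validation step could accept or reject.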

Owner:GOOGLE LLC

Transliteration for query expansion

Active | US20100017382A1 | Better search result | Poor coverage | Natural language translation | Web data indexing | Transliteration | Theoretical computer science

Methods, systems, and apparatus, including computer program products, for identifying candidate synonyms of transliterated terms for query expansion. In one aspect, a method includes identifying multiple transliterated terms in a target language. For each transliterated term of the multiple transliterated terms in the target language, the transliterated term is mapped to one or more terms in a source language. For a first transliterated term of the multiple transliterated terms in the target language, one or more second transliterated terms of the multiple transliterated terms in the target language are identified as candidate synonyms of the first transliterated term, where each of the one or more second transliterated terms is mapped to at least one term in the source language that is also mapped from the first transliterated term.
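The grouping rule in this abstract (two transliterated terms are candidate synonyms when they map to a common source-language term) can be sketched as an inverted mapping. The function name and the sample spellings are made up for illustration; how the mapping itself is obtained is outside this sketch.

```python
from collections import defaultdict
from itertools import combinations


def transliteration_synonyms(translit_to_source):
    """Pair transliterated terms that share at least one source-language term."""
    by_source = defaultdict(set)
    for translit, sources in translit_to_source.items():
        for source in sources:
            by_source[source].add(translit)
    pairs = set()
    for group in by_source.values():
        pairs.update(combinations(sorted(group), 2))
    return pairs


# Hypothetical mapping from target-language spellings to source-language terms.
mapping = {
    "komputa": {"computer"},
    "konpyuta": {"computer"},
    "terebi": {"television"},
}
print(transliteration_synonyms(mapping))
```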

Owner:GOOGLE LLC

Term synonym acquisition method and term synonym acquisition apparatus

Inactive | US20150006157A1 | Reduce the impact | Improve accuracy | Natural language translation | Digital data information retrieval | Synonym | Auxiliary system

A term synonym acquisition apparatus includes: a first generating unit which generates a context vector of an input term in an original language and a context vector of each synonym candidate in the original language; a second generating unit which generates a context vector of an auxiliary term in an auxiliary language that is different from the original language, where the auxiliary term specifies a sense of the input term; a combining unit which generates a combined context vector based on the context vector of the input term and the context vector of the auxiliary term; and a ranking unit which compares the combined context vector with the context vector of each synonym candidate to generate ranked synonym candidates in the original language.
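A rough sketch of the combine-and-rank step follows. The abstract does not say how the vectors are combined or compared, so this sketch assumes that the auxiliary-language vector has already been projected into the same feature space, that combining means element-wise addition, and that ranking uses cosine similarity. All names and numbers are illustrative.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(value * v.get(key, 0.0) for key, value in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def combine(u, v):
    """Assumed combination rule: element-wise sum of the two vectors."""
    return {k: u.get(k, 0.0) + v.get(k, 0.0) for k in set(u) | set(v)}


def rank_candidates(input_vec, auxiliary_vec, candidates):
    """Order candidate names by similarity to the combined context vector."""
    combined = combine(input_vec, auxiliary_vec)
    return sorted(candidates, key=lambda name: cosine(combined, candidates[name]), reverse=True)


# Hypothetical vectors: "bank" is ambiguous; the auxiliary term picks the money sense.
bank = {"money": 1.0, "river": 1.0}
auxiliary = {"money": 1.0, "loan": 1.0}
candidates = {"shore": {"river": 1.0, "water": 1.0}, "lender": {"money": 1.0, "loan": 1.0}}
print(rank_candidates(bank, auxiliary, candidates))
```

The auxiliary vector shifts weight toward the intended sense, so the candidate that matches that sense ranks first.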

Owner:NEC CORP

Operating system and method of operating

Active | US20130103405A1 | Improve convenience | Improve accuracy | Semantic analysis | Road vehicles traffic control | Programming language | Operational system

An operation determination processing section of a center extracts words included in the utterance of a driver and an operator, reads an attribute associated with each word from a synonym and related word in which an attribute is stored so as to be associated with each word, reads a domain of a candidate or the like for the task associated with the attribute from the synonym and related word in which domains of a candidate for a task associated with the read attribute or domains of a task to be actually performed are stored, totals the domains read for each word for words included in the utterance of the driver or the like, and estimates those related to a domain with a highest total score as the candidate for the task and the task to be actually performed. In this manner, it is possible to estimate the task with high accuracy.
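The word-to-attribute-to-domain scoring described here can be sketched with two lookup tables and a tally. The tables, their contents, and the one-point-per-hit scoring are assumptions for illustration; the abstract does not specify them.

```python
from collections import Counter

# Hypothetical lookup tables standing in for the synonym and related-word store.
WORD_ATTRIBUTES = {"hungry": "meal", "restaurant": "meal", "gas": "fuel"}
ATTRIBUTE_DOMAINS = {"meal": ["restaurant_search"], "fuel": ["gas_station_search"]}


def estimate_task(utterance):
    """Total domain scores over the words of an utterance and return the top domain."""
    scores = Counter()
    for word in utterance.lower().split():
        attribute = WORD_ATTRIBUTES.get(word)
        if attribute is None:
            continue  # word carries no known attribute
        for domain in ATTRIBUTE_DOMAINS.get(attribute, []):
            scores[domain] += 1
    return scores.most_common(1)[0][0] if scores else None


print(estimate_task("I am hungry find a restaurant"))
```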

Owner:TOYOTA JIDOSHA KK

Internet searching using semantic disambiguation and expansion

Inactive | US20050080776A1 | Data processing applications | Web data indexing | Internet searching | Relevant information

The invention provides a system and a method of searching for information in a database using a query. The method comprises the steps of: disambiguating the query to identify keyword senses associated with the query; disambiguating information in the database according to the keyword senses; indexing the information in the database according to the keyword senses; expanding the keyword senses to include relevant semantic synonyms for the keyword senses to create a list of expanded keyword senses; searching the database to find relevant information for the query using the expanded keyword senses; and providing search results of the included information containing the keyword senses and other semantically related word senses. The system comprises modules which disambiguate queries and information and index the information in a database of word senses.

Owner:IDILIA

Technique for relationship discovery in schemas using semantic name indexing

Inactive | US20060253476A1 | Digital data processing details | Text processing | Semantic matching | Data mining

Techniques are provided for semantic matching. A semantic index is created for one or more schemas, wherein each of the one or more schemas includes one or more word attributes, and wherein each of the one or more word attributes includes one or more tokens, wherein the semantic index identifies one or more keys and one or more values for each key, wherein each value specifies one of the one or more schemas, a word attribute from the specified schema, and a token of the specified word attribute, and wherein the specified token is a synonym of the key. For a source word attribute from one of the one or more schemas, the source word attribute is used as a key to index the semantic index to identify one or more matching word attributes.
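One way to picture the semantic index is as a dictionary whose keys are words and whose values are (schema, attribute, token) triples in which the token is a synonym of the key. The sketch below assumes attributes are split into tokens on underscores and uses a made-up synonym table; neither detail comes from the abstract.

```python
from collections import defaultdict

# Hypothetical synonym table; a real system would draw on a thesaurus.
SYNONYMS = {
    "client": {"customer"},
    "customer": {"client"},
    "id": {"identifier"},
    "identifier": {"id"},
}


def build_semantic_index(schemas):
    """Map each key to (schema, attribute, token) triples whose token is a synonym of the key."""
    index = defaultdict(list)
    for schema, attributes in schemas.items():
        for attribute in attributes:
            for token in attribute.lower().split("_"):
                for key in SYNONYMS.get(token, ()):
                    index[key].append((schema, attribute, token))
    return index


def matching_attributes(index, source_attribute):
    """Use the tokens of a source attribute as keys to find matching attributes."""
    found = set()
    for token in source_attribute.lower().split("_"):
        for schema, attribute, _ in index.get(token, []):
            found.add((schema, attribute))
    return found


schemas = {"crm": ["client_id"], "billing": ["customer_identifier"]}
index = build_semantic_index(schemas)
print(matching_attributes(index, "client_id"))
```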

Owner:IBM CORP

Method and apparatus for identifying documents relevant to a search query in a medical information resource

Inactive | US20070088695A1 | Improve accuracy | Multiplier can be high | Digital data information retrieval | Special data processing applications | Information resource | Computerized system

A computerized system and method for providing information for use in medical care. Documents in a medical information resource may have several associated sections, such as title, headings, text, keyword and document type sections. Display of search results resulting from a user's query may be determined based on at least one document section in which the search engine identifies at least one search term. The search engine may generate a set of search terms for identifying documents relevant to a user's query, at least in part, by using a search term synonym resource that includes a plurality of search terms arranged in groups of associated synonyms. Synonyms in an associated group may be arranged in a hierarchical structure such that each synonym in the associated group has a parent, sibling or child relationship with each other synonym in the associated group.

Owner:UPTODATE

Synonym extension of search queries with validation

Inactive | US7120574B2 | Data processing applications | Natural language data processing | Subject matter | Web search query

A computer search involves expanding a user query with two synonym dictionaries—actions and object—and then validating the expanded queries by comparison with entries in a Subject-Action-Object Knowledge Database (SAO KB) in a discipline corresponding to the query. The latter is prepared from natural language texts and contains fields with subjects, actions, objects, and “main parts of objects” extracted from the object.
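The expand-then-validate flow can be sketched with two small synonym dictionaries and a set of Subject-Action-Object triples. The dictionary contents, the triple format, and the rule that a pair is valid when some triple contains it are all assumptions for illustration.

```python
from itertools import product

# Hypothetical synonym dictionaries and knowledge base entries.
ACTION_SYNONYMS = {"heat": ["warm"]}
OBJECT_SYNONYMS = {"water": ["liquid"]}
SAO_KB = {("heater", "warm", "water"), ("laser", "cut", "metal")}


def expand_and_validate(action, obj):
    """Expand (action, object) with synonyms and keep only pairs found in the SAO KB."""
    actions = [action] + ACTION_SYNONYMS.get(action, [])
    objects = [obj] + OBJECT_SYNONYMS.get(obj, [])
    known_pairs = {(a, o) for _, a, o in SAO_KB}
    return [(a, o) for a, o in product(actions, objects) if (a, o) in known_pairs]


print(expand_and_validate("heat", "water"))
```

Validation discards expansions such as "heat liquid" that no text in the knowledge base supports, which is what keeps synonym expansion from flooding the query with implausible variants.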

Owner:ALLIUM US HLDG LLC

Question and answer method based on knowledge graph, and agricultural encyclopedia question and answer system

Active | CN108804521A | Improve satisfaction | Data processing applications | Natural language data processing | Query statement | Encyclopedia

The invention provides a question and answer method based on a knowledge graph, and an agricultural encyclopedia question and answer system. A natural language question raised by a user can be automatically analyzed; a topological structure based on a syntax tree is formed; retrieval and comparison are carried out through the topological structure and a question template in a grammar library; according to a mapping relation between the topological structure and a predicate nominatum, and a mapping relation between a synonym set and a relation or an attribute in the knowledge graph, a question-mapped predicate is obtained; in combination with an entity identified in the question, a final structured knowledge graph query statement is generated; retrieval is carried out in the knowledge graph according to the query statement; and a final result is returned. When the relevant topological structure cannot be retrieved in a question template library, the question answering is carried out by calling common question-answer pairs of an FAQ question library. The question and answer system can give accurate answer retrieval for the question posed by the user, so that the satisfaction degree of the user to the agricultural encyclopedia question retrieval is improved.

Owner:南京柯基数据科技有限公司

Automatically finding acronyms and synonyms in a corpus

Active | US20090006359A1 | Search results are accurate | Digital data information retrieval | Digital data processing details | Computer science | Synonym

Acronym and synonym pairs can be identified and retrieved automatically in a corpus and / or across an enterprise based on customer settings globally or for a single instance. Possible acronym and synonym term pairs can be identified using a rule such as a heuristic, user-defined rule. Rules selected by the user can be used to rank acronym and synonym pairs using factors such as occurrence frequency and maximum term length. A rule interpreter engine executes the user defined rule set to properly identify and retrieve the user selected acronym and synonym pairs through the utilization of a shallow pause read step. Finally, the user selected acronym and synonym pairs are ranked according to the user preferences, and can be displayed or held for subsequent use in searching.

Owner:ORACLE INT CORP

Full text retrieval system based on natural language

Inactive | CN101246492A | Intelligent Information Service | Convenient information service | Natural language data processing | Special data processing applications | Natural language understanding | Concept search

The invention discloses a full text retrieval system based on natural language understanding, comprising: a database server, an information receiving judging module, a natural language processing module, a retrieving module, an indexing module, an index database and a result set processing module. The system of the invention provides two resolution strategies, that is, word classification static with semantic analysis associated with automatic segmentation and expanding inquired word static according to Hownet rule for low intelligence situation of current search engine. The deployed system converts information retrieval from current key word-based layer to knowledge (or concept)-based layer; the invention is capable of using techniques such as word classification, synonym, concept search, phrase identification, etc. with understanding and processing ability to knowledge. The search engine is provided with intelligence and humanization of information service. The user is allowed using natural language for information retrieval. The invention is capable of adding user selection behavior in interactive operation mode, so as to provide more convenient, more precise search service.

Owner:HUAZHONG UNIV OF SCI & TECH

Apparatus for automatic theme detection from unstructured data

Active | US20130268534A1 | Natural language translation | Semantic analysis | Subject matter | Unstructured data

This apparatus provides a system and method of determining significant repeating themes in a collection of documents. The apparatus operates unsupervised and leverages a natural language processing mechanism supported with lexicon, synonym and taxonomy dictionaries to determine themes and establish their relevance using a two-level hierarchical structure. The apparatus also assigns meaningful names to identified themes and determines a set of rules that describe the theme such that it can be applied to categorize other documents outside of the collection as well.

Owner:CLARABRIDGE

Method of self enhancement of search results through analysis of system logs

Inactive | US20050065774A1 | Web data indexing | Text database indexing | Eeg data | Query analysis

An automatic search index/metadata self-enhancement system includes a search system log analyzer, which periodically looks through the search system log of a database for search queries that did not bring satisfactory results; a search query analyzer, which applies query enhancement techniques to the unsatisfactory queries by using glossary terms, synonyms, known typos, translated words, etc. to enhance the queries and categorize them; a relevant document finder which, based on the enhanced query terms and their categorization and subject, uncovers documents that were not previously found and links the documents to the query terms in the search index; and a search index/metadata enhancer, which enhances the metadata of the documents based on the enhanced query terms in the search index to reflect these new keywords, so that documents turned up by the enhanced query are returned when users enter similar searches in the future.

Owner:IBM CORP

Device and method for hiding information and device and method for extracting information

Inactive | US7167825B1 | Improve robustness | Easy to identify | Data stream serial/continuous modification | Secret communication | Paraphrase | Language analysis

A device for hiding information in a text comprises a mechanism for providing the text and means for linguistically analyzing the text to produce text components, for determining a plurality of formulation alternatives for the text by varying the order of the text components and, optionally, by using synonyms for text components, and for determining that every formulation alternative is grammatically correct for the text and has essentially the same meaning as the text. Certain partial information is allocated to every sequence and/or to every synonym or to every paraphrase.

Owner:POTTER THOMAS

Ensuring that a synonym for a query phrase does not drop information present in the query phrase

Active | US8661012B1 | Avoid poor results | Digital data information retrieval | Digital data processing details | System identification | Synonym

One embodiment of the present invention provides a system that identifies a synonym for a query phrase in a manner that ensures that the synonym does not drop information from the query phrase. First, the system identifies a synonym for the query phrase and synonyms for sub-components of the query phrase. If the identified synonym for the query phrase is also a synonym for a subcomponent of the query phrase, the system does not use the identified synonym as a synonym for the query phrase.
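The filtering rule stated in this abstract is small enough to sketch directly. The sketch treats each individual word of the phrase as a sub-component and uses a made-up synonym table; both are simplifying assumptions.

```python
def safe_phrase_synonyms(phrase, synonyms):
    """Drop phrase-level synonyms that are also synonyms of a single sub-component."""
    phrase_synonyms = synonyms.get(phrase, set())
    component_synonyms = set()
    for word in phrase.split():
        component_synonyms |= synonyms.get(word, set())
    return phrase_synonyms - component_synonyms


# Hypothetical synonym table keyed by phrase or word.
table = {
    "new york hotels": {"nyc hotels", "lodging"},
    "hotels": {"lodging", "inns"},
}
print(safe_phrase_synonyms("new york hotels", table))
```

Here "lodging" is rejected because it is already a synonym of "hotels" alone, so using it for the whole phrase would silently drop "new york".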

Owner:GOOGLE LLC

Information retrieval system with a neuro-fuzzy structure

Inactive | US6845354B1 | Increase flexibility | Data processing applications | Natural language data processing | Parallel processing | Information index

An intelligent information retrieval system for finding information components corresponding to an input query word. The system includes a synonym block for finding synonymous indexed keywords of the query word; an information indexing block for finding corresponding information components based on the indexed keyword; a component ranking-filtering block for ranking and filtering the found information components and outputting the desired information components being selected; a synonym adjusting block for adjusting the fuzzy mechanism of the synonym block based on the found information components; and a filtering adjusting block for adjusting the fuzzy mechanism of the ranking-filtering block based on the found information components. The aforementioned synonym block and information-indexing block are implemented by neuro-fuzzy networks for accelerating parallel processing and automatic learning. Further, the synonym block can tolerate input errors by way of query word encoding and position shift compensation.

Owner:INSTITUTE FOR INFORMATION INDUSTRY

Method and device for processing medical intelligent question and answer data

Active | CN107993724A | Medical data mining | Semantic analysis | Questions and answers | Retrieval result

The invention provides a method and a device for processing medical intelligent question and answer data, and relates to the technical field of intelligent questions and answers. The method includes extracting question keywords in user query question data; performing synonymy transformation, and determining synonym sets of the question keywords; performing matching lookup in a preset question and answer knowledge base and a preset rule knowledge base according to the synonym sets of the question keywords; if succeeded, outputting answer result data corresponding to the question keywords and synonyms thereof in the synonym sets of the question keywords; if failed, subjecting the question keywords to semantic extension to acquire the question keywords and the synonym sets of hyponyms of the synonyms; continuing matching lookup according to the question keywords and the synonym sets of the hyponyms of the synonyms, and generating a retrieval result list containing retrieval results; performing similarity calculation on the retrieval results in the retrieval result list to determine similarities among the retrieval results; sequencing and outputting the retrieval results according to the similarities of the retrieval results.

Owner:易保互联医疗信息科技(北京)有限公司

Multilayer quotation recommendation method based on literature content mapping knowledge domain

Active | CN105653706A | Improve the efficiency of obtaining citations | Express research topics | Special data processing applications | Information processing | Data set

The invention discloses a multilayer quotation recommendation method based on a literature content mapping knowledge domain, and belongs to the field of information recommendation and intelligent information processing. The method comprises the following steps: firstly, obtaining the query requirement of a user, wherein the query requirement consists of the key words of the title and the digest of a thesis which needs to recommend a quotation thesis or quotation literature; then, on the basis of the literature content mapping knowledge domain, expanding and querying a retrieval word, wherein the mapping knowledge domain consists of the research object word and the research behavior word node of the literature, and edges which express various semantic relations including synonymy, synonym, an up and down position, part-whole, juxtaposition and the like; and finally, constructing the inverted index of the literature in a data set, selecting a candidate quotation, calculating the similarity between the candidate quotation and query, and adopting a gradient progressive regression tree to carry out quotation recommendation. The method carries out multilayer quotation recommendation on the basis of the literature content mapping knowledge domain, enlarges the range of the candidate quotation, accurately expresses the research object and contents of the thesis, improves efficiency for users to obtain a relevant literature and has a wide application prospect.

Owner:BEIJING INSTITUTE OF TECHNOLOGY

Method for inputting word related to candidate word in input method and system

Active | CN101183281A | Expand your vocabulary | Rich language | Special data processing applications | Input/output processes for data processing | User input | Speech sound

The invention provides an input method for relevant words of a candidate word in input method and the system of the input method, belonging to the technical field of computers; wherein the method comprises the following steps: the input candidate word is received; the relevant words of the candidate word are searched from a preset relevant word lexicon according to the candidate word, wherein the relevant words of the candidate word are one or more of near-synonym, synonym or antonym of the candidate word; the relevant words of candidate word are displayed; the selection instruction of the relevant words of the input candidate word is received, and the relevant word of candidate word corresponding to the instruction is output. The invention has the advantages that: when users input a candidate word, the relevant words of the candidate word can be searched in the relevant word lexicon according to the candidate word, therefore the vocabulary of users is enlarged, the speech talent and the language expression of users are improved, and the language of users is more abundant and more individualized.

Owner:SHENZHEN SHI JI GUANG SU INFORMATION TECH

Electronic medical record text structuring method

Inactive | CN106095913A | Efficient extraction | Data processing applications | Natural language data processing | Medical record | Treatment effect

The invention discloses an electronic medical record text structuring method. The method comprises the following steps that 1, a medical knowledge database is loaded; 2, an electronic medical record text is read; 3, word segmentation is carried out on short sentences through a forwards maximum match algorithm, and words in sentences, word properties and relative position relations are obtained; 4, whether the semantics in the sentences for disease information description is positive or negative is judged; 5, disease information elements are extracted; 6, the steps 2-5 are repeated till all interesting contents in the electronic medical record are obtained; 7, different expressions of the disease information elements are combined, the same disease information is combined according to a medical synonym bank, and redundant information is removed; 8, disease description information elements are stored in a structure / class mode, the structuring process is finished, disease relevant information can be effectively extracted from medical record description text, structural expression for disease information is formed, and therefore disease occurrence regularity, definite diagnosis ways, treatment effects and the like are deeply explored.
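Step 3 names a forward maximum match algorithm for word segmentation. A generic sketch of that algorithm is shown below; it is not taken from the patent, and the dictionary, the `max_len` limit, and the unspaced English sample (standing in for unsegmented medical-record text) are assumptions for illustration.

```python
def forward_max_match(text, dictionary, max_len=5):
    """Segment text greedily, taking the longest dictionary word at each position."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest window first and shrink until a dictionary word matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                # A single character is emitted as a fallback when nothing matches.
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical medical vocabulary.
vocabulary = {"no", "fever", "or", "cough"}
print(forward_max_match("nofeverorcough", vocabulary))
```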

Owner:广州同构科技有限公司

Clustering method for question sentences in question-and-answer platform and system thereof

Inactive | CN101630312A | Easy access | Accurately obtained | Special data processing applications | Cluster algorithm | Sentence analysis

The invention discloses a clustering method for question sentences in a question-and-answer platform and a system thereof. The technical scheme is as follows: the question sentences in the question-and-answer platform are analyzed according to their semantic features to obtain analysis results; the semantic features comprise the question type and the comparison feature of the question sentences and a thesaurus correlated with the content of the question sentences; and for the question sentences analyzed by semantic feature, a clustering algorithm that evaluates the semantic similarity of the question sentences is adopted to obtain clustering results of the question sentences in the question-and-answer platform. The system comprises a question sentence analysis module and a clustering algorithm module. The prior art lacks a clustering method and system for question sentences in a question-and-answer platform; the technical scheme of the invention fills this gap, providing fast and accurate clustering in the question-and-answer platform and improving user experience.

Owner:TENCENT TECH (SHENZHEN) CO LTD

Method and system for uploading files

Active | CN102868765A | Optimize upload method | Support automatic identification function | Transmission | Special data processing applications | File verification | Synonym

The invention relates to a method and a system for uploading files. In the method, a user selects files to be uploaded and submits the hash value of the files to be uploaded. A file upload request based on HTTP POST is sent to a server. The file type of the files to be uploaded is checked; if the type is accepted, the next step is carried out, and otherwise a failure warning is returned to the user and the upload ends. The method further includes recognizing the content of the files to be uploaded; if identical files exist on the server, reminding the user that identical files exist, returning the pre-existing file address and finishing the upload; if no identical files exist on the server, starting the upload, calculating the hash value of the uploaded files and comparing it with the submitted hash value; if the values are identical, reporting that the upload succeeded, returning the file address to the user and finishing the upload; and if the values are not identical, returning an upload failure warning and finishing the upload. The method and the system resolve the problems of synonym file re-uploading, file type judgment, file verification and the like.
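The duplicate check and hash verification in this flow can be sketched in memory. The abstract does not name a hash function, so SHA-256 is an assumption, as are the class name, the address format, and the status strings; the file type check and the actual HTTP transfer are left out.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Hash helper; SHA-256 is an assumed choice, not one named in the abstract."""
    return hashlib.sha256(data).hexdigest()


class UploadServer:
    """Hypothetical in-memory stand-in for the server side of the flow."""

    def __init__(self):
        self.address_by_hash = {}

    def upload(self, data: bytes, submitted_hash: str):
        # Identical content already stored: return the existing address.
        if submitted_hash in self.address_by_hash:
            return "duplicate", self.address_by_hash[submitted_hash]
        # Verify the received bytes against the hash the client submitted.
        if sha256_hex(data) != submitted_hash:
            return "failed", None
        address = "/files/" + submitted_hash[:12]
        self.address_by_hash[submitted_hash] = address
        return "stored", address


server = UploadServer()
payload = b"hello"
print(server.upload(payload, sha256_hex(payload)))
print(server.upload(payload, sha256_hex(payload)))
```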

Owner:LETV CLOUD COMPUTING CO LTD

Method and system for linking entities

Active | CN106202382A | Links are fast and accurate | Text database indexing | Special data processing applications | Entity linking | Contextual similarity

The invention discloses a method and a system for linking entities. The method includes: acquiring to-be-linked entities from given texts; acquiring entity names and abbreviation word banks from preset knowledge bases and establishing synonym banks of the entity names on the basis of those knowledge bases; searching the synonym banks using entity keywords; linking the entity keyword used for the search to the corresponding entity name in the preset knowledge bases if a matching entry is found in the synonym banks; and, if no matching entry is found, generating candidate entities and performing disambiguation linking by evaluating context similarity. The synonym banks contain the entity names acquired from the preset knowledge bases and information data related to those names. The entity keywords are obtained by word segmentation and are used as search terms. The entity names in the knowledge bases correspond to the entries. The method and the system can improve entity linking accuracy.
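
A minimal sketch of the lookup-then-disambiguate flow described above. Everything named here is an assumption for illustration: the knowledge base layout (`aliases`, `description`), candidate generation by substring match, and Jaccard word overlap as the context similarity measure are stand-ins, not the patent's actual components.

```python
def build_synonym_bank(knowledge_base):
    """Map every name and alias (lower-cased) to its canonical entity name."""
    bank = {}
    for name, info in knowledge_base.items():
        for alias in [name, *info.get("aliases", [])]:
            bank[alias.lower()] = name
    return bank


def context_similarity(context, description):
    """Jaccard overlap of word sets; a simple stand-in similarity measure."""
    a = set(context.lower().split())
    b = set(description.lower().split())
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def link_entity(keyword, context, knowledge_base, bank):
    # Direct hit in the synonym bank: link immediately.
    hit = bank.get(keyword.lower())
    if hit is not None:
        return hit
    # Otherwise generate candidates (substring match is an assumption)
    # and pick the one whose description best matches the context.
    candidates = [n for n in knowledge_base if keyword.lower() in n.lower()]
    if not candidates:
        return None
    return max(
        candidates,
        key=lambda n: context_similarity(context, knowledge_base[n]["description"]),
    )
```

The synonym bank handles abbreviations cheaply, so the more expensive similarity comparison only runs for keywords that remain ambiguous.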

Owner:南京柯基数据科技有限公司

Second language writing advisor

Active | US20070033002A1 | Increase choice | Improve the second language text | Natural language translation | Special data processing applications | Semantic feature | Computer science

A writing advisor program (20) receives a proposed text in an author's second language (L2) and determines at least one candidate replacement word for a selected word based on a determined language model (p(c)) and a determined corruption model (p(r|c)). The determined language model reflects correct usage of the text in the second language, independent of the native or first language (L1) of the author, based on (L2) corpora. The determined corruption model is based on some a priori knowledge about probable corruption paths leading the author to realize some inadequate expression in the second language instead of the correct, intended expression. Different types of corruption paths may be used that include bidirectional translations, false-friends, synonyms, common semantic features, second language internal cognates, preposition alternatives, and first language inserts.
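
The combination of a language model p(c) and a corruption model p(r|c) can be read as a noisy-channel ranking: each candidate c for the written word r is scored by p(c) · p(r|c). The sketch below illustrates only that scoring step. The function name, the dictionary representation of both models, and the probabilities used in the usage note are invented for illustration; in the described system p(c) would depend on the surrounding text.

```python
def rank_corrections(observed, language_model, corruption_model):
    """Rank candidate words c for an observed word r by p(c) * p(r | c).

    language_model:   {candidate: p(c)}
    corruption_model: {(observed, candidate): p(r | c)}
    Both are toy lookup tables standing in for the real models.
    """
    scores = {}
    for candidate, p_c in language_model.items():
        p_r_given_c = corruption_model.get((observed, candidate), 0.0)
        if p_r_given_c > 0:
            scores[candidate] = p_c * p_r_given_c
    # Highest combined probability first.
    return sorted(scores, key=scores.get, reverse=True)
```

For example, with a hypothetical false-friend path where an author writes "actually" while meaning "currently", a table such as `{"currently": 0.7, "actually": 0.3}` for p(c) and equal corruption probabilities for both paths ranks "currently" first.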

Owner:III HLDG 6

Word Use Difference Information Acquisition Program and Device

Inactive | US20090089046A1 | Easy to understand | Natural language data processing | Special data processing applications | General purpose | Frequent use

A device or computer implemented program for accurately and automatically obtaining general-purpose information regarding the usage difference between a plurality of synonyms and quasi-synonyms, such as the types of words with which the synonyms and quasi-synonyms are often used, is provided with: means for receiving the input of a plurality of words; means for extracting sentence data including an inputted word from a corpus; means for analyzing the sentence structure of the sentence data and extracting nouns that are in a grammatical relationship with the inputted word included in the sentence data; means for extracting the nodes representing the nouns and the nodes representing the semantic category of the noun from a thesaurus and forming a directional graph for each inputted word; means for comparing a plurality of directional graphs and extracting the difference nodes; and means for outputting the extracted difference nodes as information relating to the usage difference of the inputted words.
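
The graph comparison step can be sketched as follows, assuming the co-occurring nouns for each input word have already been extracted from the corpus. The thesaurus is modelled as a hypothetical child-to-parent dictionary with no cycles; the function names and the toy data in the usage note are assumptions, not part of the described device.

```python
def category_graph(nouns, thesaurus):
    """Build directed (category, member) edges by walking each noun up the thesaurus.

    thesaurus maps a node to its parent category and is assumed to be acyclic.
    """
    edges = set()
    for noun in nouns:
        node = noun
        while node in thesaurus:
            parent = thesaurus[node]
            edges.add((parent, node))
            node = parent
    return edges


def difference_nodes(graph_a, graph_b):
    """Return the nodes that appear only in graph_a and only in graph_b."""
    nodes_a = {n for edge in graph_a for n in edge}
    nodes_b = {n for edge in graph_b for n in edge}
    return nodes_a - nodes_b, nodes_b - nodes_a
```

With a toy thesaurus in which "meeting" and "race" fall under "event" while "engine" and "car" fall under "machine", comparing the graphs for two near-synonymous verbs surfaces which noun categories each verb is used with exclusively.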

Owner:NAT INST OF INFORMATION & COMM TECH

Second language writing advisor

Active | US7664629B2 | Improve the second language text | Natural language translation | Special data processing applications | Semantic feature | Synonym

A writing advisor program (20) receives a proposed text in an author's second language (L2) and determines at least one candidate replacement word for a selected word based on a determined language model (p(c)) and a determined corruption model (p(r|c)). The determined language model reflects correct usage of the text in the second language, independent of the native or first language (L1) of the author, based on (L2) corpora. The determined corruption model is based on some a priori knowledge about probable corruption paths leading the author to realize some inadequate expression in the second language instead of the correct, intended expression. Different types of corruption paths may be used that include bidirectional translations, false-friends, synonyms, common semantic features, second language internal cognates, preposition alternatives, and first language inserts.

Owner:III HLDG 6

Method for automatically creating keyword index table

Inactive | CN103064969A | Improve precision | Improve recall | Special data processing applications | Word list | Documentation

The invention discloses a method for automatically creating a keyword index table. The method includes subjecting a file to be translated to word segmentation process to obtain a word list of the file, and subjecting the word list to part-of-speech tagging; filtering candidate keywords in the word list to obtain a coarse candidate word collection and codes of each sense of the candidate keywords; subjecting the candidate keywords to synonym chain construction according to semantic similarity of the words to obtain a synonym chain collection; acquiring word weight of vocabularies in the synonym chain collection and extracting keywords to form a keyword collection according to the word weight; and comparing the keyword collection with an existing reference library keyword index collection, providing a relevant file collection if the existing reference library keyword index collection contains the candidate keywords, otherwise, adding the candidate keywords to the reference library keyword index collection, and simultaneously, creating an index. Compared with traditional keyword extraction methods, the method has the advantages that precision rates and recall rates are obviously increased.

Owner:IOL WUHAN INFORMATION TECH CO LTD

Synonym expansion method and device for search information

Active | CN108509474A | Improve rationality | Improve accuracy | Natural language data processing | Special data processing applications | Feature set | Algorithm

The invention proposes a synonym expansion method and device for search information. The method comprises the following steps: performing word segmentation on the search information to obtain at least one segmented word; obtaining the candidate synonym set of the segmented word, wherein the candidate synonym set comprises at least one synonym of the segmented word; for each synonym, forming a synonym pair containing the synonym and the segmented word; performing feature extraction on the synonym pair to obtain a synonym pair feature set; predicting, according to the feature set, a target probability that the synonym pair is a reasonable replacement; and, if the target probability exceeds a preset threshold value, forming a synonym expansion item from the segmented word and the synonym and obtaining a search result on the basis of the synonym expansion item. The method can improve the rationality and accuracy of synonym replacement as well as the recall rate and accuracy of search results, thereby addressing the prior-art problems of inaccurate synonym replacement and poor search result recall.
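
The steps above can be sketched as a small pipeline. This is an illustration only: whitespace splitting stands in for word segmentation, the features in `extract_features` are invented placeholders, and `predict_replacement_prob` is a hypothetical callable that takes a feature dictionary and returns a probability; the abstract does not specify the real features or model.

```python
def extract_features(word, synonym, query):
    """Placeholder feature set for a (word, synonym) pair; features are assumptions."""
    return {
        "word": word,
        "synonym": synonym,
        "length_gap": abs(len(word) - len(synonym)),
        "query_length": len(query.split()),
    }


def expand_query(query, candidate_synonyms, predict_replacement_prob, threshold=0.5):
    """Return (word, synonym) expansion items whose predicted probability exceeds threshold."""
    expansions = []
    for word in query.split():  # stand-in for real word segmentation
        for synonym in candidate_synonyms.get(word, []):
            features = extract_features(word, synonym, query)
            if predict_replacement_prob(features) > threshold:
                expansions.append((word, synonym))
    return expansions
```

Filtering each pair through a learned probability, rather than accepting every dictionary synonym, is what keeps a context-inappropriate synonym out of the expanded query.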

Owner:TENCENT TECH (SHENZHEN) CO LTD

Data-enhanced machine translation method based on similar word and synonym replacement

Active | CN108920473A | Improve translation quality | Alleviate the problem of unregistered words | Natural language translation | Special data processing applications | Nerve network | Algorithm

The invention belongs to the technical field of processing or transformation of natural languages, and discloses a data-enhanced machine translation method based on similar word and synonym replacement. The method exploits the fact that trained word vectors cluster well to obtain a high-quality similar word table and synonym table. The two tables are constructed from the word vectors obtained while training on a large language, and similar words and synonyms in a scarce small language are replaced; the parallel corpus of the small language is thereby expanded, and a neural machine translation model of the small language is trained using an encoder-decoder structure with an attention mechanism. Because the training data is expanded, the parameters of the neural translation model can be learned well from sufficient data and the problem of unregistered (out-of-vocabulary) words in neural machine translation is alleviated, so the translation quality of the model improves. When the translation quality of the whole network on a development set no longer improves significantly, the network parameters are considered well learned.
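
A minimal sketch of the corpus expansion step, under simplifying assumptions: the replacement table is given directly as a dictionary rather than derived from word vectors, only one word is replaced per generated sentence, and the target side is kept unchanged, which is only reasonable for true synonyms. The function name and data layout are hypothetical.

```python
def augment_parallel_corpus(pairs, replacement_table):
    """Expand (source, target) sentence pairs by single-word synonym replacement.

    pairs:             list of (source_sentence, target_sentence)
    replacement_table: {word: [synonyms]} for the source language
    The original pairs are kept; each replacement adds one new pair.
    """
    augmented = list(pairs)
    for source, target in pairs:
        tokens = source.split()
        for i, token in enumerate(tokens):
            for alternative in replacement_table.get(token, []):
                new_source = " ".join(tokens[:i] + [alternative] + tokens[i + 1:])
                augmented.append((new_source, target))
    return augmented
```

Replacing with similar (not synonymous) words would also require changing the aligned word on the target side, which this sketch does not attempt.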

Owner:GLOBAL TONE COMM TECH
