Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

1256 results about "Named-entity recognition" patented technology

Named-entity recognition (NER) (also known as entity identification, entity chunking and entity extraction) is a subtask of information extraction that seeks to locate and classify named entity mentions in unstructured text into pre-defined categories such as the person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc.

Enquiry statement analytical method and system for information retrieval

The invention discloses a query sentence analyzing method based on understanding of natural languages and a system thereof, and belongs to the technical field of information retrieval. The query sentence analyzing method comprises the following steps: (1) automatic segmenting, named entity identification and part-of-speech tagging of an input Chinese query sentence are implemented; (2) syntax structure of the segmented sentence is analyzed so as to obtain a syntax structural tree, and meaning of each word is determined according to the sentence after the part-of-speech tagging; (3) according to the syntax structure and the meaning of each word, semantic roles of predicates in the sentence are tagged; and (4) according to the analyzed result of the sentence from the levels of syntactics, syntax and semantics, keywords are expanded and the keywords that can reflect user information retrieval requirements are extracted. The query sentence analyzing system of the invention comprises a syntactic analyzing module, a syntax analyzing module, a semantic analyzing module and a keyword extracting module. The query sentence analyzing method and system can greatly improve the accuracy of query results and provide desired query results for users.
Owner:PEKING UNIV

Question and answer method based on knowledge map

The invention provides a question and answer method based on a knowledge map. The question and answer method based on a knowledge map provided in the invention is realized by subject entity matching,relationship matching and answer determination. The subject entity matching mainly comprises naming entity identification and entity linking. The naming entity identification is aimed at identifying naming entities such as names of people, names of places, and names of organizations in natural language questions q. The entity linking corresponds the identified naming entity to a certain entity inthe knowledge base, that is, finding out an entity s in triples; Relationship matching is to understand the semantics expressed by question q through natural language understanding technology, and match the relationship p in the triples (s, p, o) in the search space in order to determine the semantics of the question and its corresponding relationship with the knowledge base. The candidate subjectentity is obtained through entity identification and entity linking, and the relationship matching can obtain the candidate relationship, thereby obtaining several candidate triples; the answer determination is to rank the candidate triples according to entity recognition score, relationship match score, etc. to determine the final answer.
Owner:BEIHANG UNIV

Chinese natural language interrogative sentence semantization knowledge base automatic question-answering method

The invention discloses a Chinese natural language interrogative sentence semantization knowledge base automatic question-answering method. The method includes the following steps that Chinese natural language processing is performed on a fact type question input by a user, word segmentation, part-of-speech tagging and identification and expanding of a named entity are achieved, and a semantic dependency tree is generated; a generalization template and a semantic analysis technology are used for acquiring time, space, a fact entity, a fact object and the like in an interrogative sentence, then semantic processing is performed, composition element attributes relevant to all events in the interrogative sentence and values of the attributes are extracted, a plurality of 'attribute-value' pairs are generated, to-be-answered elements are substituted by interrogatives, and a complex fact triple set is formed; after a triple where a to-be-answered part is located is combined with other relevant fact triple sets to form knowledge base query with conditional constraints, and query matching based on similarity calculation is performed in a knowledge base, a result is extracted from the knowledge base, and a final answer is obtained. Fast and accurate query response to the knowledge base is achieved.
Owner:NANJING UNIV

Named entities recognition method based on bidirectional LSTM and CRF

The invention discloses a named entities recognition method based on bidirectional LSTM and CRF. The named entities recognition method based on the bidirectional LSTM and CRF is improved and optimizedbased on the traditional named entities recognition algorithm in the prior art. The named entities recognition method based on the bidirectional LSTM and CRF comprises the following steps: (1) preprocessing a text, extracting phrase information and character information of the text; (2) coding the text character information by means of the bidirectional LSTM neural network to convert the text character information into character vectors; (3) using the glove model to code the text phrase information into word vectors; (4) combining the character vectors and the word vectors into a context information vector and putting the context information vector into the bidirectional LSTM neural network; and (5) decoding the output of the bidirectional LSTM with a linear chain condition random field to obtain a text annotation entity. The invention uses a deep neural network to extract text features and decodes the textual features with the condition random field, therefore, the text feature information can be effectively extracted and good effects can be achieved in the entity recognition tasks of different languages.
Owner:南京安链数据科技有限公司

Method and device for establishing medical knowledge graph, and auxiliary diagnosis method

The invention discloses a method and device for establishing a medical knowledge graph, and an auxiliary diagnosis method. The method for establishing the medical knowledge graph comprises the steps that a user dictionary is established according to a medical database; electronic medical record data is processed, and named entity recognition is conducted; correlation relations are established for each recognized entity; and the medical knowledge graph is established according to the correlation relations. The auxiliary diagnosis method based on the medical knowledge graph comprises the steps that a patient's chief complaint data and inspection data are acquired and processed, so that a symptom entity and a sign entity of the patient can be obtained; a disease entity correlated with the symptom entity and the sign entity is searched in the medical knowledge graph, and a posterior probability of each disease entity in a set composed of the corresponding symptom entity and the sign entity is computed respectively; and the disease entity with the maximum posterior probability and data corresponding to correlated nodes of the disease entity are output. According to the invention, intelligent auxiliary diagnosis is provided for clinical medical science, so that working burdens of medical workers are relieved; medical stress is relieved; and occurrence rate of medical accidents is reduced.
Owner:HEFEI UNIV OF TECH

Method and system for automatically constructing knowledge maps for mass unstructured texts

The invention belongs to the technical field of computer software, and discloses a method and a system for automatically constructing knowledge maps for mass unstructured texts. The method comprises the steps of: abstracting a named entity recognition problem into a sequence labeling problem by giving a sentence and labeling each word in the sequence of sentences; designing effective features according to the training data, learning various classification models, and using trained classifiers to predict relationships; linking multiple existing knowledge to create a large-scale and unified knowledge network from the top; and capturing and integrating entity information from three online encyclopedias, open websites, related knowledge bases, or search engine logs. According to the method andthe system for automatically constructing knowledge maps for mass unstructured texts, the construction speed of the knowledge maps can be greatly improved, the time efficiency is improved, and the human resource cost is reduced by more than 30%. In addition, the method and the system have better domain portability, and the construction of the knowledge map can be quickly implemented by only optimizing the entities and relationship extraction algorithms in the invention.
Owner:GLOBAL TONE COMM TECH

Multi-task named entity recognition and confrontation training method for medical field

The invention discloses a multi-task named entity recognition and confrontation training method for medical field. The method includes the following steps of (1) collecting and processing data sets, so that each row is composed of a word and a label; (2) using a convolutional neural network to encode the information at the word character level, obtaining character vectors, and then stitching withword vectors to form input feature vectors; (3) constructing a sharing layer, and using a bidirection long-short-term memory nerve network to conduct modeling on input feature vectors of each word ina sentence to learn the common features of each task; (4) constructing a task layer, and conducting model on the input feature vectors and the output information in (3) through a bidirection long-short-term network to learn private features of each task; (5) using conditional random fields to decode labels of the outputs of (3) and (4); (6) using the information of the sharing layer to train a confrontation network to reduce the private features mixed into the sharing layer. According to the method, multi-task learning is performed on the data sets of multiple disease domains, confrontation training is introduced to make the features of the sharing layer and task layer more independent, and the task of training multiple named entity recognition simultaneously in a specific domain is accomplished quickly and efficiently.
Owner:ZHEJIANG UNIV

Dependency semantic-based Chinese unsupervised open entity relationship extraction method

The invention relates to a dependency semantic-based Chinese unsupervised open entity relationship extraction method. The method comprises the following steps of preprocessing an input text: performing Chinese word segmentation, part-of-speech tagging and dependency grammar analysis on the input text; performing named entity identification on the input text; arbitrarily selecting two entities from identified entities to form candidate entity pairs; searching for a dependency path between two entities in the candidate entity pairs; and analyzing whether a syntactic structure mapped by the dependency path is matched with a normal form of a dependency semantic normal form set or not, if yes, extracting words or phrases from the residual part of the input text according to the matched normal form to serve as relational words, forming a relational triple by the extracted relational words and the candidate entity pairs, and if not, performing normal form matching of a next group of the candidate entity pairs; and outputting the relational triple. Compared with the prior art, the method has the advantages that the calculation complexity is low; the extraction efficiency is high; distance position limitation is overcome; a simple sentence also can be extracted and the like.
Owner:TONGJI UNIV

Geographical science domain named entity recognition method

ActiveCN107133220AEntity recognition implementationCorrect mislabeling issueSemantic analysisSpecial data processing applicationsDomain nameConditional random field
The invention discloses a geographical science domain named entity recognition method, which is used for recognizing geographical science core term entities and geographical location entities. The method mainly comprises three steps of (1) establishing a geographical science domain dictionary, and using a new word discovery algorithm to identify new words in the geographical science domain in an unsupervised way; (2) training and testing based on a conditional random field (CRF) model and a multichannel convolutional neural network (MCCNN) model; (3) carrying out error correcting and fusion on entities recognized by the models by using a rule-based method. According to the geographical science domain named entity recognition method, the new words of the domain are identified as the dictionary in an unsupervised way by using the new word discovery algorithm, so that the work distinguishing effect is improved. The semantic vectors of the words are learnt from large-scale unmarked data in an unsupervised way, and basic characteristics of the words are synthesized and are taken as the input characteristics of the MCCNN model, so that manual selection and construction of the characteristics are avoided. The predicting results of the two models are fused by means of a custom rule, so that the problem of error marking in a recognition process can be corrected.
Owner:SOUTHEAST UNIV

Online traditional Chinese medicine text named entity identifying method based on deep learning

The invention discloses an online traditional Chinese medicine text named entity identifying method based on deep learning. The method includes the steps that online traditional Chinese medicine text data are obtained through a web crawler, and named entities of the obtained online traditional Chinese medicine text data are labeled with existing terminological dictionaries and human assistance; a word2vec tool is used for carrying out learning on large-scale label-free linguistic data, and word vectors with fixed length are obtained and used for forming a corresponding glossary; word segmentation is carried out on the online traditional Chinese medicine text data, words are converted into the word vectors with the fixed length by searching for the glossary, the word vectors serve as input of a convolutional neural network, and a blank character is used for filling when sentence length is insufficient; output of the convolutional neural network serves as input of a bidirectional long-short-time memory recurrent neural network, and an identification result of the online traditional Chinese medicine text data words to be identified is output. Compared with a traditional method for named entity identifying, the method reduces complexity and workload of feature extraction, simplifies the processing process and remarkably improves identification efficiency.
Owner:SOUTH CHINA UNIV OF TECH

Industry comment data fine grain sentiment analysis method

The invention relates to an industry comment data fine grain sentiment analysis method. The industry comment data fine grain sentiment analysis method is applied to Internet data analysis and comprises obtaining comment data of e-commerce industry goods and preprocessing the comment data; establishing initial industry sentiment word libraries and computing distribution of words under different sentiment polarities through 1-gram and 2-gram; performing Chinese word segmentation on the comment data; based on the sentiment word libraries established through the 1-gram and the 2-gram, utilizing combined sentiment models to perform word modeling to obtain the probability distribution of the words which belong to different topics under different sentiment distributions; utilizing context information to re-determine the sentiment alignment of sentiment words in sentences; performing named entity identification and extracting comment characteristics through conditional random fields to compute the sentiment alignment of comment words of the comment characteristics. The industry comment data fine grain sentiment analysis method computes the sentiment of the comment words through the two dimensions of topic and sentiment to achieve fine grain sentiment analysis on the industry comment data, thereby achieving high precision and interpretability of analysis results.
Owner:中科嘉速(北京)信息技术有限公司

Text information associating and clustering collecting processing method based on domain knowledge model

The invention provides a text information associating and clustering collecting processing method based on a domain knowledge model. The method comprises the following steps that a text information training set is searched, stemming preprocessing is conducted, and feature word vectors of a text participle sequence of the information training set are extracted through Chinese named entity identification and domain dictionary query modes; representative feature words of a target event are extracted through topic graph model learning training, and a weighted value of topic associating affiliation is calculated; a feature word set is built according to the topic associating affiliation weighted value, calculated through training, of the feature words, and an event topic word template is built; feature word vectors of a participle sequence accessed to text in real time are extracted through the Chinese named entity identification and domain dictionary query modes; the similarity distance of the feature word vectors and all the target event knowledge templates is calculated; the association relationship of multiple texts to the same topic target event is determined according to the similarity threshold, and classification reorganization is conducted by means of a similarity distance ordering rule.
Owner:10TH RES INST OF CETC

Entity relationship recognition method and apparatus

The present invention relates to an entity relationship recognition method and apparatus. The method comprises obtaining a statement sequence from a target text in a corpus, and performing named entity recognition and dependency grammar marker on the statement sequence to obtain a marked text sentence; matching and retrieving the marked text sentence on basis of an entity relationship seed to obtain a training example; replacing the entity relationship seed word in the training example with predetermined identification, processing the training example after replacement combined with the named entity recognition and the dependency grammar marker, and generating a candidate rule; fuzzifying the candidate rule to obtain fuzzy rules; determining whether the fuzzy rules comprise a new rule; and retrieving the corpus according to the fuzzy rules to obtain a seed set when the fuzzy rules comprise the new rule, and using the obtained seed set as an entity relationship recognition result. Manual participation can be effectively reduced, dependence on the calibrated corpus is reduced, a new entity relationship can be found timely, and the entity relationship recognition method and apparatus are self-adaptive to entity relationship mining in different fields.
Owner:LETV HLDG BEIJING CO LTD +1

Electronic medical record text named entity recognition method based on pre-trained language model

The invention belongs to the technical field of medical information data processing, and particularly relates to an electronic medical record text named entity recognition method based on a pre-training language model, which comprises the following steps: collecting an electronic medical record text from a public data set as an original text, and preprocessing the original text; labeling the preprocessed original text entity based on the standard medical term set to obtain a labeled text; inputting the annotation text into a pre-training language model to obtain a training text represented bya word vector; constructing a BiLSTM-CRF sequence labeling model, and learning the training text to obtain a trained labeling model; and taking the trained labeling model as an entity recognition model, and inputting a test text to output a labeled category label sequence. According to the method, text features and semantic information in the deep language model are obtained through training in the super-large-scale Chinese corpus, a better semantic compression effect can be provided, the problem that manual annotation is tedious and complex is avoided, the method does not depend on dictionaries and rules, and the recall ratio and accuracy of named entity recognition are improved.
Owner:SUZHOU INST OF BIOMEDICAL ENG & TECH CHINESE ACADEMY OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products