Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

473 results about "Sequence labeling" patented technology

In machine learning, sequence labeling is a type of pattern recognition task that involves the algorithmic assignment of a categorical label to each member of a sequence of observed values. A common example of a sequence labeling task is part of speech tagging, which seeks to assign a part of speech to each word in an input sentence or document. Sequence labeling can be treated as a set of independent classification tasks, one per member of the sequence. However, accuracy is generally improved by making the optimal label for a given element dependent on the choices of nearby elements, using special algorithms to choose the globally best set of labels for the entire sequence at once.

Chinese medical knowledge atlas construction method based on deep learning

ActiveCN106776711AEasy to handleRelationship Accurate and ComprehensiveWeb data indexingSemantic analysisKnowledge unitHealthcare associated
The invention relates to the technology of a knowledge atlas, and aims to provide a Chinese medical knowledge atlas construction method based on deep learning. The Chinese medical knowledge atlas construction method comprises the following steps: obtaining relevant data of a medical field from a data source; using a word segmentation tool to carry out word segmentation on unstructured data, and using an RNN (Recurrent Neural Network) to finish a sequence labeling task to identify entities related to medical care, so as to realize the extraction of knowledge units; carrying out feature vector construction on the entity, and utilizing the RNN to carry out sequence labeling and finish the identification of a relationship among the knowledge units; carrying out entity alignment, and then utilizing the extracted entities and the relationship between the entities to construct the knowledge atlas. According to the Chinese medical knowledge atlas construction method, a recurrent neural network is artfully used for extracting the knowledge units and identifying the relationship among the knowledge units so as to favorably finish the processing of the unstructured data. According to the Chinese medical knowledge atlas construction method, features suitable for the medical care field are put forward to carry out a training task of a network. Compared with general features, the features put forward by the method can better represent a medical entity, and therefore, the relationship among the extracted knowledge units can be more accurate and comprehensive.
Owner:ZHEJIANG UNIV

Method and system for automatically constructing knowledge maps for mass unstructured texts

The invention belongs to the technical field of computer software, and discloses a method and a system for automatically constructing knowledge maps for mass unstructured texts. The method comprises the steps of: abstracting a named entity recognition problem into a sequence labeling problem by giving a sentence and labeling each word in the sequence of sentences; designing effective features according to the training data, learning various classification models, and using trained classifiers to predict relationships; linking multiple existing knowledge to create a large-scale and unified knowledge network from the top; and capturing and integrating entity information from three online encyclopedias, open websites, related knowledge bases, or search engine logs. According to the method andthe system for automatically constructing knowledge maps for mass unstructured texts, the construction speed of the knowledge maps can be greatly improved, the time efficiency is improved, and the human resource cost is reduced by more than 30%. In addition, the method and the system have better domain portability, and the construction of the knowledge map can be quickly implemented by only optimizing the entities and relationship extraction algorithms in the invention.
Owner:GLOBAL TONE COMM TECH

Electronic medical record text named entity recognition method based on pre-trained language model

The invention belongs to the technical field of medical information data processing, and particularly relates to an electronic medical record text named entity recognition method based on a pre-training language model, which comprises the following steps: collecting an electronic medical record text from a public data set as an original text, and preprocessing the original text; labeling the preprocessed original text entity based on the standard medical term set to obtain a labeled text; inputting the annotation text into a pre-training language model to obtain a training text represented bya word vector; constructing a BiLSTM-CRF sequence labeling model, and learning the training text to obtain a trained labeling model; and taking the trained labeling model as an entity recognition model, and inputting a test text to output a labeled category label sequence. According to the method, text features and semantic information in the deep language model are obtained through training in the super-large-scale Chinese corpus, a better semantic compression effect can be provided, the problem that manual annotation is tedious and complex is avoided, the method does not depend on dictionaries and rules, and the recall ratio and accuracy of named entity recognition are improved.
Owner:SUZHOU INST OF BIOMEDICAL ENG & TECH CHINESE ACADEMY OF SCI

Electronic official document entity extraction method

PendingCN110297913AGeneralization abilitySolve the time-consuming and labor-intensive problem of manually labeling a large amount of corpusSemantic analysisNeural architecturesPart of speechAlgorithm
The invention provides an electronic official document entity extraction method. The electronic official document entity extraction method comprises the following steps: A, preprocessing; B, constructing features; C, training an entity extraction model; D, obtaining a corpus; E, obtaining a word vector; F, training an algorithm model. According to the method, a traditional sequence labeling algorithm and a deep learning algorithm are combined, the advantage that a traditional sequence algorithm needs less corpus labeling is utilized, a semi-supervised method is adopted to expand corpuses, andthe problem that time and labor are wasted when a large number of corpuses need to be manually labeled in the deep learning algorithm is solved. Maximum forward and reverse dictionaries, syntax and semantic features are added into the CRF model, and front and rear boundary word features of entity words are fully considered, so that the algorithm has generalization ability. A dilated CNN and BiLSTM-CRF are combined, the dilated CNN takes a character-level vector and a character-level position feature as external features, and the external features and a part-of-speech vector are spliced into aword vector, so that more semantics and up-and-down related information can be expressed to a certain extent.
Owner:CETC BIGDATA RES INST CO LTD

Chinese electronic medical record named entity recognition method and system based on attention mechanism

The invention discloses a Chinese electronic medical record named entity recognition method and system based on an attention mechanism, and belongs to the field of text information mining. The technical problem to be solved by the invention is how to identify named entities in an electronic medical record more accurately and conveniently based on a neural network and an attention mechanism. According to the technical scheme, the method comprises the following steps: S1, obtaining word vector and part-of-speech vector representation of Chinese word part-of-speech and splicing the word vector and the part-of-speech vector; S2, splicing the word vector and the part-of-speech vector, and inputting the spliced word vector and part-of-speech vector into a Double-LSTMs neural network model for feature extraction to obtain more accurate implicit strata vector representation; S3, adding an attention layer, and endowing relatively important information in the text with a higher weight; S4, endowing the weight with a hidden layer vector obtained by corresponding forward encoding and a hidden layer vector obtained by reverse encoding, and respectively splicing the hidden layer vectors to serveas feature vectors; and S5, carrying out sequence labeling based on the conditional random field model to realize an identification task of the named entity.
Owner:山东健康医疗大数据有限公司

Sequence labeling model training method, electronic medical record processing method and related device

The embodiment of the invention relates to the technical field of natural language processing, and provides a sequence labeling model training method, an electronic medical record processing method and a related device. The method comprises the steps: obtaining a sample sequence and a standard label sequence of the sample sequence; inputting the sample sequence into a pre-established sequence labeling model, and obtaining an initial vector sequence of the sample sequence by utilizing an initial feature network of the sequence labeling model; inputting the initial vector sequence into a featureextraction network of a sequence labeling model, and obtaining a feature sequence by adopting an attention mechanism; inputting the feature sequence into a label prediction network of a sequence labeling model to obtain a training label result of the sample sequence; and based on the training label result and the standard label sequence, performing iterative correction on the sequence labeling model to obtain a trained sequence labeling model. According to the embodiment of the invention, an attention mechanism is introduced to better learn long-distance feature information in the sequence, so that the accuracy of sequence labeling is effectively improved.
Owner:NEW H3C BIG DATA TECH CO LTD

Subject term extraction method and system based on sequence labeling model

ActiveCN104794169AAchieve preliminary extractionSpecial data processing applicationsData miningMachine learning
The invention discloses a subject term extraction method and system based on a sequence labeling model, and belongs to the technical field of data extraction. The method includes the steps that firstly, labeling and class label setting are performed on subject terms in training linguistic data to obtain a labeling sequence, a subject term extraction model is obtained through training with the training linguistic data serving as an observation sequence and the labeling sequence serving as a state sequence, and the subject terms in the linguistic data to be extracted are preliminarily extracted with the model serving as an extractor; then, preliminary extraction results are screened according to the similarity between the subject terms to obtain the true subject terms belonging to corresponding subject fields. According to the extraction method and system, when the subject terms are extracted, by performing labeling on the subject terms in a small quantity of training linguistic data, rapid and accurate extraction of the subject terms in the linguistic data is achieved, meanwhile, existing knowledge hierarchy structures of the subject fields can be gradually improved, and the defects of a traditional subject term extraction method are overcome.
Owner:明博教育科技股份有限公司 +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products