Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

52 results about "Sequence annotation" patented technology

Sequence annotation. Sequence annotation is the "process of marking specific features in a DNA, RNA or protein sequence with descriptive information about structure or function".

Mammary gland electronic medical record entity recognition system based on multi-standard active learning

The invention relates to a mammary gland electronic medical record entity recognition system based on multi-standard active learning, and the system is characterized in that the system comprises a preprocessing module; an entity identification module; and an active learning module. According to the invention, the active learning selection strategy for text sequence annotation is designed by considering three aspects of annotation data volume, sentence annotation cost and data sampling balance, so the total annotation workload is reduced. On the one hand, the system can be used for constructingsystems such as breast disease risk patient identification marks, disease medicine recommendation and auxiliary decision diagnosis, doctors are helped to improve the execution efficiency of breast disease standardized diagnosis and treatment, and scientific bases and suggested schemes are provided; on the other hand, doctors can be assisted in finding out potential abnormal conditions in the diagnosis and treatment process, the misdiagnosis and missed diagnosis rate is reduced, the curing probability of breast disease patients is increased, and important value is achieved for intelligent development of breast disease research.
Owner:DONGHUA UNIV +1

Judicial judgment case information structural processing system

The invention discloses a judicial judgment case information structural processing system, which is applicable to the fields of information extraction and natural language processing. The system includes a judiciary judgement case information structured presentation module, an establishment judiciary judgement case information sequence annotation model module, an attribute trigger word managementmodule and a generation structured judiciary judgement case information module. According to the case types given by the users, the establishment judiciary judgement case information sequence annotation model module constructs a training set of judicial judgment case information sequence tagging and training sequence tagging model, and combines the attribute trigger word set to generate structuredjudicial judgment case information according to the method of generating structured judicial judgment case information. The system of the invention realizes the structured processing of the case information of the judicial judgment according to the case type and the case information of the judicial judgment provided by the user, and aims at providing an effective mode for extracting the structured information from the unstructured judicial judgment text.
Owner:HEFEI UNIV OF TECH

Method for constructing antibiotic resistance genbank

The invention discloses a bioinformatics method for constructing an antibiotic resistance genbank in the biotechnology field. The method comprises the following steps that: searching the protein sequence of a resistance gene in a GenBank; selecting a sequence with high accuracy as an initial sequence; adopting a Clustalw method for comparison; constructing a hidden Markov model, and searching a GenBank protein database to obtain all sequences which contain protein conserved sites; according to the E values of the sequences and the annotation information of the sequences in the GenBank database, removing the sequences which are highly homologous and do not conform to requirements; and after repeated sequences are removed, adding species annotation information; and integrating all protein sequences to finish database construction. By use of the method, the annotation information and the comparison similarity of a sequence can be comprehensively measured, and sequence collection speed andaccuracy can be improved. By use of the method provided by the invention, the construction of the antibiotic resistance genbank can be finished, and a basic data is provided for researching the primer design, the data analysis and the sequence annotation of the resistance gene.
Owner:RES CENT FOR ECO ENVIRONMENTAL SCI THE CHINESE ACAD OF SCI +1

Named entity identification method based on time convolution network

PendingCN110442860ASolve the defect that timing information cannot be obtainedSolve the problem of not being able to remember long-term informationNeural architecturesNeural learning methodsNamed-entity recognitionOne-hot
The invention relates to a named entity identification method based on a time convolution network. The method comprises the following steps of: firstly, constructing a feature representation layer which mainly consists of a word vector and a character feature layer, wherein the word vector layer and the character vector layer respectively accept words and characters as input, and respectively mapdiscrete One-hot representations to respective continuous dense low-dimensional feature spaces; splicing the word vectors and the character-level vectors to represent features of the words in a particular semantic space; secondly, taking the spliced features as input of a time convolution network, extracting different features through the time convolution network with different fusion convolutionkernel sizes, and obtaining final features h1h2... hn; finally, taking the obtained features as input of a CRF layer; and after the CRF further restrains context annotation, outputting sequence annotation results y1y2... yn. Compared with an existing LSTM network, the TCN network has the advantages that the recognition precision is slightly improved, and the training time is only about 1/3 of thatof the LSTM network.
Owner:DALIAN UNIV

Training file generation and evaluation method and device, computer system and storage medium

The invention discloses a training file generation and evaluation method and device, a computer system and a storage medium, and the method comprises the steps: receiving an original file, obtaining the domain information and training entity of the original file, and processing the original file according to the domain information and training entity, and obtaining a labeled file; identifying semanteme of the annotation file through a preset natural language understanding model, and performing sequence annotation on the annotation file to obtain a training file; and inputting the training fileinto an intelligent search model corresponding to the domain information to obtain a training result, calculating the training result through a hit analysis algorithm to obtain a hit rate, and summarizing the training file and the hit rate to generate a hit analysis report. The technical effect of automatically obtaining the training file is achieved, the generation quality and the generation speed of the training file are guaranteed, and the problem that the labeling quality of the training sample cannot be guaranteed due to the fact that the real hit rate of the training sample cannot be obtained at present is solved.
Owner:深圳平安医疗健康科技服务有限公司

Method for automatically extracting subject of argumentative article

The present invention relates to a method for automatically extracting a subject of an argumentative article, and belongs to the technical application field of natural language processing. The method disclosed by the present invention comprises: based on the sequence annotation strategy of the random field of the statistical condition, by analyzing semantic features and position characteristics of the subject in the title of the argumentative article and combining with performance of the trained corpus, establishing a commonly used word dictionary and an important word dictionary; using information such as dictionaries and words, locations and the like to carry out sequence feature annotation on the title of the argumentative article; and using the annotated corpus to train and generate the model, so that unknown data can be predicted, the relatively high accuracy can be ensured, and the applicability of the algorithm in different scenarios can be improved. According to the method disclosed by the present invention, automatic extraction of the subject in the argumentative article by the computer can be effectively realized, the main display object of the article can be displayed in an intuitive form, related information of the object can be quickly mastered by the reader in a facilitated manner, related content retrieval and comparison can be facilitated, and automatically extracted phrases can be provided for the computer to carry out various follow-up analysis.
Owner:贺惠新

Adversarial interpolation sequence-based annotation data enhancement method and device, equipment and medium

PendingCN113297355AGood effectSolve the problem that less affects the accuracy of the modelSemantic analysisText database queryingLinguistic modelAlgorithm
The invention discloses an adversarial interpolation sequence-based annotation data enhancement method and device, equipment and a medium. The method comprises the following steps: acquiring first sample data containing sequence labels; inputting the first sample data into a preset language model, outputting candidate word vectors conforming to context semantic constraints, and forming enhanced second sample data according to the candidate word vectors; and interpolating the first sample data and the second sample data by adopting an adversarial interpolation method to obtain interpolated enhanced sample data. According to the sequence annotation data enhancement method provided by the embodiment of the invention, the language model is utilized to provide the candidate word vector conforming to the context constraint, and the adversarial interpolation is utilized to consider the task characteristics, so that a more difficult sample which enables a machine learning algorithm to easily generate misjudgment is generated, the effect of the sequence model under low resources is improved. The problem that the accuracy of the model is influenced by less annotation data is solved.
Owner:CHINA PING AN LIFE INSURANCE CO LTD

Method and system for extracting dynamic information of smart home industry

The invention provides a method and a system for extracting dynamic information of the smart home industry, and provides a method for constructing automatic industry dynamic trend capture and automatically generating a report on the basis of industry dynamic data capture and extraction tasks in the field of smart home. According to the method, a smart industry dynamic data extraction mode combining industry priori knowledge and natural language processing sequence annotation can be provided on the basis of the smart home industry background and article structural information extraction, and meanwhile, an industry research report is automatically generated in combination with a text classification model based on deep learning and paragraph abstract extraction of multiple types of indexes. Moreover, the method is a process where a machine learning algorithm is deeply combined with business features of the smart home industry, a natural language analysis business process with a good prediction effect is researched through a large number of practices, the algorithm is efficient and highly targeted, and the process flow highly conforms to a data analysis business, and the success rate of data extraction and report generation is relatively high.
Owner:南京数脉动力信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products