70 results about "Text filtering" patented technology

Short text classification method based on topic word vectors and convolutional neural network

The invention discloses a short text classification method based on topic word vectors and a convolutional neural network, comprising the following steps: 1) data acquisition: acquiring short text data as required and labeling it as a training set; 2) data preprocessing: performing word segmentation, stop word removal, useless text filtering and the like on the text; 3) short text feature representation: representing the text at the topic level and at the word vector level respectively; 4) joint training of the topic word vectors; 5) iteratively optimizing the parameters of the convolutional neural network classification model; and 6) predicting the category of new samples. The invention takes the characteristics of short text data into account: in the feature representation stage, topic vectors and word vectors are combined to expand the semantic features of the short text, and in the classification model training stage, the convolutional neural network's ability to extract locally sensitive information is used to further mine the semantic information of the text, so that indexes such as category prediction accuracy in short text classification tasks can be improved.
Owner: NANJING UNIV
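
Steps 2, 3 and 5 of this abstract can be illustrated with a small sketch. It is not the patented implementation: the stop-word list, the whitespace tokenizer (standing in for real word segmentation), the toy vectors, and the function names `preprocess`, `build_feature_matrix` and `conv_max_pool` are all assumptions made for illustration. Each token's word vector is concatenated with a document-level topic vector, and a single convolution filter with max-over-time pooling stands in for the convolutional classifier's feature extractor.

```python
from typing import Dict, List, Sequence

# Hypothetical stop-word list; a real system would use a curated one.
STOP_WORDS = {"the", "a", "of", "is"}


def preprocess(text: str) -> List[str]:
    """Lowercase, split on whitespace, and drop stop words."""
    return [tok for tok in text.lower().split() if tok not in STOP_WORDS]


def build_feature_matrix(
    tokens: Sequence[str],
    word_vectors: Dict[str, Sequence[float]],
    topic_vector: Sequence[float],
) -> List[List[float]]:
    """Concatenate each known token's word vector with the document topic vector."""
    return [
        list(word_vectors[tok]) + list(topic_vector)
        for tok in tokens
        if tok in word_vectors
    ]


def conv_max_pool(
    matrix: Sequence[Sequence[float]], kernel: Sequence[Sequence[float]]
) -> float:
    """Apply one filter to every window of len(kernel) rows, then ReLU and max-pool."""
    width = len(kernel)
    scores = []
    for start in range(len(matrix) - width + 1):
        window = matrix[start:start + width]
        total = sum(
            k * x
            for k_row, m_row in zip(kernel, window)
            for k, x in zip(k_row, m_row)
        )
        scores.append(max(0.0, total))
    # Texts shorter than the filter width yield no windows.
    return max(scores) if scores else 0.0


tokens = preprocess("The price of gold is rising")
vectors = {"price": [1.0, 0.0], "gold": [0.0, 1.0], "rising": [1.0, 1.0]}
features = build_feature_matrix(tokens, vectors, topic_vector=[0.5, 0.5])
pooled = conv_max_pool(features, kernel=[[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]])
```

In a full model, many such filters of several widths would be learned, and their pooled outputs would feed a softmax layer for category prediction.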

Automatic generating system for role Chinese mouth shape cartoon

The invention discloses an automatic generating system for a role Chinese mouth shape cartoon. The system comprises a dialogue text filtering and coding module, a dialogue phonetic segmentation module, a dialogue segmentation code integrating module, and a role Chinese mouth shape cartoon generating module. The dialogue text filtering and coding module performs phrase segmentation, pinyin mouth shape coding, integral recognition mark setting and coding, and filtering on a dialogue text, and outputs a dialogue mouth shape code, an integral dialogue recognition coding mark, and a dialogue mouth shape filtering and coding sequence. The dialogue phonetic segmentation module performs phonetic sampling and phonetic energy statistics on the dialogue audio and outputs dialogue phonetic segmentation candidate result sequences. The dialogue segmentation code integrating module is connected to both of these modules and integrates and corrects the candidate result sequences to output a dialogue segmentation code sequence. The role Chinese mouth shape cartoon generating module is connected to the dialogue segmentation code integrating module and generates the role Chinese mouth shape cartoon from the dialogue segmentation code sequence. The system can automatically produce the whole role Chinese mouth shape cartoon without loading a corresponding phonetic library during processing.
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI
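
The abstract does not say how the phonetic energy statistics produce segmentation candidates. One common approach, shown here purely as an assumed stand-in, is short-time energy thresholding: the audio is cut into fixed-size frames, and runs of frames whose energy exceeds a threshold become candidate segments. The names `frame_energies` and `candidate_segments`, the frame size, and the threshold are hypothetical.

```python
from typing import List, Sequence, Tuple


def frame_energies(samples: Sequence[float], frame_size: int) -> List[float]:
    """Sum of squared amplitudes for each consecutive, non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_size])
        for i in range(0, len(samples), frame_size)
    ]


def candidate_segments(
    samples: Sequence[float], frame_size: int, threshold: float
) -> List[Tuple[int, int]]:
    """Return (start_frame, end_frame) pairs, end exclusive, where energy > threshold."""
    energies = frame_energies(samples, frame_size)
    segments = []
    start = None
    for idx, energy in enumerate(energies):
        if energy > threshold and start is None:
            start = idx                      # a voiced run begins
        elif energy <= threshold and start is not None:
            segments.append((start, idx))    # the run ends at a quiet frame
            start = None
    if start is not None:
        segments.append((start, len(energies)))
    return segments


audio = [0, 0, 1, 1, 1, 1, 0, 0, 2, 2]
segments = candidate_segments(audio, frame_size=2, threshold=0.5)
```

Candidates of this kind would still need the integration and correction step that the abstract assigns to the dialogue segmentation code integrating module.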

Short text labeling method, system and device for large-scale classification system

The invention belongs to the field of text classification, particularly relates to a short text labeling method, system and device for a large-scale classification system, and aims to solve the problem that short text labeling systems for large-scale classification systems have low stability when data is limited. The method comprises the following steps: a first short text information set to be classified is acquired and preprocessed using forward maximum matching word segmentation and word2vec word vector representation to obtain a second short text information set; binary classification is performed on the second short text information set using a rule-based classification method and a supervised neural network classification method, and the short texts are then filtered; first-level and second-level classification labels are assigned to each short text using the same classification methods; and third-level and fourth-level classification labels are assigned to each short text using a label propagation method based on semi-supervised learning. The method ensures the stability of the short text labeling system for a large-scale classification system when data is limited.
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI +1
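
Forward maximum matching, the segmentation technique named in this abstract, can be sketched as follows. At each position it takes the longest dictionary word that starts there and falls back to a single character when nothing matches. The toy vocabulary and the function name `forward_max_match` are assumptions for illustration; the patent's own dictionary and parameters are not given.

```python
from typing import List, Optional, Set


def forward_max_match(
    text: str, vocab: Set[str], max_len: Optional[int] = None
) -> List[str]:
    """Greedy left-to-right segmentation that prefers the longest dictionary match."""
    if max_len is None:
        max_len = max((len(word) for word in vocab), default=1)
    max_len = max(1, max_len)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a match is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


vocab = {"研究", "研究生", "生命", "起源"}
print(forward_max_match("研究生命起源", vocab))
```

The example prints `['研究生', '命', '起源']`, which shows the known weakness of greedy matching: the intended reading is 研究 / 生命 / 起源, but the longer word 研究生 wins at the first position.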

Social-media short text filtering method based on structure and text information

Inactive | CN107562728A | To achieve the purpose of filtering junk data | Easy to handle | Special data processing applications | Feature extraction | Characteristic space
The invention discloses a social-media short text filtering method based on structure and text information. The method includes the following steps: 1) the structural characteristics of a short text are judged, and junk information is deleted; 2) the core of the text is extracted: a judging structure determines whether a retained text segment contains the core information of the described event; if no core information exists, the segment is determined to be junk information, and if core information exists, the core components are extracted; 3) textual features are extracted, and the core components of the text obtained in step 2 are mapped to a feature space. By scanning the word segmentation set of the text, structural characteristics can be used to judge whether junk information exists, so that mass data in the social network is processed easily and efficiently. By identifying characteristics of words, sentence patterns and the like, feature selection is achieved. A sentence vector is constructed by summing word2vec word vectors and taking the average, which reduces the calculation amount of the classifier model during training while representing the semantic information of the text well.
Owner: UNIV OF ELECTRONIC SCI & TECH OF CHINA
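
The sentence-vector construction described here, summing word vectors and taking the average, is simple enough to sketch directly. The toy two-dimensional vectors and the function name `sentence_vector` are assumptions; in practice the vectors would come from a trained word2vec model.

```python
from typing import Dict, List, Sequence


def sentence_vector(
    tokens: Sequence[str], word_vectors: Dict[str, Sequence[float]], dim: int
) -> List[float]:
    """Average the vectors of all tokens that have one; unknown tokens are skipped."""
    total = [0.0] * dim
    count = 0
    for tok in tokens:
        vec = word_vectors.get(tok)
        if vec is None:
            continue
        for j, value in enumerate(vec):
            total[j] += value
        count += 1
    if count == 0:
        return total  # all-zero vector when no token is known
    return [value / count for value in total]


toy_vectors = {"flood": [1.0, 3.0], "city": [3.0, 1.0]}
vec = sentence_vector(["flood", "hits", "city"], toy_vectors, dim=2)
```

Because the result has a fixed dimension regardless of sentence length, a downstream classifier trains on one small vector per text, which is consistent with the reduced calculation amount the abstract claims.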

Method of recommending personalized treatment scheme for stroke patient

Active | CN111524571A | Solve the problem of inconsistent input length | Reduce training time | Therapies | Medical automated diagnosis | Medical record | Nerve network
The invention discloses a method of recommending a personalized treatment scheme for a stroke patient. The method comprises the following steps: S1, preprocessing the text information about physical examination and evaluation results in patients' electronic medical records; S2, expressing the words, sentences and documents in those physical examination and evaluation results as vectors; S3, training a neural network model on the document vectors to obtain a personalized treatment scheme recommendation model; and S4, carrying out unified data expression, word segmentation and text filtering on the physical examination and evaluation results in a new patient's electronic medical record, representing the result as a document vector, and inputting the document vector into the personalized treatment scheme recommendation model to obtain a recommended personalized treatment scheme. The method treats the evaluation and physical examination information in a patient's electronic medical record as a document and converts personalized treatment scheme recommendation into a multi-label classification problem, so that a scheme can be recommended according to the patient's physical examination and evaluation results, providing decision support for the doctor and reducing the doctor's burden.
Owner: UNIV OF ELECTRONICS SCI & TECH OF CHINA
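
Treating recommendation as multi-label classification means each candidate scheme gets its own yes/no decision, so one document vector can map to several schemes at once. The sketch below assumes a single linear layer with an independent sigmoid per label. The label names, weights, threshold, and the function name `recommend` are hypothetical, and the patent's actual network architecture is not described in the abstract.

```python
import math
from typing import Dict, List, Optional, Sequence


def sigmoid(x: float) -> float:
    """Logistic function mapping a score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def recommend(
    doc_vector: Sequence[float],
    label_weights: Dict[str, Sequence[float]],
    label_biases: Optional[Dict[str, float]] = None,
    threshold: float = 0.5,
) -> List[str]:
    """Return every label whose independent sigmoid probability meets the threshold."""
    biases = label_biases or {}
    chosen = []
    for label, weights in label_weights.items():
        score = sum(w * x for w, x in zip(weights, doc_vector))
        score += biases.get(label, 0.0)
        if sigmoid(score) >= threshold:
            chosen.append(label)
    return sorted(chosen)


# Hypothetical weights; a trained model would learn these from document vectors.
weights = {
    "scheme_a": [2.0, 0.0],
    "scheme_b": [-2.0, 0.0],
    "scheme_c": [0.0, 3.0],
}
print(recommend([1.0, 1.0], weights))
```

With these toy weights the example prints `['scheme_a', 'scheme_c']`. Using a per-label sigmoid rather than a softmax is what allows zero, one, or several schemes to be selected for the same patient record.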