Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

200results about How to "Realize automatic labeling" patented technology

Chinese domain term recognition method based on mutual information and conditional random field model

The invention discloses a Chinese domain term recognition method based on mutual information and a conditional random field model. The Chinese domain term recognition method includes the following steps: (1) gathering domain text corpus and marking all the punctuations, spaces, numbers, ASSCII (American Standard Code for Information Interchange) characters and characters except Chinese characters in the corpus; (2) setting character strings and computing the mutual information values of the character strings, (3) computing the left comentropy and the right comentropy of every character string, (4) defining character string evaluation function, setting evaluation function threshold, computing the evaluation function values of every character string, determining that every character string is a word, comparing in sequence the evaluation function value of the former character with the evaluation function value of the latter character in the character string and segmenting character meaning character strings one by one, (5) utilizing conditional random fields to train a conditional random field model and recognizing domain terms with the conditional random field model. When the Chinese domain term recognition method is used to recognize terms, the data sparsity of legitimate terms is overcome, the amount of calculation of conditional random fields is reduced, and the accuracy of the Chinese domain term recognition is improved.
Owner:SHANGHAI UNIV

Face image gender marking method and face gender detection method

The invention discloses a face image gender marking method and a face gender detection method. The detection method includes the following steps that: 1) face images and context information thereof are obtained; 2) the genders of each of obtained face images to be marked is marked; 3) feature vectors of each of gender marked images are extracted, and a machine learning algorithm is utilized to train the face images which have been subjected to gender marking so as to generate a face gender recognition model; and 4) as for face images to be detected, the feature vectors of the face images to be detected are extracted, and the face gender recognition model is utilized to perform gender detection on the face images to be detected. The gender of each of the obtained faces to be marked is marked in the following manners that: name keywords of candidates are extracted from the context information of the images, and a network is searched, and the method returns a result webpage; the genders of the images are determined according to the word frequency of gender relevant words in the result webpage; and a face image technology platform and a face attribute analysis algorithm are respectively adopted to detect the genders of the images; and the gender of the images are marked based on above recognition results. With the face image gender marking method and the face gender detection method of the invention adopted, face image marking efficiency and gender detection efficiency can be greatly improved.
Owner:BEIJING KUANGSHI TECH

Abnormal flow detection system and method

The invention provides an abnormal flow detection system and method, and the system comprises a flow feature analysis and selection module which screens flow features according to the flow features ofa ubiquitous power IOT, and carries out the dimension reduction of the flow features through a KPCA algorithm; a flow reference model construction module which is used for extracting flow characteristics after dimension reduction, constructing a restricted Boltzmann machine model based on an RBM model and an SOM clustering algorithm and training to complete construction of a reference model; a flow benchmark model training module which is used for carrying out anomaly degree division on the trained benchmark model according to a comparison divergence degree algorithm, and dividing the benchmark model into a normal benchmark model and an abnormal benchmark model; and an abnormal flow detection module which is used for extracting and calculating flow characteristics to be detected, and carrying out abnormal flow detection according to the similarity between the output of the reference model and the original input characteristic data. According to the invention, the automatic category labeling of the traffic data can be completed, and the network traffic anomaly detection accuracy is relatively high.
Owner:SHANGHAI JIAO TONG UNIV +1

Method and apparatus for labeling face images

The invention provides a method and an apparatus for labeling face images. The method comprises: acquiring a face image set which corresponds to a specific personal name and comprises a plurality of face images, wherein each face image only comprises one face; establishing a whole object assumption that both random face images in the face image set belong to the same person, and verifying whether the whole object assumption is tenable by a predetermined face verification model; and if verifying the whole object assumption is tenable, labeling the plurality of face images in the face image set as a result that the face images in the face image set belong to the specific personal name. According to the scheme, a great number of images crawled from the Internet can be rapidly discriminated and screened, error images are efficiently removed, labor and time cost of labeling the face images is greatly saved, and a powerful guarantee is provided for a subsequent face identification model based on automatic labeling, which has high face image training accuracy; and meanwhile, according to the scheme, an existing data monopoly in the industry can be broken through, which is convenient for popularization of a face identification technology, thereby promoting the progress of the technology.
Owner:BEIJING QIHOO TECH CO LTD

Online marking method for three-dimensional model component

The invention discloses an online marking method for a three-dimensional model component. The method includes the following steps that preprocessing is conducted, and features and over-segmentation pieces of triangular patches are obtained; initial marking is conducted, a segmentation model is initialized, seed area marking is conducted on an initial three-dimensional model which is input by a user, a seed marking set is obtained, online learning is conducted on the seed marking set, and the partition model is learned; online marking is conducted, the segmentation model is used for conducting segmentation marking on an input three-dimensional model of the same type, a middle marking result is obtained, whether the result is approved or not is judged by the user, if yes, the process is over, and the middle marking result is a final marking result, and if not, the user conducts area correction on the middle marking result, the seed marking set is obtained, online learning is conducted on the seed marking set, the segmentation model is updated, the updated segmentation model is used for conducting segmentation marking on the three-dimensional model, the middle marking result is obtained, continuous iteration is conducted until the middle marking result is approved by the user, and the final three-dimensional model marking result is obtained.
Owner:NANJING UNIV

Named entity identification method and system for legal instrument multi-strategy fusion

The invention discloses a named entity recognition method and system for legal document multi-strategy fusion, and the method comprises the following steps: building a source data corpus, carrying outthe part-of-speech tagging and sequence tagging of the source data corpus, and carrying out the model pre-training; training the labeled data through a BiLSTM-Attention-CRF (Bipolar Long Short Term Memory-Attention-Content Random Field) model to obtain a trained first model; improving the trained first model; establishing a target data corpus, randomly extracting data from the target data of thelegal instrument, and generating a plurality of training sets; carrying out transfer learning on the plurality of training sets, and training the improved first model to obtain models trained by the plurality of training sets; and integrating the models trained by the plurality of training sets by adopting a voting mechanism in ensemble learning to obtain a second model, and performing named entity identification of legal documents by the second model to obtain a final named entity identification result. According to the method, the accuracy and recall rate of named entity recognition are improved under the condition of insufficient annotation corpora.
Owner:SOUTH CHINA NORMAL UNIVERSITY

Sample labeling method and device, server and machine readable storage medium

The embodiment of the invention provides a sample labeling method and device, a server and a machine readable storage medium. The method comprises the steps of obtaining a sample set, taking a plurality of labeled samples in the sample set as the training samples, and respectively training a plurality of preset network models of different network structures to obtain the plurality of trained network models; aiming at any one unlabeled sample in the sample set, respectively inputting the unlabeled sample into the plurality of trained network models to obtain an output result of each network model, fusing the output results to obtain the labeling information of the unlabeled sample, and labeling the unlabeled sample by utilizing the labeling information. The sample set comprises the plurality of labeled samples and the plurality of unlabeled samples, the ratio of the number of the labeled samples to the total number of the samples in the sample set is smaller than a preset ratio, and thelabeling personnel only need to label a small number of samples, so that the workload of manual labeling is reduced, the risks of wrong labeling and missing labeling of manual labeling are reduced, and the sample labeling efficiency is improved.
Owner:BEIJING QIYI CENTURY SCI & TECH CO LTD

Method and device for operating autonomous vehicle

The invention discloses a method and device for operating an autonomous vehicle. According to a specific implementation mode, the method for operating the autonomous vehicle comprises the steps of collecting a driving scheme and driving scene information adopted by a driver during performing abnormal intervention on a semi-autonomous vehicle, determining the level of risk according to the drivingscheme; decomposing object information, vehicle running state information and driving environment information of a driving environment where the vehicle is in from the driving scene information; learning the object information and the corresponding level of risk to recognize a risk object associated with the level of the risk; for the risk object, learning the object information, the vehicle running state information, the driving environment information and the driving scheme to determine an association relationship between the combination of the object information of the risk object, the vehicle running state information and the driving environment information and the driving scheme as a driving strategy; and using the driving strategy to determine a candidate driving scheme. According tothe specific implementation mode of the method for operating the autonomous vehicle, the candidate driving scheme is optimized according to behaviors of the driver.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Decision tree model training method, and method and apparatus for determining data attributes in OCR result

The invention discloses a decision tree model training method, and a method and an apparatus for determining data attributes in an OCR result. The decision tree model training method comprises the steps of obtaining a sample medical data picture, and performing OCR on the sample medical data picture to generate a first OCR result, wherein the first OCR result is a two-dimensional character string array, and each column of data in the two-dimensional character string array is used for indicating data which belongs to a same attribute column; extracting first feature information of each piece of data in the first OCR result; obtaining first labeled data corresponding to each piece of the data in the first OCR result, wherein the first labeled data is used for indicating an attribute which each piece of the data belongs to; and performing training according to the first feature information and the first labeled data to generate a decision tree model used for determining the data attributes in the OCR result. According to the method, the purpose of automatically labeling the data attributes in the recognition result is achieved; the consumption cost in a to-be-recognized picture recognition process is effectively reduced; and the recognition efficiency is improved.
Owner:天方创新(北京)信息技术有限公司

Short text classification method based on multiple weak supervision integration

ActiveCN111444342AHandling Imbalanced Classification Problems EfficientlyImbalanced Classification Problem SolvingNatural language data processingSpecial data processing applicationsOriginal dataClassification methods
The invention discloses a short text classification method based on multiple weak supervision integration, and the method comprises the steps: obtaining an original data set and a knowledge base, andcarrying out the data preprocessing; carrying out knowledge extraction on the preprocessed data; representing the extracted knowledge as an annotation function, and using the annotation function for data annotation; carrying out label integration through a conditional independent model; training a classification model based on a full-connection neural network; evaluating and optimizing the classification model to obtain an optimal model; and performing short text classification by utilizing the optimal model. According to the short text classification method based on multiple weak supervisionintegration, explicit knowledge and implicit knowledge are completely expressed in a mode of combining keyword matching, regular expression and remote supervision clustering; by means of probability labels generated by a label integration mechanism, automatic labeling of label-free data is achieved, the problem of data sparsity of short texts is relieved, and the problem of unbalanced classification of the short texts is effectively solved.
Owner:湖南董因信息技术有限公司

Pipeline weld joint automatic labeling method based on Revit platform

The invention relates to a pipeline weld joint automatic labeling method based on a Revit platform. The pipeline weld joint automatic labeling method comprises the following steps: 1, establishing a BIM of a pipeline, and establishing a platform by taking Revit as graphic support software; and 2, placing a workpiece under the platform, classifying pipe weld problems, carrying out pipeline weldingseam labeling analysis; filtering out all pipelines in the current three-dimensional view; traversing all pipelines to obtain parameter information of a starting end pipeline, a pipeline connector andall pipelines, calculating the length of the pipelines, solving the number of weld joints, sorting the pipeline connectors, labeling from the starting connector, calculating whether the length of thepipelines is larger than the length N of a single steel pipe or not, and conducting labeling at the position N automatically when the length is larger than N. The pipeline weld joint automatic labeling method has the advantages that (1) one-key labeling is realized quickly; (2) the labeling is accurate; (3) serial numbers can be marked, and pipeline diameter and material information can be pickedup; and (4) the size is accurate, the construction is more facilitated, and the weld joint position needing flaw detection can be quickly found.
Owner:XI'AN UNIVERSITY OF ARCHITECTURE AND TECHNOLOGY

Natural language labeling method based on artificial intelligence and related equipment

The invention relates to the field of artificial intelligence, and discloses a natural language labeling method based on artificial intelligence and related equipment. The method comprises the steps of obtaining a to-be-labeled natural language text; through a pre-trained sequence labeling model, carrying out semantic coarse-grained sequence labeling on the sequence labeling model to obtain a labeling sequence; determining a target word in a natural language text according to the labeling sequence, and determining a target role type of the target word; obtaining a preset template word corresponding to the target role type, and calculating a similarity value between the preset template word and the target role type; and according to the similarity value and a preset sub-category judgment rule, determining a sub-category corresponding to the target word as a target sub-category, and performing semantic fine-grained sequence labeling on the natural language text to obtain a labeled text.In addition, the invention further relates to a blockchain technology, and the natural language text to be labeled and/or the labeled text can be stored in the blockchain. According to the method, thecorpus annotation efficiency for language model training can be improved.
Owner:CHINA PING AN LIFE INSURANCE CO LTD

Target detection box generation method and image data automatic labeling method and system

The invention provides a target detection box generation method and an image data automatic labeling method and system. The generation method comprises the following steps: acquiring a real-time image with a target object, a world coordinate of a center point of the target object under a world coordinate system, a world coordinate of an image collector and attitude information of the image collector; according to the world coordinate of the central point of the target object in the world coordinate system, the world coordinate of the image collector and the attitude information of the image collector, obtaining the pixel coordinate of the central point of the target object through coordinate transformation from the world coordinate system to a pixel coordinate system; and taking the pixel coordinate of the central point of the target object as the central point of a target detection box, and in combination with the calibrated side length or diameter of the target detection frame, generating the target detection box. According to the method, on the premise that human intervention is not needed, automatic generation of the target detection data is truly realized, and the operation process is simplified.
Owner:NAT INNOVATION INST OF DEFENSE TECH PLA ACAD OF MILITARY SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products