Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

101results about How to "Improve labeling accuracy" patented technology

Indoor scene semantic annotation method based on RGB-D data

ActiveCN104809187ASolve the problem of difficult to choose annotation primitives appropriatelyImprove adverse effectsCharacter and pattern recognitionSpecial data processing applicationsNatural language processingRecursion
The invention relates to an indoor scene semantic annotation method based on RGB-D data. According to the method, a coarse-to-fine global recursion feedback semantic annotation framework based on the RGB-D data is built, in addition, the whole semantic annotation framework is divided into two major parts including the coarse-granularity region stage semantic label deduction and fine-granularity pixel stage semantic label refinement. The framework is different from the traditional region stage or pixel stage semantic annotation framework, the framework rebuilds the relationship between the coarse-granularity region stage semantic label deduction and the fine-granularity pixel stage semantic annotation, and a reasonable global recursion feedback mechanism is introduced, so that the coarse-granularity region stage semantic annotation result and the fine-granularity pixel level semantic annotation result realize the alternate iterative updating optimization. Through adopting the mode, the multi-mode information of different region layers in the scene images is better merged, and the general problem that an annotation base element is difficult to be properly selected in the traditional indoor scene semantic annotation scheme is solved to a certain degree.
Owner:NANJING UNIV OF POSTS & TELECOMM

Relation extraction method in combination with clause-level remote supervision and semi-supervised ensemble learning

The invention discloses a relation extraction method in combination with clause-level remote supervision and semi-supervised ensemble learning. The method is specifically implemented by the following steps of 1, aligning a relation triple in a knowledge base to a corpus library through remote supervision, and establishing a relation instance set; 2, removing noise data in the relation instance set by using syntactic analysis-based clause identification; 3, extracting morphological features of relation instances, converting the morphological features into distributed representation vectors, and establishing a feature data set; and 4, selecting all positive example data and a small part of negative example data in the feature data set to form a labeled data set, forming an unlabelled data set by the rest of negative example data after label removal, and training a relation classifier by using a semi-supervised ensemble learning algorithm. According to the method, the relation extraction is carried out in combination with the clause identification, the remote supervision and the semi-supervised ensemble learning; and the method has wide application prospects in the fields of automatic question-answering system establishment, massive information processing, knowledge base automatic establishment, search engines, specific text mining and the like.
Owner:ZHEJIANG UNIV

Corpus text processing method and device and electronic equipment

The invention provides a corpus text processing method and device and electronic equipment. The method comprises the steps of inputting a corpus text set to be processed into a language model, and obtaining feature vectors of corpus texts; performing clustering processing on the corpus text set based on a clustering algorithm and the feature vectors of the corpus texts to obtain corpus classification information; modifying intention category annotation information annotated by the target corpus text to obtain the target corpus text; and adding the target corpus text into the original trainingsample to train a language model to obtain an optimized language model. According to the invention, clustering processing is carried out on the corpus text set through the language model and the clustering algorithm, and the intention category annotation information annotated by the target corpus information in the corpus classification information is corrected to train the language model, so thatthe language model can be iteratively optimized in the use process, the generalization ability of the language model and the clustering algorithm is improved, and the labeling accuracy of the intention category labeling information corresponding to the corpus text is improved.
Owner:NETEASE (HANGZHOU) NETWORK CO LTD

Collaborative filtering-based teaching video labeling method

The invention discloses a collaborative filtering-based teaching video labeling method. The collaborative filtering-based teaching video labeling method mainly solves the shortcoming of the low accuracy of teaching video labeling in the prior art. The method is implemented through the steps of inputting a teaching video and performing caption key frame extraction on the teaching video according to captions to obtain D key frames; performing caption extraction on the D key frames through optical character software and performing text correction and deleting on obtain captions to obtain D text documents; performing shot segmentation on the teaching video by combining the D text documents with a Gibbs sampler to segment the teaching video into M shots; labeling a part of the M shots, computing the cosine similarity between the labeled shots and unlabeled shots through a collaborative filtering method, and selecting five words with the highest cosine similarity to label the unlabeled shots. The collaborative filtering-based teaching video labeling method takes the caption information in the teaching video into consideration, thereby effectively describing the teaching video, improving the labeling accuracy of the teaching video and being applicable to video teaching.
Owner:山西恒奕信源科技有限公司

OCR image sample generation method and device, printed matter verification method and device, equipment and medium

The invention relates to artificial intelligence, and provides an OCR image sample generation method and device, a printing form verification method and device, equipment and a medium, and the methodcomprises the steps: receiving an image generation instruction, and obtaining an image sample; inputting the image sample into a preset font typesetting generation model, obtaining first annotation information by performing text detection and character recognition on the image sample, and obtaining a simulation result generated by reconstruction of the font typesetting generation model; inputtingthe image sample and the simulation image into a preset style synthesis model, extracting style features and content features by the style synthesis model, and generating a synthesis result by the style synthesis model; and obtaining an OCR image sample label, recording the synthesized image as an OCR image sample corresponding to the image sample, and associating the OCR image sample with the OCRimage sample label. In addition, the invention also relates to a blockchain technology, and the information can be stored in the blockchain node. According to the invention, the OCR image sample withthe same texture style as the image sample is automatically generated, and the sample label is automatically labeled.
Owner:PINGAN INT SMART CITY TECH CO LTD

Digital image multi-semantic annotation method based on spatial dependency measurement

The invention belongs to a digital image multi-semantic annotation method which is characterized by comprising the following steps in sequence: (1) inputting a plurality of digital images with known semantemes and all digital images to be annotated into a computer; (2) acquiring a characteristic vector set of all images by extracting characteristics; (3) establishing a mark vector of marked images and a final mark vector set of all images; (4) calculating a Gram matrix of the characteristic vector set; (5) acquiring a measurement value of the dependency degree of the characteristic vector set and the mark vector set by using a spatial dependency measurement method; (6) gradually increasing the dependency measurement value to the maximum in the iterative process, thereby obtaining confidence values that the images to be annotated belong to semantemes; and (7) setting a threshold, and judging the semantemes of the images to be annotated. The digital image multi-semantic annotation method has the advantages that firstly, the annotation effect can be improved by adopting a great number of images which are not semantically annotated, secondly, the method is applicable to the situation of multi-semantic annotation situation, and thirdly, the calculation speed is relatively high.
Owner:HAINAN UNIVERSITY

Iterative construction method and device for military scenario text event extraction corpus

The invention discloses an iterative construction method and device for a military scenario text event extraction corpus. The method comprises the following steps of 1, preprocessing, and obtaining anoriginal data set represented by a word sequence; 2, constructing a seed data set, defining an event template, constructing an event trigger word dictionary, forming the seed data set through manualannotation, and dividing the seed data set into a seed training set and a test set; 3, training a model, training a machine learning model by using the seed training set, testing the model by using the test set, and optimizing the model parameters according to a test result to obtain a first learning model; 4, selecting an unlabeled training corpus, and inputting the unlabeled training corpus intothe first learning model to obtain a prediction result set; 5, correcting the prediction result set to form a new annotation corpus; and 6, through the continuous iteration, generating the training sets in sequence to form the event extraction corpus. According to the iterative construction method for the military scenario text event extraction corpus, the corpus construction efficiency is improved, the manual annotation cost is reduced, and the relatively higher corpus annotation accuracy is obtained.
Owner:NAT UNIV OF DEFENSE TECH

Image classification method and device, electronic equipment and readable storage medium

The invention relates to the field of intelligent decision, and discloses an image classification method, which comprises the following steps: training a pre-constructed first convolutional neural network model by using a first annotated image set to obtain a first image classification model; performing image screening and segmentation processing on the to-be-labeled image set to obtain a segmented image set; performing classification labeling on the segmented image set by using a first image classification model to obtain a second labeled image set; combining the first annotated image set and the second annotated image set to obtain an annotated image set; using the annotation image set to carry out iteration annotation training on a pre-constructed second convolutional neural network model to obtain a target image classification model; and classifying the to-be-classified image by using the target image classification model to obtain a classification result. The invention also relates to a block chain technology, and the annotated image set can be stored in a block chain node. The invention further provides an image classification device, electronic equipment and a storage medium. According to the invention, the accuracy of image classification can be improved.
Owner:深圳赛安特技术服务有限公司

Digital labeling method and device for three-dimensional object

The invention relates to a digital labeling method and device for a three-dimensional object, belongs to the technical field of artificial intelligence, and solves the problems of high difficulty in knowledge labeling of the three-dimensional object and low labeling accuracy in the prior art. The method comprises the following steps: acquiring two-dimensional pictures, pose information and labeling labels of a three-dimensional object at different angles to form a training sample set; constructing an offset matrix and an enhanced angle rotation matrix of the sample based on the pose information in each sample; training the annotation network model by using the training sample set and the corresponding offset matrix and the enhanced angle rotation matrix to obtain an optimized annotation network model; and marking a to-be-marked three-dimensional object by using the optimized marking network model based on the two-dimensional picture and the pose information of the to-be-marked three-dimensional object. According to the method, the three-dimensional object labeling is converted into the two-dimensional labeling, the labeling difficulty is reduced, the association relationship between the labeling and the three-dimensional object is established through the artificial intelligence model, and the labeling accuracy of the three-dimensional object is improved.
Owner:北京京航计算通讯研究所

Text classification method and device and electronic equipment

The invention provides a text classification method and device and electronic equipment. The method comprises the following steps: inputting a to-be-classified text into a trained text classification model, and obtaining a text category of the to-be-classified text; the training mode of the text classification model is as follows: determining a plurality of text categories and an attribute rule of each text category based on the text data of which the statistical frequency is higher than a preset threshold value and / or the text data of which the semantic similarity meets a preset condition; and based on the determined text category and the attribute rule of the text category, labeling a plurality of sample texts, and training the initial model based on the plurality of sample texts carrying labeling information to obtain a text classification model. According to the mode, the text category and the attribute rule of the text category are obtained through manual summarization according to a small amount of selected representative unannotated text data, and then the text is automatically annotated according to the summarized rule, so that the annotated text with relatively high labeling accuracy is obtained; therefore, the classification accuracy of the text classification model obtained by training according to the annotated text is relatively high.
Owner:NETEASE (HANGZHOU) NETWORK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products