Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

53results about How to "No need for manual labeling" patented technology

Certificate photo matting method and system based on end-to-end convolutional neural network

The invention relates to the technical field of image processing, and particularly discloses a certificate photo matting method based on an end-to-end convolutional neural network, which comprises thefollowing steps: synthesizing a foreground image in a matting mask of a certificate portrait photo with an Internet photo to obtain a training image; generating a coarse segmentation Trimap network according to the lightweight semantic segmentation network model; performing refined matting on the coarsely segmented Trimap network according to the encoding and decoding network to obtain a refinedmatting network; cascading the coarsely segmented Trimap network and the refined matting network to obtain an end-to-end network model; and inputting the training image into the end-to-end network model for fine adjustment to obtain a trained end-to-end network model. The invention further discloses a certificate photo matting system based on the end-to-end convolutional neural network, a storagemedium and a processor. According to the certificate photo matting method based on the end-to-end convolutional neural network, the corresponding matting prospect can be automatically obtained for thecertificate photo, manual annotation is not needed, and the matting efficiency is improved.
Owner:TRAFFIC MANAGEMENT RES INST OF THE MIN OF PUBLIC SECURITY

New energy lithium battery surface defect detection method based on adaptive deep learning

The invention discloses a new energy lithium battery surface defect detection method based on adaptive deep learning. The method comprises the following steps: carrying out nonlinear mapping on a lithium battery surface grayscale image; transforming the decoupled irradiation component and reflection component to a frequency domain; performing filtering, inverse Fourier transform and exponential transform on the frequency domain data to obtain a reconstructed lithium battery image; based on morphological processing and background differencing, enhancing gray scale response at the defect; carrying out image segmentation and connected domain analysis and screening processing, and taking a result as a labeled image; designing an operator to simulate illumination details, and carrying out sample enhancement operation on the surface grayscale image of the lithium battery; training a deep convolutional neural network based on the enhanced sample image set and the labeled image; and achievinglithium battery surface defect detection based on the trained network. By utilizing the method, the detection efficiency can be improved and the false detection rate can be reduced in a lithium battery surface defect detection scene.
Owner:芜湖楚睿智能科技有限公司

Recommended question out-of-syllabus detection method and device, electronic equipment and storage medium

The embodiment of the invention provides a recommended question out-of-syllabus detection method and device, electronic equipment and a storage medium. The method comprises the steps of determining anoriginal question text, a recommended question text corresponding to the original question text and a to-be-detected teaching material version; inputting the original question text, the recommended question text and the version of the to-be-detected teaching material version into an out-of-syllabus detection model to obtain an out-of-syllabus detection result output by the out-of-syllabus detection model; wherein the out-of-syllabus detection model is used for analyzing the correlation between the original question text and each chapter under the version of the to-be-detected teaching material and the correlation between the recommended question text and each chapter under the version of the to-be-detected teaching material based on an attention mechanism, and determining an out-of-syllabus detection result based on the correlation. According to the method and device, the electronic equipment and the storage medium provided by the embodiment of the invention, the problem of out-of-syllabus of the question pushing engine is solved, out-of-syllabus detection does not need manual annotation, and the method is simple, convenient, efficient, low in cost and high in accuracy.
Owner:IFLYTEK CO LTD

Audio data processing method and device, medium and computing device

One embodiment of the invention provides an audio data processing method. The audio data processing method includes the steps: acquiring a spectrum corresponding to audio data; dividing the spectrum corresponding to the audio data into a homophonic tone spectrum and a non-homophonic tone spectrum, wherein the frequency in the homophonic tone spectrum is an integer multiple of the frequency of thefundamental tone; and determining the emotional information expressed by the audio data according to the characteristic information of the homophonic tone spectrum and the characteristic information of the non-homophonic tone spectrum. The audio data processing method can accurately identify the emotion expressed by the audio so as to enable a user to search the related audio of expressing the corresponding emotion according to the emotion dimension so as to bring a better experience for the user, by dividing the spectrum corresponding to the audio data into the homophonic tone spectrum and the non-homophonic tone spectrum, and determining the emotion information expressed by the audio data according to the characteristic information of the homophonic tone spectrum and the characteristic information of the non-homophonic tone spectrum. Besides, the embodiment of the invention provides an audio data processing device, a medium and a computing device.
Owner:HANGZHOU NETEASE CLOUD MUSIC TECH CO LTD

Image set establishing method and device and storage medium

The embodiment of the invention discloses an image set establishing method and device and a storage medium. The method comprises the steps of obtaining a single-label image set and a multi-label imageset; converting the contents of the tags into word identifiers according to a semantic network to obtain a word identifier set, a converted single-tag image set and a converted multi-tag image set; constructing a hierarchical semantic structure according to the word identifier set and the semantic network; carrying out label supplement on the images in the converted single-label image set according to the semantic relationship between the word identifiers in the hierarchical semantic structure to obtain a supplemented single-label image set; based on the co-occurrence probability between theword identifier in the supplemented single-label image set and the word identifier in the converted multi-label image set, carrying out label supplementing on the images in the supplemented single-label image set to obtain a final supplemented image set; and establishing a target multi-label image set according to the finally supplemented image set and the converted multi-label image set. According to the scheme, the establishment efficiency of the large-scale multi-label image set and the label labeling quality can be improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD

High-generalization cross-domain road scene semantic segmentation method and system

The invention discloses a high-generalization cross-domain road scene semantic segmentation method. The method comprises the following steps: generating a virtual image and a corresponding label through a game engine; generating a global/local texture migration image by using the virtual image; sending the virtual image and the global/local texture migration image to a neural network for training; performing consistency constraint on the global/local texture migration image trained by the neural network; calculating loss values of the virtual image trained by the neural network and the local texture migration image and the local texture migration image subjected to consistency constraint with the labels respectively, and training a semantic segmentation model according to the loss values; and performing semantic segmentation by using the trained semantic segmentation model. According to the method, data enhancement is realized through global texture migration and local texture migration of a virtual image, a neural network is attacked, and a model is forced to learn cross-domain invariant shape information; moreover, the method only carries out network training in the source domain, achieves a reliable cross-domain segmentation effect, and has very high generalization performance.
Owner:SICHUAN UNIV

Audio data processing method and device, medium and computing device

One embodiment of the invention provides an audio data processing method. The audio data processing method includes the steps: acquiring a spectrum corresponding to audio data; dividing the spectrum corresponding to the audio data into a homophonic tone spectrum and a non-homophonic tone spectrum, wherein the frequency in the homophonic tone spectrum is an integer multiple of the frequency of thefundamental tone; and determining the emotional information expressed by the audio data according to the characteristic information of the homophonic tone spectrum and the characteristic information of the non-homophonic tone spectrum. The audio data processing method can accurately identify the emotion expressed by the audio so as to enable a user to search the related audio of expressing the corresponding emotion according to the emotion dimension so as to bring a better experience for the user, by dividing the spectrum corresponding to the audio data into the homophonic tone spectrum and the non-homophonic tone spectrum, and determining the emotion information expressed by the audio data according to the characteristic information of the homophonic tone spectrum and the characteristic information of the non-homophonic tone spectrum. Besides, the embodiment of the invention provides an audio data processing device, a medium and a computing device.
Owner:HANGZHOU NETEASE CLOUD MUSIC TECH CO LTD

A knowledge map construction method based on trusted web resources

The invention discloses a knowledge graph construction method based on credible webpage resources, and the method comprises the steps: obtaining a link in reference data of each encyclopedia page, andrecording a theme in the corresponding encyclopedia page; crawling a text in a linkedwebpage and the link in the webpage, and adding the link in the webpage into a queue to determine a training set of atheme model; training an LDA model according to the crawled webpage text and theme label; crawling the web pages in the queue, adding links in the web pages into the queue, directly outputting theweb pages as knowledge extraction when the crawled links of the web pages belong to one-hop links, otherwise, calculating topic distribution of documents of the web pages by using theLDA model and clustering; respectively calculating a TrustRank value of each webpage for each clustering cluster; selecting a webpage for credibility labeling, and training a knowledge source identification model in combination with features; and obtaining new to-be-identified webpages in batches, calculating features, and outputting the webpages for knowledge extraction when the webpages are identified as knowledge sources. The knowledge graph constructed by the method is more comprehensive and higher in quality.
Owner:中国科学院电子学研究所苏州研究院

Intelligent speech recognition method and device based on classification identification, and related equipment

The invention discloses an intelligent speech recognition method and device based on classification identification and related equipment, and the method comprises the steps: carrying out the endpoint detection and feature extraction of original speech data, so as to obtain speech feature data; identifying the industry attribute of the original voice data through the attached industry attribute identifier when the original voice data is input; recognizing the voice feature data through a trained universal voice recognition model to obtain a recognition result; judging whether the confidence coefficient in the recognition result is lower than a preset value or not; inputting the corresponding text of which the confidence coefficient is lower than a preset value into the corresponding industry vertical module for optimization identification; and selecting the text corresponding to the maximum confidence coefficient as a final recognition result according to the confidence coefficient in the recognition result and the optimized confidence coefficient. The industry attribute identifier is attached when the original voice data is input, so that a corresponding industry vertical module can be conveniently called subsequently to carry out optimization identification; manual labeling is not needed, and high accuracy is maintained through secondary verification, recognition and correction.
Owner:E SURFING IOT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products