Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

399results about "Audio data clustering/classification" patented technology

Chinese song emotion classification method based on multi-modal fusion

The invention discloses a Chinese song emotion classification method based on multi-modal fusion. The Chinese song emotion classification method comprises the steps: firstly obtaining a spectrogram from an audio signal, extracting audio low-level features, and then carrying out the audio feature learning based on an LLD-CRNN model, thereby obtaining the audio features of a Chinese song; for lyricsand comment information, firstly constructing a music emotion dictionary, then constructing emotion vectors based on emotion intensity and part-of-speech on the basis of the dictionary, so that textfeatures of Chinese songs are obtained; and finally, performing multi-modal fusion by using a decision fusion method and a feature fusion method to obtain emotion categories of the Chinese songs. TheChinese song emotion classification method is based on an LLD-CRNN music emotion classification model, and the model uses a spectrogram and audio low-level features as an input sequence. The LLD is concentrated in a time domain or a frequency domain, and for the audio signal with associated change of time and frequency characteristics, the spectrogram is a two-dimensional representation of the audio signal in frequency, and loss of information amount is less, so that information complementation of the LLD and the spectrogram can be realized.
Owner:BEIJING UNIV OF TECH

A twin network model training method, a twin network model measuring method, a twin network model training device, a twin network model measuring device, a medium and equipment

The invention relates to a twin network model training method, a twin network model measuring method, a twin network model training device, a twin network model measuring device, a medium and equipment. The method comprises the steps of pre-training a label classification model; then constructing a twin network model by using the trained label classification model in a mode of increasing coding neural network branches; therefore, a twin network model used for data similarity measurement in a recommendation system can be obtained through training based on a multi-task learning mode including label classification learning and measurement learning. Through a mode of staged training and multi-task learning constraint, the stability and generalization ability of the model can be effectively improved, and the accuracy of the trained twin network model for data similarity measurement in the recommendation system is improved. Furthermore, data similarity measurement can be carried out based on the trained twin network model, and the accuracy of data similarity measurement is effectively improved. And the trained twin network model is used for song similarity measurement, so that the accuracy of song similarity measurement can be effectively improved.
Owner:HANGZHOU NETEASE CLOUD MUSIC TECH CO LTD

Image classification method and device, electronic equipment and storage medium

The invention relates to an image classification method and device, electronic equipment and a storage medium, relates to the technical field of computers, and is used for solving the problem of relatively low accuracy of an image classification technology in related technologies, and the method comprises the steps: classifying images in a to-be-recognized data set, and determining category labelsof the images in the to-be-recognized data set; extracting text features of each image and text features of the category label of each image, wherein the text features of the images are used for representing the state of an object in the images; determining the matching degree of each image and the corresponding category label according to the text feature of each image and the text feature of the category label of the corresponding image; and determining a target image corresponding to the category label from the images corresponding to the same category label according to the determined matching degree. According to the embodiment of the invention, after the images are classified, the images of the same class label are further screened according to the matching degree of the state of the object in the image and the class label of the image, so that the classification accuracy is improved.
Owner:BEIJING DAJIA INTERNET INFORMATION TECH CO LTD

Music style classification method and device, computer device and storage medium

PendingCN110188235AFast classificationAddressing the Limitations of Manual ClassificationNeural architecturesSpecial data processing applicationsShock waveFrequency spectrum
The invention discloses a music style classification method and device, a computer device and a storage medium. The method comprises: acquiring a data set; preprocessing audio in the data set, and inputting the preprocessed audio into a preset deep convolutional neural network for training to obtain a trained network model; preprocessing the to-be-classified audio, and inputting the to-be-classified audio into the network model to obtain a music style recognition result of the to-be-classified audio; wherein the preprocessing comprises the step of separating a harmonic sound source and a shockwave sound source of the processed audio; and converting the original sound source, the harmonic sound source and the shock wave sound source of the processed audio into spectrograms. According to the music style classification method, the computer device and the storage medium provided by the invention, the audio is converted into the spectrogram, the deep convolutional neural network is trainedby using the spectrogram, and the to-be-classified audio frequency is classified and identified by using the trained network model, so that the high-precision classification of the audio frequency can be successfully realized, the classification speed is high, and the limitation of manual classification is solved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Multimedia resource classification method, apparatus, computer device, and storage medium

The invention discloses a multimedia resource classification method, a device, a computer device and a storage medium, belonging to the computer technical field. The method comprises the following steps of: acquiring multimedia resources and extracting a plurality of characteristic information of the multimedia resources; Clustering a plurality of feature information to obtain at least one clustering set, determining clustering description information of each clustering set, each clustering set comprising at least one feature information, and each clustering description information for indicating a feature of a clustering set; Determining at least one target feature description information of the multimedia resource based on the clustering description information of each clustering set, each target feature description information representing an association between one clustering description information and the rest of the clustering description information; The multimedia resources are classified based on at least one target feature description information of the multimedia resources, and the classification result of the multimedia resources is obtained. By adopting the invention,the accuracy of multimedia resource classification can be improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Control method for scene sound effect and electronic equipment

Embodiments of the invention disclose a control method for a scene sound effect and electronic equipment. The method comprises the following steps that the electronic equipment starts a service with a monitoring function after the electronic equipment is started; the electronic equipment monitors an audio track of the electronic equipment through the service with the monitoring function and determines whether the audio track of the electronic equipment has audio output; the audio track of the electronic equipment and an application in the electronic equipment have a mapping relation; if the electronic equipment determines that the audio track of the electronic equipment has audio output, the electronic equipment determines an application having the mapping relation with the audio track of the electronic equipment according to the mapping relation; and the electronic equipment obtains the scene sound effect corresponding to the application and sets the current sound effect of the electronic equipment to the scene sound effect. The process does not need a person to participate in setting of the scene sound effect, so that the operations are simplified and the using efficiency of the electronic equipment is improved on the premise of guaranteeing the higher accuracy rate of the scene sound effect.
Owner:GUANGDONG OPPO MOBILE TELECOMM CORP LTD

Audio representation learning method based on multilayer time sequence pooling

The invention provides an audio representation learning method based on multilayer time sequence pooling, and belongs to the technical field of audio classification. The method comprises the followingsteps: firstly, extracting spectral characteristics of each audio sample in a training set and a to-be-represented audio, segmenting the spectral characteristics into fragments with equal lengths, training a CNN network by utilizing a fragment-level time-frequency characteristic set of the training set, and then extracting fragment-level characteristic representation of the to-be-represented audio by utilizing the trained CNN network; taking the extracted fragment-level feature representation of the to-be-represented audio as the input of a multi-layer time sequence pooling network, sequentially performing nonlinear feature mapping and time sequence coding operation on the input data by each time sequence pooling layer of the multi-layer time sequence pooling network, and finally outputting a representation vector of the to-be-represented audio. According to the invention, the problem of lack of a feature representation technology capable of flexibly and efficiently capturing the timesequence dynamic information of the audio sample of any time length in the prior art is solved. The method can be used for robust and advanced audio representation.
Owner:HARBIN INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products