Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

258 results about "Cepstrum coefficients" patented technology

The cepstral coefficients are the coefficients of the Fourier transform representation of the logarithm magnitude spectrum.

Method for identifying local discharge signals of switchboard based on support vector machine model

The invention discloses a method for identifying local discharge signals of a switchboard based on a support vector machine model. The method comprises a model training process and an audio identifying process, and particularly comprises the following steps of: preprocessing audio signals; extracting effective audios according to short-time energy and a zero-crossing rate; segmenting the effective audios and extracting characteristic parameters such as Mel cepstrum coefficients, first order difference Mel cepstrum coefficients, high zero-crossing rate and the like of each segment of the audios; training a sample set by using a support vector machine tool, and establishing a corresponding support vector machine model; after preprocessing audio signals to be identified and extracting and segmenting the effective audios, classifying and identifying segment-characteristic-based samples to be tested according to the support vector machine model; and post-processing classification results, and judging whether partial discharge signals exist. By using the method, the existence of the partial discharge signals of the switchboard is accurately identified, the happening of major accidents involving electricity is prevented and avoided, economic losses caused by insulation accidents are reduced, and the power distribution reliability is improved.
Owner:SOUTH CHINA UNIV OF TECH

A Robust Speech Feature Extraction Method Based on Sparse Decomposition and Reconstruction

The invention discloses a robust speech characteristic extraction method based on sparse decomposition and reconfiguration, relating to a robust speech characteristic extraction method with sparse decomposition and reconfiguration. The robust speech characteristic extraction method solves the problems that 1, the selection of an atomic dictionary has higher the time complexity and is difficult tomeet the sparsity after signal projection; 2, the sparse decomposition of signals has less consideration for time relativity of speech signals and noise signals; and 3, the signal reconfiguration ignores the prior probability of atoms and mutual transformation of all the atoms. The robust speech characteristic extraction method comprises the following detailed steps of: step 1, preprocessing; step 2, conducting discrete Fourier transform and solving a power spectrum; step 3, training and storing the atom dictionary; step 4, conducting sparse decomposition; step 5, reconfiguring the speech spectrum; step 6, adding a Mel triangular filter and taking the logarithm; and step 7, obtaining sparse splicing of Mel cepstrum coefficients and a Mel cepstrum to form the robust characteristic. The robust speech characteristic extraction method is used for the fields of multimedia information processing.
Owner:哈尔滨工业大学高新技术开发总公司

Optimal codebook design method for voiceprint recognition system based on nerve network

The invention relates to an optimal codebook design method for a voiceprint recognition system based on a nerve network. The optimal codebook design method comprises following five steps: voice signal input, voice signal pretreatment, voice signal characteristic parameter extraction, three-way initial codebook generation and nerve network training as well as optimal codebook selection; MFCC (Mel Frequency Cepstrum Coefficient) and LPCC (Linear Prediction Cepstrum Coefficient) parameters are extracted at the same time after pretreatment; then a local optimal vector quantization method and a global optimal genetic algorithm are adopted to realize that a hybrid phonetic feature parameter matrix generates initial codebooks through three-way parallel algorithms based on VQ, GA and VQ as well as GA; and the optimal codebook is selected by judging the nerve network recognition accuracy rate of the three-way codebooks. The optimal codebook design method achieves the remarkable effects as follows: the optimal codebook is utilized to lead the voiceprint recognition system to obtain higher recognition rate and higher stability, and the adaptivity of the system is improved; and compared with the mode recognition based on a single codebook, the performance is improved obviously by adopting the voiceprint recognition system of the optimal codebook based on the nerve network.
Owner:CHONGQING UNIV

Recording device clustering method based on Gaussian mean super vectors and spectral clustering

InactiveCN106952643AEffectively describe the difference in characteristicsSpeech recognitionSpecial data processing applicationsDevice typeMean vector
The invention provides a recording device clustering method based on Gaussian mean super vectors and spectral clustering. The method comprises the steps that the Melch frequency cepstrum coefficient MFCC characteristic which characterizes the recording device characteristic is extracted from a speech sample; the MFCC characteristics of all speech samples are used as input, and a common background model UBM is trained through an expectation maximization EM algorithm; the MFCC characteristic of each speech sample is used as input, and UBM parameters are updated through a maximum posteriori probability MAP algorithm to acquire the Gaussian mixture model GMM of each speech sample; the mean vector of all Gaussian components of each GMM is spliced in turn to form a Gaussian mean super vector; a spectral clustering algorithm is used to cluster the Gaussian mean super vectors of all speech samples; the number of recording devices is estimated; and the speech samples of the same recording device are merged. According to the invention, the speech samples collected by the same recording device can be found out without knowing the prior knowledge of the type, the number and the like of the recording devices, and the application scope of the method is wide.
Owner:SOUTH CHINA UNIV OF TECH

Music recommendation method based on similarities

The invention discloses a music similarity detection method based on mixed characteristics and a Gaussian mixed model. According to the basic thought, the method comprises the steps of using a gamma-tone cepstrum coefficient for conducting similarity detection, and using weighting similarities of various characteristics as a final detection result; providing a modulation spectrum characteristic based on a frame shaft, using the characteristic for representing a music long-time characteristic, and using the combination of the long-time characteristic and a short-time characteristic as the input of modeling in the next step; using the Gaussian mixed model for conducting modeling on the music characteristics, firstly, utilizing a dynamic K mean value method for conducting initialization on the model, then, using an expectation-maximization algorithm for conducting model training, obtaining accurate model parameters, and finally using a log-likelihood ratio algorithm for obtaining the similarities between the pieces of music. According to the music similarity detection method, the obtaining of the music characteristics is more sufficient and thorough, the accuracy degree of music recommendation is improved, the characteristic vector dimensionality can be reduced, the information memory content of a music database is reduced, and the accuracy degree of the music recommendation is improved.
Owner:DALIAN UNIV OF TECH

Voice signal characteristics extracting method based on tensor decomposition

The invention discloses a voice signal characteristics extracting method based on tensor decomposition and belongs to the technical field of voice signal processing. The voice signal characteristics extracting method based on the tensor decomposition comprises the following steps: having multi-layer wavelet decomposition to voice signals after framing, respectively extracting MR frequency cepstrum coefficients, the corresponding first order difference coefficient and second order difference coefficient from a plurality of component information after the wavelet decomposition to form a characteristic parameter vector, establishing a third order voice tensor and having tensor decomposition to the third order voice tensor, and calculating component information order and characteristic projection of characteristic parameters. Marticulated results are characteristics carried by each frame of voice signals. Compare with the traditional characteristic parameters, the voice signal characteristics extracting method based on the tensor decomposition has the advantages of enhancing representational ability to the voice signals, acquiring characteristics which carries more comprehensive voice signals, and improving the effects of voice signal processing systems such as voice identifying signal processing system, speaker identifying signal processing system.
Owner:INNER MONGOLIA UNIV OF SCI & TECH +1

Underwater acoustic signal target classification and recognition method based on deep learning

The invention belongs to the technical field of underwater acoustic signal processing, and particularly relates to an underwater acoustic signal target classification and recognition method based on deep learning. The method comprises the following steps: (1) carrying out feature extraction on an original underwater acoustic signal through a Gammatone filtering cepstrum coefficient (GFCC) algorithm; (2) extracting instantaneous energy and instantaneous frequency by utilizing an improved empirical mode decomposition (MEMD) algorithm, fusing the instantaneous energy and the instantaneous frequency with characteristic values extracted by a GFCC algorithm, and constructing a characteristic matrix; (3) establishing a Gaussian mixture model GMM, and keeping the individual characteristics of theunderwater acoustic signal target; And (4) finishing underwater target classification and recognition by using a deep neural network (DNN). According to the underwater acoustic signal target classification and recognition method, the problems that a traditional underwater acoustic signal target classification and recognition method is single in feature extraction and poor in noise resistance can be solved, the underwater acoustic signal target classification and recognition accuracy can be effectively improved, and certain adaptability is still achieved under the conditions of weak target acoustic signals, long distance and the like.
Owner:HARBIN ENG UNIV

Noise diagnosis algorithm for rolling bearing faults of rotary equipment

The invention discloses a noise diagnosis algorithm for rolling bearing faults of rotary equipment. Firstly, a sound pick-up device collects running noise signals of a rolling bearing, and the signalsare subjected to preliminary fault judgment through a bearing normality and anomaly pre-classification model based on an anomaly detection algorithm; secondly, according to a fault pre-judgment result, the abnormal signals (the faults occur) pass through a neural network filter to filter normal components in the signals of the bearing, the output net abnormal signals are connected to a subsequentfeature extraction module, and the normal signals (no faults occur) are directly connected to the feature extraction module; the feature extraction module extracts Mel-cepstrum coefficients (MFCC) ofthe signals to serve as eigenvectors, feature reconstruction is carried out by utilizing a gradient boosted decision tree (GBDT) to form composite eigenvectors, and principal component analysis (PCA)is used for carrying out dimensionality reduction on features; and finally, feature signals are input into an improved two-stage support vector machine (SVM) ensemble classifier for training and testing, and at last, high-accuracy fault type diagnosis is achieved. According to the algorithm, the bearing faults can be effectively detected and relatively high fault identification accuracy is kept;and the algorithm has relatively high effectiveness and robustness for detection and classification of the bearing faults.
Owner:CHINA UNIV OF MINING & TECH

Speech recognition method and system

The invention belongs to the speech recognition technical filed and relates to a speech recognition method and system. The method includes the following steps that: speech signals are acquired; analog-digital conversion is performed on the speech signals, so that corresponding speech digital signals can be generated; preprocessing is performed on the speech digital signals, and speech feature parameters are extracted according to corresponding preprocessing results, and a time sequence of extracting the speech feature parameters is utilized to construct a corresponding feature sequence; the speech feature parameters are matched with speech models in a template library, and the feature sequence is decoded according to a search algorithm, and therefore, a corresponding recognition result can be generated. According to the speech recognition method and system of the invention, time-domain GFCC (gammatone frequency cepstrum coefficient) features are extracted to replace frequency-domain MFCC (mel frequency cepstrum coefficient) features, and DCT conversion is adopted, and therefore, computation quantity can be reduced, and computation speed and robustness can be improved; and the mechanism of weighted finite state transformation is adopted to construct a decoding model, and smoothing and compression processing of the model is additionally adopted, and therefore, decoding speed can be increased.
Owner:徐洋

Rolling bearing fault classification method and system based on spectral kurtosis and neural network

ActiveCN110017991AFilter out noise interferenceExtract fault signal componentsMachine part testingClassification methodsConvolutional neural network
The invention provides a rolling bearing fault classification method and system based on spectral kurtosis and neural network. The method comprises the following steps: filtering the bearing fault signals based on the spectral kurtosis; extracting Mel cepstrum coefficient characteristics and differential characteristics of the filtered bearing fault signals to obtain a Mel cepstrum coefficient characteristic set and a differential characteristic set; randomly extracting a plurality of characteristics from the Mel cepstrum coefficient characteristic set and the differential characteristic set respectively, and sequentially arranging the characteristics according to the extraction sequence to form a Mel cepstrum coefficient characteristic diagram and a differential characteristic diagram which are represented by a two-dimensional matrix with preset size, so as to form a training set; inputting the Mel cepstrum coefficient characteristic diagram and the differential characteristic diagramin the training set into corresponding channels of a double-channel convolutional neural network for training to obtain a rolling bearing fault classification model; and carrying out fault classification on the bearing fault signals received in real time by using the rolling bearing fault classification model.
Owner:SHANDONG UNIV

Alzheimer's disease preliminary screening method based on speech feature non-negative matrix decomposition

InactiveCN108198576ACharacterize the difference in characteristicsThe result is validSpeech analysisSupport vector machine classifierScreening method
The invention discloses an Alzheimer's disease preliminary screening method based on speech feature non-negative matrix decomposition. The Alzheimer's disease preliminary screening method includes thefollowing steps: extracting acoustic features including fundamental frequency, energy, harmonic-to-noise ratios, formants, glottal waves, linear prediction coefficients, and constant Q cepstrum coefficients, from speech samples of Alzheimer's patients and normal humans, and splicing the features into a feature matrix; using the non-negative matrix decomposition algorithm to decompose the featurematrix, and obtaining the dimensionality-reduced feature matrix; using the dimensionality-reduced feature matrix as an input, and training a support vector machine classifier; and inputting the dimensionality-reduced feature matrix of a test speech sample into the trained support vector machine classifier, and determining whether the test speech is speech of normal humans or speech of Alzheimer'spatients. The invention adopts non-negative matrix decomposition to perform dimensionality reduction transformation on high-dimensional input acoustic features, the dimensionality-reduced feature matrix has better discrimination, and the method can obtain more excellent effects in Alzheimer's disease preliminary screening.
Owner:SOUTH CHINA UNIV OF TECH

Judgment method of ultra-high voltage equipment local discharge detection data

The invention discloses a judgment method of ultra-high voltage equipment local discharge detection data. The judgment method comprises the following steps: sampling a continuous ultrasonic frequencysignal to reduce to the continuous sound wave frequency signal capable of being heard by the human ear; continuously intercepting a frame sound wave frequency signal with a set time length; extractinga Mayer frequency cepstrum coefficient of the frame sound wave frequency signal as a to-be-identified fault discharge feature; sending the extracted to-be-identified fault discharge feature into a CNN convolution neural network, enabling the to-be-identified fault discharge feature to enter a fault classifier of a CNN convolution neural network output classification layer through CNN convolutionneural network analysis, wherein the CNN convolution neural network identifies the to-be-identified fault discharge feature and outputs the to-be-identified fault discharge feature according to the fault classifier formed by learning the known fault discharge feature in advance. The mode learning and identification are performed on the fault type by directly using the convolution neural network CNN, the identification accuracy rate is improved, and the manual intervention is reduced or avoided.
Owner:STATE GRID CORP OF CHINA +1

Individualized song recommending system based on vocal music characteristics

The embodiment of the invention discloses an individualized song recommending system based on vocal music characteristics. The method includes the following steps of extracting characteristics, wherein the voice register characteristics, speed characteristics and tone characteristics of singing data are extracted, the voice register characteristics include absolute voice register and relative voice register, the speed characteristics include the number of beats per minute, and the tone characteristics include a Gaussian mixture model of Mel frequency cepstrum coefficient training; recommending songs systematically, wherein a corresponding song in a music library is found through a key sound matching algorithm according to a snatch sung by a user, and voice register conformity detection, song conformity detection and singer conformity detection are conducted. Singer recommending and song recommending are conducted according to extracted user characteristics. By means of the method, whether a current song is suitable for being sung by the user or not can be evaluated, and singers matched with the user vocal music ability and songs suitable for being sung by the user are further recommended. From the perspective of user singing, the traditional music recommending range is expanded, and higher practical value is achieved.
Owner:BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products