Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

33results about How to "Improve hearing quality" patented technology

Prior signal-to-noise ratio estimating method based on MMSE error criterion

The invention discloses a prior signal-to-noise ratio estimating method based on an MMSE error criterion and used for voice enhancement, and belongs to the technical field of voice signal processing. Aimed at the prior signal-to-noise ratio estimating problem in the voice enhancement technology, the method comprises the steps of: firstly carrying out preliminary estimation on a prior signal-to-noise ratio of noised voices based on the MMSE error criterion, carrying out Wiener filtering calculation on an obtained prior signal-to-noise ratio estimated value to obtain a first system gain factor, carrying out calculation on the first system gain factor an amplitude spectrum value of the noised voices to obtain a voice power spectrum estimated value, then utilizing the obtained voice power spectrum estimated value and a power spectrum estimated value of noise to carry out estimation once again, and obtaining a final prior signal-to-noise ratio estimated value. The prior signal-to-noise ratio estimated value is substituted into a subsequent voice enhancing step for processing, and de-noised estimated voice clearing signals are obtained. The prior signal-to-noise ratio estimating method based on the MMSE error criterion can effectively inhibit background noise components in estimated cleared voices, and excessive damages to the cleared voice components are avoided, so that the hearing quality of the estimated cleared voices is improved, and the performance of a voice enhancement algorithm is improved.
Owner:SYSU CMU SHUNDE INT JOINT RES INST +1

Method for forecasting bandwidth expansion frequency band signal and decoding device

ActiveCN103971694AGuaranteed hearing qualityImprove hearing qualitySpeech analysisBandwidth extensionSignal on
Provided in embodiments of the present invention are a prediction method and decoding device for a bandwidth expansion frequency band signal. The method comprises: demultiplexing a received bitstream to acquire a frequency domain signal; determining whether or not the highest frequency point having bits allocated of the frequency domain signal is less than a predetermined starting frequency of a bandwidth expansion frequency band; if yes, predicting an excitation signal of the bandwidth expansion frequency band on the basis of excitation signals in a predetermined frequency band range of the frequency domain signal and of the predetermined starting frequency point of the bandwidth expansion frequency band; otherwise, predicting the excitation signal of the bandwidth expansion frequency band on the basis of the excitation signals in the predetermined frequency band range of the frequency domain signal, of the predetermined starting frequency point of the bandwidth expansion frequency band, and of the highest frequency point having bits allocated; and, predicting a bandwidth expansion frequency band signal on the basis of the predicted excitation signal of the bandwidth expansion frequency band and of a frequency domain envelope of the bandwidth expansion frequency band. The technical solution of embodiments of the present invention is capable of effectively ensuring between preceding and subsequent frames the continuity of the excitation signals predicted for the bandwidth expansion frequency band signal, thus ensuring the auditory quality of a restored bandwidth expansion frequency band signal.
Owner:CRYSTAL CLEAR CODEC LLC

Bridge connection computing method of digital teleconference

The invention discloses a bridge connection computing method of a digital teleconference, which is characterized in that: a time delay vibrating processing mechanism is adopted for modifying a multi-sectional code stream; VAD voice activity detection which combines single frame detection with long time window detection and sample rate matching algorithm are used for reducing the invalid number of channels which enter bridge connection algorithm and reducing computing false rate; and short-term amplitude computing and funnel audio mixing computing are used for reducing operand. The invention has the benefit effects as follows: 1) the method adapts to large time delay vibrating under the condition of IPNET, can offer multi-sectional code stream modification, offer continuous and homogeneous voice code stream for terminals, and improve the audio quality of the decoded voice; 2) the adoption of the VAD voice activity detection and the sample rate matching algorithm can reduce the invalid number of channels which enter bridge connection algorithm and reduce computing false rate of the bridge connection; and 3) the adoption of short-term amplitude computing method and funnel mixing computing method can greatly reduce the operand, avoid bridge connection misjudgment caused by innovation shocks and improve the audio mixing quality of the bridge connection.
Owner:CHONGQING JINMEI COMM

Digital audio camouflage and reconstruction method based on segmented sequences

The present invention discloses a digital audio camouflage and reconstruction method based on segmented sequences. The method comprises the steps of taking sub segment sequences obtained by dividing a published audio and a secret audio as a secret sequence and a published sequence, adding random perturbations to the sequences with the same element value, directly carrying out least squares matching on an equidistant transformation sequence obtained by the rotate right of the secret sequence and a corresponding position published sequence, finding a rotate right step length with a minimum residual error and a matched parameter, and thus camouflaging the secret audio as the published audio and further reconstructing the camouflaged audio as the secret audio. The method is easy to realize, since only the corresponding position equidistant transformation sequence matching is carried out, while the encoding time is reduced, the equidistant transformation number is raised, the matching precision is improved, the detail loss and overflow problems brought by the deformation from a smooth block to a complex block and the restoration from the smooth block to a complex texture block can be effectively avoided, and thus the auditory quality of camouflaged and reconstructed audios is improved further.
Owner:SHAANXI NORMAL UNIV

Bone conduction speech enhancement method based on differential operation and joint dictionary learning

PendingCN112185405AImprove hearing qualityEasy to reveal similarities and differencesSpeech analysisTime domainDictionary learning
The invention provides a bone conduction speech enhancement method based on differential operation and joint dictionary learning. In the training stage, in an indoor noise-free environment, a double-microphone array composed of bone conduction microphones and air conduction microphones is used for synchronously collecting training voices; short-time Fourier transform is performed on training signals of the bone conduction speech and the air conduction speech to obtain time-frequency spectrum amplitudes, and differential time-frequency spectrum amplitudes of the time-frequency spectrum amplitudes are calculated; and a joint speech dictionary of the bone conduction speech time-frequency spectrum amplitude and the differential time-frequency spectrum amplitude is learned on the time-frequencyspectrum. And at a detection stage, short-time Fourier transform is performed on the bone conduction speech to obtain a time-frequency spectrum amplitude and a phase, the is projected amplitude on abone conduction speech sub-dictionary of the joint speech dictionary, and a differential speech time-frequency spectrum amplitude is reconstructed by using an obtained optimal sparse representation coefficient and a differential time-frequency spectrum amplitude sub-dictionary of the joint speech dictionary. A bone conduction voice time-frequency spectrum is compensated and finally short-time inverse Fourier transform is performed to obtain an enhanced bone conduction voice time-domain signal.
Owner:UNIV OF SCI & TECH OF CHINA

Quantification noise reducing method and device

The application discloses a quantification noise reducing method and device. After an original digital signal is subjected to framing operation, digital signal sampling points with a maximum amplitude are found in two continuous frames of signals which are respectively a current frame and a next frame, and maximum object digital gain which prevents the maximum amplitude from overflowing is obtained via calculation; object sampling points having minimum own energy and adjacent energy in the current frame are used as gain switching points; according to a gain switching position and a gain switching step number, original digital gain is gradually switched to object digital gain at the gain switching points, and analog gain is correspondingly adjusted. According to the quantification noise reducing method and device, a plurality of object sampling points with the maximum energy are used as the gain switching points; thus, compared with technologies of the prior art, the quantification noise reducing method and device effectively optimize selection of the gain switching points, the gain switching points are enabled to have the minimum adjacent energy, switching noise is lowered to the minimum, and therefore quantification noise is lowered to the minimum and audio quality of signals can be improved.
Owner:SPREADTRUM COMM (SHANGHAI) CO LTD

Bone conduction speech enhancement method based on joint dictionary learning and sparse representation

The invention provides a bone conduction speech enhancement method based on joint dictionary learning and sparse representation. In a training stage, in an indoor noise-free environment, a special-shaped double-microphone array composed of a bone conduction microphone and an air conduction microphone is used for synchronously collecting training speech, and a joint training set of the bone conduction speech and the air conduction speech is constructed; and short-time inverse Fourier transform is performed on the training signals of the bone conduction speech and the air conduction speech to obtain a time-frequency spectrum amplitude, and a joint speech dictionary of the bone conduction speech and the air conduction speech is learnt on a time-frequency spectrum. In a detection stage, short-time Fourier transform is performed on the bone conduction speech to obtain a time-frequency spectrum amplitude and a phase; the amplitude is projected on a bone conduction speech sub-dictionary of the joint speech dictionary; the air-guided speech time-frequency spectrum amplitude is reconstructed by using the obtained sparse representation coefficient and the air-guided speech sub-dictionary ofthe joint speech dictionary, two methods are provided for enhancing the time-frequency spectrum of the bone conduction speech, and finally short-time inverse Fourier transform is performed to obtain an enhanced bone conduction speech time-domain signal, so that the speech sharpness is improved.
Owner:UNIV OF SCI & TECH OF CHINA

A method for indoor reverberation elimination

InactiveCN103413547BEnhanced Harmonic StructureEliminate Harmonic StructuresSpeech analysisSound producing devicesComputer moduleSelf adaptive
The invention relates to a method for eliminating indoor reverberations, and belongs to the technical field of signal processing. The method relates to a later period reverberation power spectrum estimation module, a spectrum subtraction module, a voice / voice-free detection module, an energy decrement module and a self-adaptation spectrum line enhancement module. Input of the later period reverberation power spectrum estimation module is the reverberation voice, output of the later period reverberation power spectrum estimation module is connected with the spectrum subtraction module, input of the spectrum subtraction module is the reverberation voice and the output of the later period reverberation power spectrum estimation module, output of the spectrum subtraction module is connected with the voice / voice-free detection module, output of the voice / voice-free detection module controls the output of the spectrum subtraction module, and the output of the voice / voice-free detection module controls the output of the spectrum subtraction module to be selectively connected with the energy decrement module or the self-adaptation spectrum line enhancement module. The energy decrement module or the self-adaptation spectrum line enhancement module outputs the final enhanced voice.
Owner:DALIAN UNIV OF TECH

Bridge Operation Method for Digital Telephone Conference

The invention discloses a bridge operation method for a digital telephone conference, which is characterized in that: a time delay and jitter processing mechanism is used to shape the code flow of multiple network segments; a VAD voice activation detection and a combination of single frame detection and long time window detection are adopted The sampling rate matching algorithm reduces the number of invalid channels entering the bridge operation and reduces the misjudgment rate of the operation; the short-term amplitude operation and funnel mixing operation are used to reduce the amount of calculation. The beneficial technical effects of the present invention are: 1) adapting to the large time delay jitter in the packet network environment, and providing code stream shaping of multiple network segments, providing continuous and uniform voice code streams for terminals, and improving the audio quality after decoding. 2) The VAD voice activation detection and sampling rate matching algorithm is adopted to reduce the number of invalid channels entering the bridge operation and reduce the misjudgment rate of the bridge operation. 3) The short-term amplitude calculation and funnel mixing method are adopted to greatly reduce the amount of calculation, avoid bridge misjudgment caused by impact interference, and improve the quality of bridge mixing.
Owner:CHONGQING JINMEI COMM

Digital Audio Camouflage and Reconstruction Method Based on Segmentation Sequence

The present invention discloses a digital audio camouflage and reconstruction method based on segmented sequences. The method comprises the steps of taking sub segment sequences obtained by dividing a published audio and a secret audio as a secret sequence and a published sequence, adding random perturbations to the sequences with the same element value, directly carrying out least squares matching on an equidistant transformation sequence obtained by the rotate right of the secret sequence and a corresponding position published sequence, finding a rotate right step length with a minimum residual error and a matched parameter, and thus camouflaging the secret audio as the published audio and further reconstructing the camouflaged audio as the secret audio. The method is easy to realize, since only the corresponding position equidistant transformation sequence matching is carried out, while the encoding time is reduced, the equidistant transformation number is raised, the matching precision is improved, the detail loss and overflow problems brought by the deformation from a smooth block to a complex block and the restoration from the smooth block to a complex texture block can be effectively avoided, and thus the auditory quality of camouflaged and reconstructed audios is improved further.
Owner:SHAANXI NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products