Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

35 results about "Speech reconstruction" patented technology

Speech enhancement system and method based on MFrSRRPCA algorithm

ActiveCN109215671AReduces the possibility of false eliminationsValid reservationSpeech analysisTime domainTime–frequency analysis
The invention discloses a speech enhancement system and method based on a multi-subband short-time fractional Fourier spectrum random rearrangement robust principal component analysis MFrSRRPCA algorithm. The realization steps are: a time-frequency analysis module generates time-frequency information of noisy speech; the time-frequency analysis module generates time-frequency information of noisyspeech. The time-frequency subband division module divides the time-frequency amplitude spectrum of the noisy speech into a plurality of noisy subbands. Each time-frequency amplitude spectrum enhancement module randomly disrupts the sequence of each frame spectrum element in the corresponding noisy sub-band, and generates the corresponding enhancement sub-band by using a robust principal componentanalysis algorithm according to the noise intensity estimation value in the corresponding sub-band. The time-frequency subband recombination module composes all the enhancement subbands to enhance the time-frequency amplitude spectrum. The time-domain speech reconstruction module reconstructs the enhanced time-frequency amplitude spectrum into enhanced speech. The invention can improve the soundquality and intelligibility of the noisy speech, and can be used for the speech enhancement and noise reduction of the speech receiving system.
Owner:XIDIAN UNIV

Cross-modal generation method based on voice and face images

The invention relates to a cross-modal generation method based on voice and a face image. The method comprises the steps of voice reconstruction of a face and personalized voice synthesis of the faceimage. A voice reconstruction face model based on residual priori is provided for voice reconstruction of a face, and the face of the person is generated according to an input section of unknown voice. According to personalized voice synthesis of the face image, a face image personalized voice synthesis model based on residual priori is provided, and the voice of the person is synthesized according to the given face image and a section of text. The invention is scientific and reasonable in design, the effect of the voice reconstruction face model can generate the face image very similar to theoriginal face, the robustness is very high, the number of the generated faces is not a fixed number, the voice of any speaker is input, and the face similar to the speaker can be reconstructed. And the residual priori face image personalized speech synthesis model is also used for synthesizing the speech of the person according to any face image. In addition, the proposed residual priori knowledge method can accelerate convergence of the model and achieve a better effect.
Owner:TIANJIN UNIV

Voice processing system and method and intelligent fume hood system based on active noise reduction

ActiveCN112139191AImprove practicalityAccurate noise analysisDirt cleaningSpeech reconstructionNoise
The invention discloses a voice processing system and method and an intelligent fume hood system based on active noise reduction. The system comprises a voice collection module, an exhaust fan parameter obtaining module, a noise reduction module and a voice reconstruction module, wherein the voice collection module is used for collecting voice and converting the voice into a digital signal; the exhaust fan parameter obtaining module is used for obtaining the rotating speed of an exhaust fan; the noise reduction module is used for obtaining noise signals of the exhaust fan and converting the noise signals into noise reduction signals, the noise reduction module is used for receiving output data of the exhaust fan parameter obtaining module, fundamental frequency and sound pressure are obtained through calculation in combination with the number of blades of the exhaust fan, the diameter of an impeller and power, and therefore the noise signals are obtained; and the voice reconstruction module is used for superposing the noise reduction signals and output signals of the voice collection module to obtain a reconstruction signal. According to the system, the noise signal of the exhaustfan is obtained by directly measuring the rotating speed of the exhaust fan and combining the number of the blades of the exhaust fan, the noise analysis is accurate, and the noise reduction accuracyand the noise reduction effect in the exhaust environment using the exhaust fan are improved.
Owner:AVIC HUADONG OPTOELECTRONICS (SHANGHAI) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products