182 results about "Pitch period" patented technology

The pitch period is the fundamental period of a voiced speech signal, that is, the reciprocal of its fundamental frequency. Pitch itself is the perceptual correlate of fundamental frequency, which corresponds to the vibration rate of the vocal cords during voiced sounds such as vowels. In practice, the term pitch is often used interchangeably with the fundamental frequency of the signal.
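
As an illustration of how a pitch period can be measured, the following minimal sketch picks the lag that maximizes the autocorrelation of one frame. It is not taken from any patent listed here; the function name `estimate_pitch_period` and the 60 Hz to 400 Hz search range are assumptions chosen for the example.

```python
def estimate_pitch_period(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Return (period_in_samples, score) for one frame of samples.

    The score is the autocorrelation at the chosen lag divided by the
    frame energy. The search range f0_min..f0_max is an assumption.
    """
    lag_min = int(sample_rate / f0_max)
    lag_max = min(int(sample_rate / f0_min), len(frame) - 1)
    energy = sum(x * x for x in frame)
    if energy == 0.0 or lag_min < 1 or lag_min > lag_max:
        return 0, 0.0  # silent or too-short frame: no pitch estimate

    best_lag, best_score = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Correlate the frame with a copy of itself shifted by `lag`.
        acf = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        score = acf / energy
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score
```

For a 200 Hz tone sampled at 8000 Hz, the estimated period is 40 samples, and the fundamental frequency follows as `sample_rate / period`. Because fewer samples overlap at longer lags, this unnormalized form favors the shortest matching lag, which helps avoid picking a multiple of the true period.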

Voice-activity detection using energy ratios and periodicity

A voice activity detector (100) filters (204) out noise energy and then computes a high-frequency (2400 Hz to 4000 Hz) versus low-frequency (100 Hz to 2400 Hz) signal energy ratio (224), total voiceband (100 Hz to 4000 Hz) signal energy (214), and signal periodicity (208) on successive frames of signal samples. Signal periodicity is determined by estimating the pitch period (206) of the signal, determining a gain value of the signal over the pitch period as a function of the estimated pitch period, and estimating a periodicity of the signal over the pitch period as a function of the estimated pitch period and the gain value. Voice is detected (230–232) in a segment if either (a) the difference between the average high-frequency versus low-frequency signal energy ratio and the present segment's high-frequency versus low-frequency energy ratio either exceeds (310) a high threshold value or is exceeded (312) by a low threshold value, or (b) the average periodicity of the signal is lower (306) than a low threshold value, or (c) the difference between the average total signal energy and the present segment's total energy exceeds (304) a threshold value and the average periodicity of the signal is lower (304) than a high threshold value, or (d) the average total signal energy exceeds (412) a minimum average total signal energy by a threshold value and voice has been detected (410) in the preceding segment.
Owner:AVAYA INC
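
The per-frame measurements named in the abstract above can be sketched roughly as follows. The band limits come from the abstract; everything else is an assumption for illustration: the helper names, the naive DFT, and in particular the reading of "gain" as a one-tap predictor over the pitch period and of "periodicity" as the normalized prediction residual (lower means more periodic). The abstract does not give these formulas or any threshold values, so no decision rule is shown.

```python
import cmath
import math


def band_energy(frame, sample_rate, f_lo, f_hi):
    """Sum |X[k]|^2 over DFT bins with f_lo <= frequency < f_hi (naive DFT)."""
    n = len(frame)
    total = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if f_lo <= freq < f_hi:
            xk = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                     for t in range(n))
            total += abs(xk) ** 2
    return total


def vad_features(frame, sample_rate, pitch_period):
    """Energy ratio, total energy, and an assumed periodicity measure."""
    low = band_energy(frame, sample_rate, 100.0, 2400.0)    # low band
    high = band_energy(frame, sample_rate, 2400.0, 4000.0)  # high band
    ratio = high / low if low > 0 else float("inf")

    # Assumed gain: best scale g so that frame[i] ~ g * frame[i - T].
    idx = range(pitch_period, len(frame))
    num = sum(frame[i] * frame[i - pitch_period] for i in idx)
    den = sum(frame[i - pitch_period] ** 2 for i in idx)
    gain = num / den if den > 0 else 0.0

    # Assumed periodicity: normalized residual of that one-tap prediction.
    err = sum((frame[i] - gain * frame[i - pitch_period]) ** 2 for i in idx)
    ref = sum(frame[i] ** 2 for i in idx)
    residual = err / ref if ref > 0 else 1.0

    return {"hf_lf_ratio": ratio, "total_energy": low + high,
            "gain": gain, "residual": residual}
```

A detector along the lines of the abstract would track running averages of these values across frames and compare them with thresholds.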

Voice noise reduction method for conference terminal based on neural network model

The invention provides a voice noise reduction method for a conference terminal based on a neural network model. The method comprises the following steps: S1, an audio file is collected by the conference terminal device to generate a digital audio signal in the time domain; S2, the digital audio signal is framed, and a short-time Fourier transform is performed; S3, the frequency-domain amplitude spectrum is mapped onto frequency bands, and Mel-frequency cepstral coefficients are computed; S4, first-order and second-order differential coefficients are calculated from the Mel-frequency cepstral coefficients, a pitch correlation coefficient is calculated on each frequency band, and pitch period features and VAD features are further extracted; S5, the characteristic parameters of the audio are used as the input of the neural network model, the neural network is trained offline to learn the frequency band gains that produce noise-reduced speech, and the trained weights are fixed; S6, the trained neural network model generates the frequency band gains, the output band gains are mapped to the spectrum, the phase information is added, and a noise-reduced speech signal is restored through the inverse Fourier transform. The advantage of the method is that real-time noise reduction can be achieved.
Owner:FUJIAN STAR NET WISDOM TECH CO LTD
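
The final step of the abstract above (mapping band gains to the spectrum, keeping the phase, and inverting the transform) might look roughly like the sketch below for a single frame. The gains are simply passed in here, whereas in the patent they come from the neural network; the function names, the naive DFT, and the band layout given as bin indices are assumptions for illustration.

```python
import cmath
import math


def dft(x):
    """Naive forward DFT of a real or complex sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft_real(spectrum):
    """Naive inverse DFT, returning the real part of each sample."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def apply_band_gains(frame, band_edges, gains):
    """Scale each frequency band of one frame by a gain, keeping the phase.

    band_edges holds bin indices [e0, e1, ..., eB] over the one-sided
    spectrum (0 .. n//2); band b covers bins band_edges[b] up to, but not
    including, band_edges[b + 1]. Bins outside every band keep gain 1.
    """
    n = len(frame)
    spectrum = dft(frame)

    # Expand per-band gains to per-bin gains.
    bin_gain = [1.0] * (n // 2 + 1)
    for b, g in enumerate(gains):
        for k in range(band_edges[b], band_edges[b + 1]):
            bin_gain[k] = g

    # A real, non-negative gain changes only the magnitude of a bin, so the
    # original phase is preserved. Mirrored bins reuse the same gain to keep
    # the spectrum conjugate-symmetric and the output real.
    scaled = [spectrum[k] * bin_gain[k if k <= n // 2 else n - k]
              for k in range(n)]
    return idft_real(scaled)
```

For example, with a 64-sample frame, `apply_band_gains(frame, [0, 10, 33], [1.0, 0.0])` keeps bins 0 to 9 and suppresses bins 10 to 32. A real-time system would use an FFT with overlapping windowed frames rather than this naive transform.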