84 results about "Change voice" patented technology

Method for realizing sound speed-variation without tone variation and system for realizing speed variation and tone variation

The invention discloses a system for changing the speed and the tone of sound. It comprises an input cache module, a tone variation processing module, a speed-variation no-tone-variation processing module and a data output module. The input cache module reads the sound signal data to be processed into the cache; the tone variation processing module processes the sound signal to change its tone; the speed-variation no-tone-variation processing module changes the speed of the sound without changing its tone; and the data output module outputs the speed-varied, tone-varied signal. The speed-variation no-tone-variation processing module comprises a segmentation data module and a connection data module: the segmentation data module uses a window function to extract a string of signal subfamilies (namely, small sections of sound) from the original speech signal according to the speed-variation coefficient, and the connection data module connects the signal subfamilies in time order to obtain the speed-variation no-tone-variation signal. The invention realizes both speed variation without tone variation and speed variation with tone variation for audio at very low algorithmic complexity and without introducing noise, thereby enhancing the quality of the processed sound.
Owner:刘盛举 +1
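
The segment-and-connect idea in the abstract above can be illustrated with a plain overlap-add (OLA) time stretch. This is only a minimal sketch under stated assumptions, not the patented method: the abstract does not give the window type, the segment length or how segments are joined, so the Hann window, the 50% overlap and the names `hann`, `change_speed` and `frame_len` are all hypothetical choices made for illustration.

```python
import math


def hann(n):
    """Periodic Hann window; at 50% overlap, adjacent windows sum to 1."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def change_speed(samples, speed, frame_len=512):
    """Change playback speed without resampling (plain overlap-add).

    speed > 1 shortens the signal, speed < 1 lengthens it. Each windowed
    segment keeps its original samples, so the pitch inside a segment is
    unchanged. (Assumed scheme, not taken from the patent.)
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    if frame_len < 2 or frame_len % 2:
        raise ValueError("frame_len must be an even number >= 2")

    synth_hop = frame_len // 2                   # spacing of segments in the output
    ana_hop = max(1, round(speed * synth_hop))   # spacing of segments in the input
    window = hann(frame_len)

    if len(samples) < frame_len:
        return []
    n_frames = (len(samples) - frame_len) // ana_hop + 1

    out = [0.0] * ((n_frames - 1) * synth_hop + frame_len)
    for k in range(n_frames):
        src = k * ana_hop      # where the segment is cut from the input
        dst = k * synth_hop    # where it is placed in the output, in time order
        for i in range(frame_len):
            out[dst + i] += samples[src + i] * window[i]
    return out


if __name__ == "__main__":
    rate = 8000
    tone = [math.sin(2.0 * math.pi * 220.0 * t / rate) for t in range(rate)]
    faster = change_speed(tone, 2.0)
    slower = change_speed(tone, 0.5)
    print(len(tone), len(faster), len(slower))
```

Plain OLA ignores how neighbouring segments line up in phase, so it can produce audible artifacts on periodic signals; refinements such as WSOLA search for a better-aligned cut point before connecting each segment.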

Shiatsu type fundamental frequency adjustment electronic artificial larynx

The invention relates to a shiatsu type fundamental frequency adjustment electronic artificial larynx, in which a shiatsu (finger-pressure) switch button changes the fundamental frequency of the glottal wave at any time so as to change the voice tone. It mainly comprises a shiatsu sensing part, a waveform generating and processing system, a power amplification circuit and an electricity-force conversion system. A glottal waveform having individual voice characteristics is stored in the waveform generating and processing system; the fundamental frequency of the waveform can be changed at any time during waveform generation under the control of the switch / shiatsu button; the generated waveform is converted to an analog signal by a digital-to-analog conversion module in the system; and the output of the digital-to-analog converter is applied to the electricity-force conversion system after power amplification. The amplified waveform is converted to mechanical vibration by a high-magnetic-field electricity-force energy converter, the vibration is applied to the neck of the patient through a vibration film to produce the glottal wave, and the wave forms sound outside the lips after being modulated by the patient's tongue, nasal cavity, oral cavity, lips and the like.
Owner:BEIHANG UNIV +1
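
Replaying a stored glottal waveform at a fundamental frequency that can change at any moment, as the abstract above describes, can be sketched as a wavetable oscillator driven by a phase accumulator. This is an illustrative sketch only: the abstract does not say how button pressure maps to frequency or how the stored waveform is read out, so `pressure_to_f0`, its 80 to 160 Hz range, the linear mapping and the linear interpolation in `synthesize` are all assumptions.

```python
def pressure_to_f0(pressure, f0_min=80.0, f0_max=160.0):
    """Map a normalized button pressure in [0, 1] to a fundamental frequency in Hz.

    Hypothetical linear mapping; values outside [0, 1] are clamped.
    """
    p = min(1.0, max(0.0, pressure))
    return f0_min + (f0_max - f0_min) * p


def synthesize(table, f0_per_sample, sample_rate):
    """Read one stored period (`table`) at a per-sample fundamental frequency.

    `phase` is the position inside the period in [0, 1). Advancing it by
    f0 / sample_rate each sample lets the pitch change between any two
    samples without a jump in the output waveform.
    """
    n = len(table)
    phase = 0.0
    out = []
    for f0 in f0_per_sample:
        pos = phase * n
        whole = int(pos)
        frac = pos - whole
        i = whole % n
        # Linear interpolation between neighbouring table entries.
        out.append(table[i] * (1.0 - frac) + table[(i + 1) % n] * frac)
        phase = (phase + f0 / sample_rate) % 1.0
    return out


if __name__ == "__main__":
    # Toy stand-in for a stored glottal period; a real table would be recorded.
    table = [0.0, 0.6, 1.0, 0.3, -0.2, -0.1]
    # Pressure ramps from 0 to 1 over 1000 samples, so the pitch glides upward.
    f0_track = [pressure_to_f0(k / 999.0) for k in range(1000)]
    wave = synthesize(table, f0_track, sample_rate=8000)
    print(len(wave), f0_track[0], f0_track[-1])
```

In the device described, the resulting samples would then go to the digital-to-analog converter and power amplifier; those stages are outside the scope of this sketch.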

Voiceprint identity authentication device and authentication optimization method and system

The invention discloses an authentication optimization method for a voiceprint identity authentication device. The method comprises the following steps: the Mel-frequency cepstral coefficients corresponding to the registration voice signals are extracted, and each set of coefficients is bound to a preset number; with the Mel-frequency cepstral coefficients as the input layer and the bound numbers as the output layer, a differentiated deep belief network is trained to obtain the parameter space; the Mel-frequency cepstral coefficients are input to the differentiated deep belief network, and the hidden layer outputs are taken as the feature vectors; all the feature vectors are used as input to construct a Gaussian mixture model; and the Mel-frequency cepstral coefficients of any registration voice signal are input to the differentiated deep belief network to obtain multiple hidden layer outputs, from which those whose degree of distinction is higher than a preset threshold are selected as training data to update the Gaussian mixture model. Subsequent, naturally changed voice signals of the registrant thus serve as training data for updating the Gaussian mixture model, so that the model adapts to the registrant's current sound production state and the recognition rate can be guaranteed.
Owner:GUANGDONG UNIV OF TECH