Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

218 results about "Voice source" patented technology

The voice source contains important lexical and non-lexical information. The non-lexical information can convey, for example, prosodic events, emotional status, as well as cues pertaining to the uniqueness of the speaker’s voice. In engineering applications, there is a need for a more accurate source model that could model different voice qualities.

Method for enhancing microphone array voice based on combined inhibition

The invention provides a method for enhancing microphone array voice based on combined inhibition. The method comprises the following steps of: structuring a microphone array for receiving external signals; analyzing the signals and obtaining time delays of different array signals relative to benchmark array signals in the microphone arrays opposite to a target voice source; respectively performing time delay compensation on digital signals corresponding to the two microphones, obtaining the compensated signals; respectively performing subband decomposition on the compensated array signals, and then forming fixed beams on each subband; meanwhile, respectively using blocking matrixes on each subband to obtain noise reference signals on each subband; and then respectively removing the noiseirrelative to the target voice form the fixed beam forming device on corresponding subband through an adaptive filtering processing algorithm, and then merging the subbands, thereby forming an initial gain signal; and meanwhile, making use of the previously compensated any two array signals to obtain a filter for inhibiting the noise signal related to the target voice through a recursive mutual power spectral density, thereby obtaining the final target voice signal through combining the initial gain signal.
Owner:ZHEJIANG UNIV

Multidimensional user identity identification method

The invention provides a multidimensional user identity identification method. The multidimensional user identity identification method comprises the steps of: detecting a human body in a photographic range by means of a camera, extracting facial features of the human body, comparing the facial features with user pictures prestored in a sample library, calculating a face matching coefficient, and preliminarily judging whether a user with permission exists in the photographic range; receiving user voice by using a microphone, converting an audio analog signal into a digital sequence, comparing the digital sequence with user voiceprints prestored in the sample library, calculating a voiceprint matching coefficient, and calculating the face matching coefficient and the voiceprint matching coefficient again to obtain a matching degree, so as to judge whether the user have the permission; and establishing a model for the user when judging that the user has the permission, carrying out human body dynamic tracking on the user, matching a voice source position with a position calculated through carrying out human body tracking on the user, judging that a command is issued by the user with permission and the command is valid when a voice position matches with an image position, and executing the command.
Owner:常州百芝龙智慧科技有限公司

Signal processing method for enhancing target voice signal pickup in sound environment

The invention relates to a signal processing method for enhancing target voice signal pickup in a sound environment. The method includes the steps that (1), parameters of an ESN are acquired through experiments, and a corresponding sound source model is established; (2), the model is used for two occasions, namely, when output of the model is certain expected target voice signals and input of the model is a mixture of sound environment reflected sound signals of a target voice source and the target voice signals, the model can be used for echo cancellation of field amplified sound; when the output of the model is the expected target voice signals and the input of the model is a mixture of sound environment reflected sound signals of another specific voice source and the target voice signals, the model can be used for echo cancellation of voice communication between two specific persons; (3), when the model is used by a target voice person in the actual sound environment and the pickup position changes, reflected signals of sound source signals pointed by training can be suppressed as well, and then corresponding enhanced target voice signals are output. By the adoption of the signal processing method, the influence of the change of the pickup position on quality of the voice signals is overcome.
Owner:FUQING BRANCH OF FUJIAN NORMAL UNIV

Voice processing method and apparatus, terminal device and storage medium

ActiveCN110491404AReduce input lagAchieve real-time noise reductionSpeech recognitionTerminal equipmentNetwork model
The invention discloses a voice processing method and apparatus, a terminal device and a storage medium. The method comprises the steps of: obtaining noise-containing audio data, wherein the noise-containing audio data comprise a voice source signal; preprocessing the noise-containing audio data, extracting the noise-containing audio features from the noise-containing audio data, and inputting thenoise-containing audio features into a pre-trained voice processing network model to obtain de-noised audio features, wherein the pre-trained voice processing network model comprises multiple cause and effect convolutional layers and at least one recurrent neural network layer, the multiple cause and effect convolutional layers are configured to output texture features of the corresponding voicesource signal according to the noise-containing audio features, and the at least one recurrent neural network layer is configured to output the de-noised audio features according to the texture features; and obtaining an estimated value of the voice source signal according to the de-noised audio features, and outputting the estimated value as de-noised noise-containing audio data. According to thevoice processing method and apparatus disclosed by the invention, real-time noise reduction of the noise-containing audio data is realized by the cause and effect convolutional layers and the recurrent neural network layer, and the voice noise reduction effect is improved.
Owner:广州方硅信息技术有限公司

Voice-tracking microphone and control method of voice-tracking microphone

The invention provides a voice-tracking microphone and a control method of the voice-tracking microphone. The voice-tracking microphone is characterized by comprising a body, a voice sensor set, a control circuit and a drive device. The body comprises a movable shaft, a base and a microphone body. The voice sensor set is used for picking up a corresponding upper voice signal, a corresponding lower voice signal, a corresponding left voice signal and a corresponding right voice signal according to a voice source. The control circuit comprises an A / D converter, a comparison circuit and a drive circuit. The drive device comprises a height adjustment mechanism and a direction adjustment mechanism, wherein the height adjustment mechanism is used for adjusting the height of the microphone body, the direction adjustment mechanism is used for adjusting the direction of the microphone body, the microphone body is adjusted to be higher when an upper volume value is larger than a lower volume value, the microphone body is adjusted to be lower when the upper volume value is smaller than the lower volume value, the microphone body is rotated leftward when a left volume value is larger than a right volume value, and the microphone body is rotated rightward when the right volume value is larger than the left volume value. According to the voice-tracking microphone, the voice sensor set adjusts the height and the direction of the microphone body according the position of the voice source so that voice tracking can be realized, and great convenience is brought to a user.
Owner:UNIV OF SHANGHAI FOR SCI & TECH

Self-adjusting pharyngeal cavity electronic larynx voice communication system and method

The invention relates to a pharyngeal cavity electronic larynx synthesis and communicated system which can be automatically regulated and a method thereof, wherein a device based on a software platform and external hardware of a computer comprises a camera, a microphone and an electronic larynx oscillator; the automatic control to the working state of the electric larynx and the voice source synthesis of the pharyngeal cavity is realized through extracting visual phonetic feature information of movement images of the face and the neck of a user, thereby the electronic larynx can be used without holding by hands, which is simpler and more convenient, and the problems that the synthesis voice source is not consistent with the electronic larynx applying part and the voice of the electronic larynx is mechanical and unnatural are solved; and meanwhile, the dynamic de-noising enhancement treatment is implemented for the voice rebuilding of the pharyngeal cavity electronic larynx, thereby the quality and the intelligibility of the rebuilt voice are improved; and the remote real-time communication of the voice of the electronic larynx is realized through the network transmission technique, the application range of the electronic larynx is further expanded, and the life quality of a patient after laryngectomy is improved.
Owner:XI AN JIAOTONG UNIV

Phase difference measurement-based microphone array direction finding method

ActiveCN104811886ASolve real-time direction finding problemsDirection finding speedElectrical apparatusPhase differenceVoice source
Disclosed is a phase difference measurement-based microphone array direction finding method. The phase difference measurement-based microphone array direction finding method comprises the steps of obtaining the sampling data of a microphone array; performing fast Fourier transform on the sampling data of the microphone array according to channel sequence numbers to obtain the frequency domain data of the microphone array; selecting available frequency units; according to the available frequency units, computing corresponding array distance difference vectors; utilizing the array distance difference vectors of all the available frequency units to compose the high-dimensional linear system of equations of voice signal direction vectors, cooperatively solving the direction vector estimated values of voice source signals; according to the direction vector estimated values of the voice source signals, computing the azimuth angle estimated values and the pitch angle estimated values of the voice source signals. The phase difference measurement-based microphone array direction finding method has the advantages of being high in direction finding speed and precision, and particularly through the phase difference information of multiple available frequency units of the voice signals, obtains high azimuth angle and pitch angle estimating performance.
Owner:XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products