38 results about "Vocal organ" patented technology

Vocal organ (noun): any of the organs involved in speech production; a movable speech organ; the vocal apparatus of the larynx, comprising the true vocal folds and the space between them, where the voice tone is generated.

User registration and logon method by combining speaker speech identity authentication and account code protection in network games

The invention discloses a user registration and logon method for network games that combines speaker speech identity authentication with account code protection. A client receives and processes a speech signal, and a server carries out speaker speech identity authentication against a speech template. The method uses text-dependent speaker authentication together with account code protection, and so provides an anti-theft function. At the client, the method comprises a dual-cache sound storage scheme, which acquires and stores the sound signal, and end-point detection and feature extraction, which detects end points in the sampled sound signal to determine the start and end frames of the effective speech signal and extracts the feature parameters (linear prediction cepstral coefficients) of each frame. At the server, the method uses dynamic programming to compute the degree of match between the speaker's speech parameters and the speech template. If the account code is stolen, an illegal user cannot easily pass the speech identity authentication, because that user's vocal process and vocal organs differ from those of the registered user. Even if the illegal user logs on by copying both the account code and the speech parameters, the server can compare the submitted parameters with the prestored speech parameters and detect their conformity, causing the speech identity authentication to fail. After successfully registering the account code, the user must register a speech code by speaking and repeating the same text until enough speech templates have been generated. To log on, the user speaks the speech code. If the speaker speech identity authentication succeeds, the server can confirm the logon either immediately or after the user has also entered the correct account code. If the authentication fails, the server can either reject the logon immediately or confirm it after requiring the user to enter the correct account code.
Owner:朱建政
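
The end-point detection and template matching steps described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the abstract says only "dynamic programming", so the use of dynamic time warping (DTW) is an assumption, as are the function names, the energy threshold, and the acceptance cost. Feature vectors (for example, cepstral coefficients per frame) are assumed to be computed elsewhere.

```python
import math

def frame_energy(frame):
    """Mean squared amplitude of one frame of audio samples."""
    return sum(s * s for s in frame) / len(frame)

def detect_endpoints(frames, threshold):
    """Return (start, end) indices of the first and last frame whose
    energy exceeds the threshold, or None if no frame does.
    The threshold is a hypothetical tuning parameter."""
    active = [i for i, f in enumerate(frames) if frame_energy(f) > threshold]
    if not active:
        return None
    return active[0], active[-1]

def dtw_distance(seq_a, seq_b):
    """Dynamic-programming alignment cost (DTW, an assumed choice)
    between two sequences of per-frame feature vectors."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(seq_a[i - 1], seq_b[j - 1])
            # Extend the cheapest of the three allowed predecessor paths.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

def matches_any_template(features, templates, max_cost):
    """Accept the speaker if the best template alignment cost is within
    max_cost (a hypothetical acceptance threshold)."""
    return min(dtw_distance(features, t) for t in templates) <= max_cost
```

Comparing against several templates and taking the lowest cost reflects the abstract's requirement that the user repeat the speech code until enough templates exist.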

Vocal organ visible speech synthesis system

Active | CN102820030A | High conversion sensitivity | Small amount of calculation | Speech synthesis | Visible Speech | Voice frequency
The invention provides a vocal organ visible speech synthesis system comprising a voice frequency analysis module, a parameter mapping module, an animation drive module and a motion analysis module. The voice frequency analysis module receives the input speech signal of a speaker, identifies silent sections from energy information, encodes the non-silent sections of speech and outputs speech line spectrum pair parameters. The parameter mapping module receives the line spectrum pair parameters transmitted in real time from the voice frequency analysis module and converts them into model motion parameters using a trained Gaussian mixture model. The animation drive module receives the model motion parameters generated in real time by the parameter mapping module and drives the key points of a virtual vocal organ model, which in turn drives the motion of the whole model. Because the motion of the model is driven by motion parameters generated directly from a frequency-domain parameter of the input speech, the system is free from the limitations of an online database and a physiological model.
Owner:中科极限元(杭州)智能科技股份有限公司
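
One common way to map acoustic parameters to motion parameters with a Gaussian mixture model is Gaussian mixture regression, which returns the expected motion value given the acoustic input. The sketch below assumes that approach and, for brevity, a scalar input and scalar output; the abstract does not specify the mapping rule, and the component fields (`weight`, `mean_x`, `mean_y`, `var_x`, `cov_xy`) are hypothetical names for parameters of an already trained joint model.

```python
import math

def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def gmm_map(x, components):
    """Map a scalar acoustic feature x to the expected motion parameter
    E[y | x] under a joint Gaussian mixture over (x, y).
    Each component is a dict with hypothetical keys:
    weight, mean_x, mean_y, var_x, cov_xy."""
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mean_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)
    if total == 0.0:
        raise ValueError("input is too far from every mixture component")
    estimate = 0.0
    for lik, c in zip(likelihoods, components):
        responsibility = lik / total  # posterior probability of this component
        conditional_mean = c["mean_y"] + c["cov_xy"] / c["var_x"] * (x - c["mean_x"])
        estimate += responsibility * conditional_mean
    return estimate
```

In a real system the input would be a vector of line spectrum pair parameters and the output a vector of key-point motion parameters, which requires the matrix form of the same formula.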

Pronunciation method of a three-dimensional visualized Mandarin Chinese pronunciation dictionary with rich emotional expression ability

Inactive | CN103258340B | Coherent | Fully describe the coarticulation phenomenon | Animation | Vocal organ | Animation
The invention provides a pronunciation method for a three-dimensional visualized Mandarin Chinese pronunciation dictionary with rich emotional expression ability, and relates to the technical fields of voice visualization, language teaching, vocal organ animation and facial animation. The method produces the vocal organ animation and, at the same time, a facial animation with vivid expressions. Because it is based on captured real motion data, the physiological motion mechanism of the vocal organs and a hidden Markov model, the resulting vocal organ animation is coordinated and consistent with the facial animation, and coarticulation phenomena in continuous speech animation can be fully described. Data-driven models are embedded into physiological models, combining the strengths of each in describing local facial detail and realism, so that a highly realistic facial animation is generated. Objective and subjective performance tests of the system verify the effectiveness of the pronunciation method for intelligent assisted language teaching.
Owner:UNIV OF SCI & TECH OF CHINA

Method and system for predicting human body sound production effect

The invention provides a method and system for predicting a human body sound production effect, and relates to the technical field of voice improvement. The method comprises the following steps: acquiring sound production organ information of a user; deriving the corresponding predicted postoperative sound production organ information from the user's sound production organ information; and inputting the predicted postoperative sound production organ information into a preset sound production effect model to generate predicted sound production effect information. The preset model contains the sound production organ information and sound production effects of a plurality of samples, and the prediction is obtained by comparing the input predicted postoperative sound production organ information with the sound production organ information stored in the model. The prediction lets the user learn the expected postoperative sound production effect before the operation, provides data support for the operation's outcome, and helps the user understand the operation scheme and its effect.
Owner:张育青
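
The comparison step described in this abstract can be illustrated with a nearest-sample lookup. This is only a sketch under assumptions: the abstract does not say how the comparison is performed, so the Euclidean distance, the numeric encoding of organ information, and the name `predict_effect` are all hypothetical.

```python
import math

def predict_effect(predicted_organ_info, samples):
    """Return the sound production effect of the stored sample whose
    organ information is closest to the predicted postoperative
    organ information.
    samples: list of (organ_info_vector, effect) pairs, standing in
    for the preset sound production effect model."""
    nearest = min(samples, key=lambda s: math.dist(s[0], predicted_organ_info))
    return nearest[1]
```

For example, with samples `[([1.0, 2.0], "effect A"), ([5.0, 5.0], "effect B")]`, an input of `[1.2, 2.1]` selects the first sample and returns its effect.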