Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

260 results about "Voice Training" patented technology

A variety of techniques used to help individuals utilize their voice for various purposes and with minimal use of muscle energy.

Voice state detection method suitable for echo cancellation system

ActiveCN105957520AImprove accuracyOvercoming the problem of inaccurate detectionSpeech recognitionProximal pointSvm classifier
The invention relates to a voice state detection method suitable for an echo cancellation system. The voice state detection method relates to the field of voice interaction technologies based on an IP network. The voice state detection method comprises the steps of: constructing a support vector machine (SVM) classifier by utilizing noise training samples and voice training samples, wherein signals to be detected are far-end and near-end signals after blocking, carrying out VAD judgment on the block far-end signal by adopting the constructed SVM classifier based on a Gaussian mixture model, stopping updating and filtering of a filter and outputting a near-end voice signal directly if the judgment result is that no voice exists, and carrying out double-end conversation judgment when judging that voice exists at a far end; stopping updating coefficients of the filter when in double-end conversation, and filtering the near-end signal; otherwise, conducting coefficient updating and filtering of the filter according to the far-end signal. The voice state detection method improves accuracy of voice activity detection, prevents a double-end mute state from being misjudged to be a double-end conversation state, and prevents error updating and filtering of the filter without a reference signal.
Owner:BEIJING UNIV OF POSTS & TELECOMM

A method for a voice control remote controller and a voice remote controller

The invention provides a method for a voice controlled remote controller and a voice remote controller. The method comprises the following steps: 1), the voice controlled remote controller is made to enter a voice training mode; codes of buttons are set; and character voice instructions are set for functions, corresponding to the button codes, of a to-be-remotely-controlled device; 2), the button codes are collected and the character voice instructions are identified; 3), the collected button codes and the identified voice instructions are set to be correspondingly related to each other, and the button codes and the voice instructions which are set to be correspondingly related to each other are stored; 4), the voice remote controller is made to enter a voice recognition mode, and input voice control instructions are identified; 5), matching is carried out between the identified voice control instructions and the stored character voice instructions, and after the matching succeeds, the button codes corresponding to the matched character voice instructions are obtained; and 6), the button codes are emitted to the to-be-remotely-controlled device. According to the invention, problems of existing voice remote controllers that selection of one channel can not be carried out via a plurality of voices are solved.
Owner:上海闻通信息科技有限公司

Combined model training method and system

The embodiment of the invention provides a combined model training method. The method comprises the following steps: extracting the phase spectrum and the logarithm magnitude spectrum of a noisy voicetraining set in an implicit manner; by utilizing the magnitude spectrum fragments of the logarithm magnitude spectrum after expansion as the input features of a time frequency masking network, and byutilizing the noisy voice training set and a clear voice training set, determining a target masking label used for training the time frequency masking network, based on the input features and the target masking label, training the time frequency masking network, and estimating a soft threshold mask; and enhancing the phase spectrum of the noisy voice training set by utilizing the soft threshold mask, wherein the enhanced phase spectrum is adopted as the input features of a DOA (direction of arrival) estimation network, and training the DOA estimation network. The embodiment of the invention further provides a combined model training system. According to the embodiment of the invention, by setting the target masking label, the input features are extracted in an implicit manner, and the time frequency masking network and DOA estimation network combined training is more suitable for the DOA estimation task.
Owner:AISPEECH CO LTD

Voice conversion method and device and electronic equipment

The invention discloses a voice conversion method and a device and electronic equipment, and relates to the technical field of voice conversion, voice interaction, natural language processing and deeplearning. According to the specific implementation scheme, the method comprises the followings steps: acquiring source voice of a first user and reference voice of a second user; extracting first voice content information and a first acoustic feature from the source voice; extracting a second acoustic feature from the reference voice; inputting the first voice content information, the first acoustic feature and the second acoustic feature into a pre-trained voice conversion model to obtain a reconstructed third acoustic feature, and obtaining the pre-trained voice conversion model according to voice training of a third user; and synthesizing the target speech according to the third acoustic feature. According to the method, the first voice content information and the first acoustic feature of the source voice and the second acoustic feature of the reference voice are input into the pre-trained voice conversion model, and the target voice is obtained and synthesized according to the reconstructed third acoustic feature, so that the waiting time of voice conversion can be shortened.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products