Multi-sampling-rate voice recognition method, device thereof and system and storage medium

A multi-sampling-rate speech recognition technology, applied in the fields of speech recognition, speech analysis and instruments, that addresses the problems of high operation and maintenance cost and low resource utilization efficiency.

Active Publication Date: 2020-05-05
AISPEECH CO LTD

AI Technical Summary

Problems solved by technology

Existing solutions suffer from low resource utilization efficiency and high operation and maintenance costs.



Embodiment Construction

[0028] To make the purpose, features and advantages of the present invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described below clearly and completely with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those skilled in the art on the basis of these embodiments without creative effort fall within the protection scope of the present invention.

[0029] In this specification, descriptions that refer to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that a specific feature, structure, material or characteristic described in connection with the embodiment or example is included in at least one embodiment or ...


Abstract

The invention discloses a multi-sampling-rate voice recognition method, device, system and storage medium. First, without changing the audio sampling rate, features are extracted from audio of different sampling rates using a configuration matched to each sampling rate, and the extracted features are used to train a neural network model. In addition to the general speech recognition label, the neural network model is given a sampling rate classification label, and during training a gradient reversal method is used to perform adversarial training on the sampling rate classification label, so that the trained multi-sampling-rate speech recognition model adapts autonomously to audio of different sampling rates. The trained model can then be used for speech recognition, so that audio input at multiple sampling rates is handled uniformly by a single speech recognition model.
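The gradient reversal step named in the abstract can be illustrated with a small sketch. This is not the patent's implementation: the function names, the scaling factor `lam`, and the gradient values are all hypothetical, and the gradients are written by hand rather than produced by a training framework. The sketch only shows the core idea that the layer is the identity in the forward pass and flips the sign of the sampling-rate classifier's gradient in the backward pass, which pushes the shared layers toward features that do not reveal the sampling rate.

```python
# Minimal sketch of a gradient reversal layer (GRL) with hand-written gradients.
# All names and the scaling factor `lam` are illustrative assumptions.

def grl_forward(features):
    """Forward pass: the GRL is the identity, so features pass through unchanged."""
    return list(features)


def grl_backward(grad_from_classifier, lam=1.0):
    """Backward pass: flip the sign of (and scale by lam) the gradient coming
    from the sampling-rate classifier before it reaches the shared layers."""
    return [-lam * g for g in grad_from_classifier]


def shared_layer_gradient(grad_asr, grad_rate, lam=1.0):
    """Gradient seen by the shared layers: the recognition gradient plus the
    reversed sampling-rate classification gradient."""
    reversed_rate = grl_backward(grad_rate, lam)
    return [a + r for a, r in zip(grad_asr, reversed_rate)]


if __name__ == "__main__":
    grad_asr = [0.2, -0.1, 0.05]   # made-up gradient of the recognition loss
    grad_rate = [0.4, 0.3, -0.2]   # made-up gradient of the rate-classifier loss
    print(shared_layer_gradient(grad_asr, grad_rate, lam=0.5))
```

In an automatic differentiation framework the same behavior is usually written as a custom operation whose backward rule negates the incoming gradient; the classifier head itself is still trained normally on its own loss.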

Description

Technical field

[0001] The invention relates to the field of artificial intelligence voice interaction, in particular to a multi-sampling-rate voice recognition method, device, system and storage medium.

Background

[0002] With the continuous development of artificial intelligence and electronic communication technology, intelligent voice interaction is becoming increasingly popular and is applied in many product fields, including intelligent customer service, call centers, smart speakers and smart watches.

[0003] However, although these are all speech recognition applications, the speech sampling rate differs between application scenarios. When one system must process speech at several sampling rates, the following schemes are often used: 1) Unify the sampling rate of the audio by up / down sampling, so that all audio is fed into a single speech recognition system. This solution will change the nature of the original audio, ...
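In contrast to scheme 1), the abstract describes leaving the sampling rate untouched and choosing the feature-extraction configuration per sampling rate. The sketch below shows one way such a configuration could look. The source does not state the sampling rates or parameters used, so the 8 kHz and 16 kHz rates, the 25 ms window, the 10 ms shift, the 40 filterbank channels, and the names `FeatureConfig`, `config_for` and `num_frames` are all assumptions made for illustration.

```python
# Sketch: per-sampling-rate framing configuration, so audio is not resampled.
# Window/shift durations and the filter count are assumed, not from the patent.
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureConfig:
    sample_rate_hz: int
    frame_length: int  # samples per analysis window
    frame_shift: int   # samples between consecutive window starts
    num_filters: int   # kept equal across rates so feature dimensions match


def config_for(sample_rate_hz, window_ms=25, shift_ms=10, num_filters=40):
    """Derive frame sizes in samples from fixed durations in milliseconds."""
    return FeatureConfig(
        sample_rate_hz=sample_rate_hz,
        frame_length=sample_rate_hz * window_ms // 1000,
        frame_shift=sample_rate_hz * shift_ms // 1000,
        num_filters=num_filters,
    )


def num_frames(num_samples, cfg):
    """Number of full frames that fit in a waveform of num_samples samples."""
    if num_samples < cfg.frame_length:
        return 0
    return 1 + (num_samples - cfg.frame_length) // cfg.frame_shift


if __name__ == "__main__":
    for rate in (8000, 16000):
        cfg = config_for(rate)
        # One second of audio at each rate yields the same number of frames.
        print(cfg, num_frames(rate, cfg))
```

Fixing the window and shift in milliseconds rather than in samples means both rates produce frames covering the same time span, with the same feature dimension, so a single model can consume either stream without the waveform being altered.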


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/02, G10L15/06, G10L15/16
CPC: G10L15/02, G10L15/063, G10L15/16
Inventors: 施雨豪, 钱彦旻
Owner: AISPEECH CO LTD