Multi-sampling-rate voice recognition method, device thereof and system and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-sampling rate, speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of high operation and maintenance cost and low resource utilization efficiency.

Active Publication Date: 2020-05-05

AISPEECH CO LTD

View PDF5 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

This solution has the problems of low resource utilization efficiency and high operation and maintenance costs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0028] In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described The embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0029] In the description of this specification, descriptions with reference to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-sampling-rate voice recognition method, a device thereof and a system and a storage medium. The method comprises the following steps: firstly, under the condition thatthe audio sampling rate is not changed, feature extraction is carried out on audios with different sampling rates in a corresponding configuration mode according to different sampling rates, and the extracted audios are used for training a neural network model. The neural network model has a general speech recognition tag, in addition, a sampling rate classification label is also added, and a gradient inversion method is used to carry out adversarial training on the sampling rate classification label when the neural network model is trained, so that the multi-sampling-rate speech recognition model obtained by training can autonomously adapt to audios with different sampling rates. Thereafter, the multi-sampling-rate voice recognition model obtained by training can be used for speech recognition, and the purpose of uniformly processing audio input of multiple sampling rates by using the same speech recognition model is achieved.

Description

technical field [0001] The invention relates to the field of artificial intelligence voice interaction, in particular to a multi-sampling rate voice recognition method, device, system and storage medium. Background technique [0002] With the continuous development and progress of artificial intelligence and electronic communication technology, intelligent voice interaction technology is becoming more and more popular and applied in many product fields, including intelligent customer service, call center, smart speaker and smart watch, etc. [0003] However, although they are both speech recognition, the speech sampling rates are different in different application scenarios. If it is necessary to process speech samples with different multi-sampling rates in one system, the following schemes are often used: 1) Unify the sampling rate of the audio by up / down sampling, so as to unify into a speech recognition system. This solution will change the nature of the original audio, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/02G10L15/06G10L15/16

CPCG10L15/02G10L15/063G10L15/16

Inventor施雨豪钱彦旻

OwnerAISPEECH CO LTD

Multi-sampling-rate voice recognition method, device thereof and system and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology