Voice detection method and device, electronic device and storage medium

A speech detection technology, applied in speech analysis, speech recognition, instruments, etc., that addresses the insufficient robustness, inconsistency, and low precision of speech detection, with the effects of improving classification accuracy, enabling quick discrimination, and avoiding interference.

Active Publication Date: 2022-06-21
INST OF AUTOMATION CHINESE ACAD OF SCI
7 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

In practical speech detection, the nominal audio sampling rate of the speech to be detected can differ from its actual audio sampling rate, which results in insufficient robustness and low accuracy of speech detection.




Detailed Description of Embodiments

[0040] In order to make the purposes, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure will be described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, embodiments of the present disclosure. All other embodiments obtained by those of ordinary skill in the art based on the embodiments in the present disclosure, without creative work, fall within the protection scope of the present disclosure.

[0041] The audio sampling rate is the number of samples the recording equipment takes of the analog signal per unit time. The higher the sampling rate, the more faithfully and naturally the sampled waveform reproduces the sound wave. Five sampling rates are in common use: 11025 Hz, 22050 Hz, 24000 Hz, 44100 Hz and 48000 Hz. 11025 Hz can achieve the sound quality of AM broadcast, while 22050 Hz and...
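To make the gap between nominal and actual sampling rate concrete, here is a minimal sketch. It is not the method of the disclosure, which uses a pre-trained sampling rate prediction model; it swaps in a simple spectral-energy heuristic for illustration only. The function names, the 99.9% energy threshold and the synthetic test signal are all assumptions. The idea is that audio recorded at a low rate and then stored at a higher nominal rate carries almost no energy above half of its original rate.

```python
import numpy as np

def estimate_effective_bandwidth(signal, nominal_rate, energy_fraction=0.999):
    """Return the frequency (Hz) below which `energy_fraction` of the
    spectral energy lies. Illustrative heuristic, not the patent's model."""
    power = np.abs(np.fft.rfft(signal)) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / nominal_rate)
    cumulative = np.cumsum(power)
    total = cumulative[-1]
    if total == 0:
        return 0.0
    idx = np.searchsorted(cumulative, energy_fraction * total)
    return float(freqs[min(idx, len(freqs) - 1)])

def make_band_limited_noise(actual_rate, nominal_rate, duration=1.0, seed=0):
    """Hypothetical test signal: white noise stored at `nominal_rate` whose
    content above actual_rate / 2 has been removed, mimicking upsampled audio."""
    rng = np.random.default_rng(seed)
    n = int(duration * nominal_rate)
    spectrum = np.fft.rfft(rng.standard_normal(n))
    freqs = np.fft.rfftfreq(n, d=1.0 / nominal_rate)
    spectrum[freqs > actual_rate / 2] = 0.0
    return np.fft.irfft(spectrum, n=n)

# Audio labelled 48000 Hz whose content stops near 8000 Hz (actual rate 16000 Hz).
upsampled = make_band_limited_noise(actual_rate=16000, nominal_rate=48000)
bandwidth = estimate_effective_bandwidth(upsampled, nominal_rate=48000)
implied_rate = 2 * bandwidth  # Nyquist: usable bandwidth is half the sampling rate
```

Here `implied_rate` comes out well below the nominal 48000 Hz, which is the kind of mismatch the disclosure is concerned with. Real speech has a sloping spectrum, so a fixed energy threshold would be far less reliable than on this synthetic noise.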


Abstract

The present disclosure relates to a speech detection method and device, electronic equipment and a storage medium. The method includes: extracting acoustic features of the speech to be detected to obtain a first acoustic feature and a second acoustic feature; inputting the first acoustic feature into a pre-trained sampling rate prediction model to obtain sampling rate information features; and inputting the second acoustic feature and the sampling rate information features into a pre-trained speech detection model to obtain a classification result indicating whether the speech to be detected is real speech or synthetic speech. Using the sampling rate information features in detection makes it possible to quickly judge the audio quality of audio in real scenarios, helps the speech detection model focus on the different frequency bands of the actual audio, prevents false high-frequency content from interfering with the model's discrimination, and improves the classification accuracy of the detection model.
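The data flow in the abstract can be sketched as follows. This is only an illustration of how the pieces connect, under several assumptions: the abstract does not say which acoustic features are used, so log-magnitude spectra at two frame sizes stand in for the first and second acoustic features; `LinearStub` is a hypothetical placeholder with fixed random weights where the pre-trained models would go; and mean pooling followed by concatenation is one plausible way to combine the features, not necessarily the one claimed.

```python
import numpy as np

LABELS = ("real", "synthetic")

def log_spectrum_frames(signal, frame_len, hop):
    """Stand-in acoustic feature (assumption): log-magnitude spectrum per frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-8)

class LinearStub:
    """Hypothetical placeholder for a pre-trained model: a fixed random
    linear layer. A real system would load trained weights instead."""
    def __init__(self, in_dim, out_dim, seed):
        rng = np.random.default_rng(seed)
        self.weights = rng.standard_normal((in_dim, out_dim)) * 0.01

    def __call__(self, features):
        return features @ self.weights

def detect(signal, rate_model, detection_model):
    # Step 1: extract the first and second acoustic features.
    first = log_spectrum_frames(signal, frame_len=1024, hop=512)   # (frames, 513)
    second = log_spectrum_frames(signal, frame_len=512, hop=256)   # (frames, 257)
    # Step 2: the sampling rate model turns the first feature into
    # sampling rate information features (pooled over time here).
    rate_info = rate_model(first).mean(axis=0)
    # Step 3: the detection model sees the second feature together
    # with the sampling rate information features.
    combined = np.concatenate([second.mean(axis=0), rate_info])
    logits = detection_model(combined)
    exp = np.exp(logits - logits.max())  # numerically stable softmax
    probs = exp / exp.sum()
    return LABELS[int(np.argmax(probs))], probs

rate_model = LinearStub(in_dim=513, out_dim=8, seed=0)
detection_model = LinearStub(in_dim=257 + 8, out_dim=2, seed=1)
audio = np.random.default_rng(2).standard_normal(16000)
label, probs = detect(audio, rate_model, detection_model)
```

Because the stubs are untrained, the returned label is meaningless; the sketch only shows that the sampling rate features are computed first and then passed to the detector alongside the second acoustic feature.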

Description

Technical Field

[0001] The present disclosure relates to the field of speech technology, and in particular, to a speech detection method and device, an electronic device, and a storage medium.

Background

[0002] At present, in order to capture more discriminative information, speech detection models use a variety of acoustic features for speech signal processing. However, in practical speech detection, the nominal audio sampling rate of the speech to be detected can differ from the actual audio sampling rate, which leads to insufficient robustness and low precision of speech detection.

Summary of the Invention

[0003] In order to solve the above technical problem or at least partially solve the above technical problem, the embodiments of the present disclosure provide a voice detection method and apparatus, an electronic device, and a storage medium.

[0004] In a first aspect, an embodiment o...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/02, G10L15/06, G10L15/08, G10L25/60
CPC: G10L15/02, G10L15/063, G10L15/08, G10L25/60
Inventor: 傅睿博, 陶建华, 易江燕, 张震, 孙旭东, 刘睿霖, 王立强
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI