Voice recognition technology based on audio media analysis
A speech recognition and audio technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as difficulty in expressing advanced semantic concepts
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0044] The present invention will be further described in detail below in conjunction with examples, so that those skilled in the art can implement it with reference to the description.
[0045] It should be understood that terms such as "having", "comprising" and "including" as used herein do not exclude the presence or addition of one or more other networks or combinations thereof.
[0046] A method for recognizing voice in audio and video in this example, comprising the following steps:
[0047] 1) Extract audio data from video stream.
[0048] 2) Extract 13th-order MFCC, 13th-order first-order differential MFCC, zero-crossing rate, short-term energy and sub-band energy ratio:
[0049] Mel Frequency Cepstral Coefficients (MFCC):
[0050]
[0051] Zero-crossing rate (ZCR):
[0052]
[0053] Short-Term Energy (STE):
[0054] Subband Energy Ratio:
[0055] 3) Classify by SVM classifier:
[0056]
[0057] 4) Discrimination of silence by the double threshold d...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


