News video retrieval method based on speech classification and recognition
A speech classification and recognition technology, applied in speech recognition, speech analysis, television, and related fields. It addresses query methods that do not match how people habitually search, the inability to find speakers, and the difficulty users have in obtaining what they are looking for.
Embodiment Construction
[0019] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0020] As shown in Figure 1, a news video retrieval method based on speech classification and recognition includes the following steps:
[0021] (1) Use a sound classifier to segment out the standard-speech segments of the news video; in this embodiment, standard Mandarin is used as the example of standard speech;
[0022] Audio classification uses a support vector machine (SVM) classification model and consists of two parts: classifier model training and classification prediction. The audio feature is a 13-dimensional vector composed of log energy and Mel-frequency cepstral coefficients (MFCC).
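As a rough illustration of the 13-dimensional feature described above, the following sketch computes one log-energy value plus 12 MFCCs for a single audio frame. The 1 + 12 split, the 16 kHz sample rate, the 26 mel filters, the Hamming window, and all function names are assumptions made for this example; the text only states that the vector combines log energy and MFCC. A naive DFT is used so that the sketch needs nothing beyond the standard library.

```python
import cmath
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of a Hamming-windowed frame, naive DFT."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        spec.append(abs(acc) ** 2 / n)
    return spec


def mel_filterbank_energies(spec, sample_rate, n_filters):
    """Apply triangular filters spaced evenly on the mel scale."""
    n_bins = len(spec)
    n_fft = 2 * (n_bins - 1)
    mel_max = hz_to_mel(sample_rate / 2.0)
    # Filter edges as fractional DFT bin positions (n_filters + 2 points).
    edges = [mel_to_hz(mel_max * i / (n_filters + 1)) * n_fft / sample_rate
             for i in range(n_filters + 2)]
    energies = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        total = 0.0
        for k in range(n_bins):
            if lo < k <= mid:
                total += spec[k] * (k - lo) / (mid - lo)   # rising slope
            elif mid < k < hi:
                total += spec[k] * (hi - k) / (hi - mid)   # falling slope
        energies.append(total)
    return energies


def frame_features(frame, sample_rate=16000, n_filters=26, n_mfcc=12):
    """Return [log energy, c1, ..., c12] for one frame (13 values by default)."""
    eps = 1e-10  # floor that keeps log() defined for silent frames
    log_energy = math.log(max(sum(x * x for x in frame), eps))
    spec = power_spectrum(frame)
    log_mel = [math.log(max(e, eps))
               for e in mel_filterbank_energies(spec, sample_rate, n_filters)]
    # DCT-II of the log filterbank energies; coefficient 0 is skipped
    # because the log-energy term takes its place in this sketch.
    mfcc = [sum(v * math.cos(math.pi * n * (m + 0.5) / n_filters)
                for m, v in enumerate(log_mel))
            for n in range(1, n_mfcc + 1)]
    return [log_energy] + mfcc
```

Calling `frame_features(frame)` on a 256-sample frame returns a 13-element list; in practice a library FFT would replace the naive DFT for speed.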
[0023] In this embodiment, classifier model training proceeds as follows: first select training samples, then extract the audio features formed by the log energy and Mel cepstral co...
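The two parts of an SVM classifier, model training and classification prediction, can be sketched as follows. This is only an illustration under stated assumptions: the text does not specify a kernel or solver, so the sketch uses a linear SVM trained by batch sub-gradient descent on the soft-margin hinge-loss objective. The function names, the hyperparameters (`lam`, `lr`, `epochs`), and the +1/-1 labeling for standard speech versus other audio are hypothetical.

```python
def train_linear_svm(samples, labels, lam=0.01, lr=0.1, epochs=100):
    """Batch sub-gradient descent on the soft-margin (hinge-loss) objective.

    samples: list of equal-length feature vectors
    labels:  +1 (assumed: standard speech) or -1 (assumed: other audio)
    Returns (weights, bias).
    """
    n, dim = len(samples), len(samples[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]   # gradient of the L2 regularizer
        grad_b = 0.0
        for x, y in zip(samples, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            if margin < 1.0:              # sample violates the margin
                for j in range(dim):
                    grad_w[j] -= y * x[j] / n
                grad_b -= y / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(w, b, x):
    """Return +1 or -1 for a single feature vector."""
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0.0 else -1
```

With real data, each sample would be a 13-dimensional feature vector taken from a labeled audio frame, and `predict` would be applied frame by frame to decide which stretches of the news audio count as standard speech.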