Media segment-based speaking detection method and system
A detection method and media technology, applied in speech analysis, instrumentation, computing, etc., can solve problems such as weak generalization ability and decreased detection rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0047] The technical solution of the present invention will be described in detail below in conjunction with the embodiments and accompanying drawings.
[0048] Such as figure 1 , the processing work of the method provided by the embodiment of the present invention includes the following specific steps:
[0049] Step 1, divide the input media signal S(t) into audio signal S 1 (t) and the video signal S 2 (t), which are processed separately,
[0050] For audio signal S 1 (t), processed as follows:
[0051] (1) To the audio signal of the media file of input, calculate the harmonic frequency vector in the discrete Fourier window, suppose to obtain a common harmonic frequency in a plurality of discrete Fourier window DFT (DiscreteFouriertransform) in the embodiment.
[0052] (2) Calculate the likelihood ratio logΛ(t) of each frame containing a harmonic frequency component as an audio feature, and t is the frame label of the audio.
[0053] In specific implementation, (1) and...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com