Portrait audio video synchronous calibration device and method
A technology of voice and video, calibration device, applied in image communication, instrument, character and pattern recognition, etc., can solve problems such as inability to recognize, out of synchronization of voice information and video information, inability to judge motion without sound, etc., to improve computing power. performance, the effect of reducing the amount of information storage
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0065] Embodiment 1: to the synchronous voice and video detection process
[0066] Step S1, read the audio and video header file information, obtain the total time length of the audio and video 72, the unit is second, a certain moment of the audio and video is t, 1≤t≤72;
[0067] Step S2, set the dynamic lips array P[k], 1≤k≤72, set the initial value of all elements in the array P to 0, set the vocal array S[f], 1≤f≤72, set the array S The initial value of all elements in is set to 0;
[0068] Step S3, sequentially extracting the picture frames at time t of the video file, image 3 It is the binary image of the image frame extracted at the 32nd second of the video file, Figure 6 It is the binary image of the picture frame extracted in the 31st second of the video file, using face recognition technology to recognize the i human face area M in the picture frame at a certain moment t,i , 1≤i≤I, I=1, Figure 4 From image 3 A face region M extracted from 32,1 , Figure 7 F...
Embodiment 2
[0077] Embodiment 2: To asynchronous voice and video detection and calibration process
[0078] Step S1, read the audio and video header file information, obtain the total length of the audio and video time 58, the unit is second, a certain moment of the audio and video is t, 1≤t≤58;
[0079] Step S2, set the dynamic lips array P[k], 1≤k≤58, set the initial value of all elements in the array P to 0, set the vocal array S[f], 1≤f≤58, set the array S The initial value of all elements in is set to 0;
[0080] Step S3, sequentially extracting the picture frames at time t of the video file, Figure 11 is the binary image of the image frame extracted from the 19th video file, Figure 14 It is the binary image of the picture frame extracted from the 18th second of the video file, and the i face area M in the picture frame at a certain moment is recognized by face recognition technology t,i , 1≤i≤I, I=3, Figure 12 From Figure 11 The three face regions M extracted from 19,1 ,M ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com