System and method for converting audio/video data into written records
An audio data conversion technology applied in the field of data processing. It addresses problems such as meeting scenes that cannot be reconstructed afterwards and meeting content that is recorded incompletely or without detail, and it enables quick browsing and positioning of the record at reduced cost.
Examples
Embodiment 1
[0048] As shown in Figure 1 and Figure 2, the system for converting audio and video data into text records proposed by the present invention includes a data collection part, a data identification part, a data organization part, and a data supplement and correction part.
[0049] The data collection part includes data collection devices such as microphones and cameras.
[0050] The microphone captures the audio data of the participant who is currently speaking. When a participant starts to speak, the microphone collects that participant's audio data and determines from the intensity of the collected audio whether the participant is still speaking or has paused. If a pause exceeds a certain time (for example, 3 s), the participant's speech is considered to be over; the system then records the start time and end time of that participant's audio data, and records the audio data of the p...
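The intensity-based pause rule described in [0050] can be illustrated with a minimal sketch. It is not the patent's implementation: the function name `segment_speech`, the frame format (timestamp in seconds paired with a normalized intensity), and the `intensity_threshold` value are assumptions made for illustration. Only the 3 s pause limit comes from the text.

```python
from typing import Iterable, List, Tuple


def segment_speech(
    frames: Iterable[Tuple[float, float]],
    intensity_threshold: float = 0.1,  # assumed value, not from the patent
    max_pause_s: float = 3.0,          # the "for example, 3 s" limit in the text
) -> List[Tuple[float, float]]:
    """Return (start_time, end_time) for each detected speech turn.

    frames: (timestamp_s, intensity) pairs in increasing time order.
    """
    segments: List[Tuple[float, float]] = []
    start = None        # time at which the current turn began
    last_voiced = None  # time of the most recent frame at or above the threshold
    for t, intensity in frames:
        if intensity >= intensity_threshold:
            if start is None:
                start = t  # the participant starts to speak
            last_voiced = t
        elif start is not None and t - last_voiced > max_pause_s:
            # The pause exceeded the limit: the speech is considered over.
            segments.append((start, last_voiced))
            start = None
            last_voiced = None
    if start is not None:
        # The input ended while a turn was still open; close it.
        segments.append((start, last_voiced))
    return segments
```

A short pause (for instance one second of silence) leaves the turn open, so the surrounding speech is kept as a single segment; only a pause longer than `max_pause_s` closes it.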
Embodiment 2
[0085] The present invention also proposes a method for converting audio and video data into text records. The flow chart of the method is shown in Figure 3, and the method includes the following steps:
[0086] Step S21, data collection:
[0087] When a participant starts to speak, the microphone collects the audio data of the participant who is currently speaking and determines from the intensity of the collected audio whether the participant is still speaking or has paused. If a pause exceeds a certain period of time (for example, 3 s), the participant is considered to have finished speaking. The start time and end time of that participant's audio data are then recorded, and the audio data is sent, together with the start time (or end time) and the device identifier of the microphone, to the data identification step. The function of transmitting the device identifier of the microphone i...
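The hand-off in step S21 can be sketched as a small record passed to the next step. This is an illustrative sketch only: the names `SpeechSegment` and `send_to_identification`, the use of an in-process `queue.Queue` as the transport, and the `use_end_time` switch are assumptions. The text specifies only that the audio data travels with the start time (or end time) and the microphone's device identifier.

```python
import queue
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechSegment:
    """One finished speech turn as handed to the data identification step."""
    audio: bytes
    timestamp: float     # start time, or end time if that was chosen
    timestamp_kind: str  # "start" or "end"
    device_id: str       # identifier of the microphone that captured the audio


def send_to_identification(
    out_queue: "queue.Queue[SpeechSegment]",
    audio: bytes,
    start_time: float,
    end_time: float,
    device_id: str,
    use_end_time: bool = False,  # hypothetical switch between the two options
) -> SpeechSegment:
    """Package a finished turn and place it on the queue for identification."""
    if end_time < start_time:
        raise ValueError("end_time must not precede start_time")
    segment = SpeechSegment(
        audio=audio,
        timestamp=end_time if use_end_time else start_time,
        timestamp_kind="end" if use_end_time else "start",
        device_id=device_id,
    )
    out_queue.put(segment)
    return segment
```

A frozen dataclass is used so that a segment cannot be modified after it has been sent; a real system might equally send the same fields over a network connection.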
Embodiment 3
[0123] The present invention also proposes a method for converting audio and video data into text records. The flow chart of the method is shown in Figure 4, and the method includes the following steps:
[0124] Step S30, preparatory work:
[0125] Start the microphone and camera, create a list of participants, and create a file address at which the text will be saved. The list of participants includes each participant's unique identity tag, and also includes the participant's voiceprint feature data and facial feature data, which are collected later;
[0126] Each participant is assigned a unique identity tag. For example, in a one-party conference, "Participant A", "Participant B", and "Participant C" can be assigned to participants as identity tags; in a multi-party conference, "Participant A1", "Participant B2", and "Participant C1" can be assigned to participants as identity tags, where the first characters "A", "B", and "C" in the tags represent each conference party. The second characte...
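The participant list prepared in step S30 can be sketched as a mapping from identity tag to a record whose feature fields start out empty. This is a hypothetical illustration: the names `Participant`, `create_participant_list`, and `conference_party` are not from the patent, and storing features as lists of floats is an assumption. The helper extracts only the party character, since that is the only part of the multi-party tag the excerpt explains.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional

TAG_PREFIX = "Participant "  # matches the example tags in the text


@dataclass
class Participant:
    identity_tag: str
    # Filled in later, once the data has been collected (assumed representation).
    voiceprint_features: Optional[List[float]] = None
    facial_features: Optional[List[float]] = None


def create_participant_list(tags: Iterable[str]) -> Dict[str, Participant]:
    """Build the participant list, enforcing that identity tags are unique."""
    participants: Dict[str, Participant] = {}
    for tag in tags:
        if tag in participants:
            raise ValueError(f"duplicate identity tag: {tag}")
        participants[tag] = Participant(identity_tag=tag)
    return participants


def conference_party(tag: str) -> str:
    """Return the first character after the prefix, which names the party."""
    if not tag.startswith(TAG_PREFIX) or len(tag) <= len(TAG_PREFIX):
        raise ValueError(f"unrecognized identity tag: {tag}")
    return tag[len(TAG_PREFIX)]
```

For example, `conference_party("Participant B2")` returns `"B"`, and every entry returned by `create_participant_list` has `voiceprint_features` and `facial_features` set to `None` until collection takes place.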