System and method for converting audio/video data into written records

An audio data conversion technology, applied in the field of data processing, can solve problems such as unrecoverable scenarios, incomplete and detailed meeting content, etc., and achieve the effect of quick browsing and positioning, and reduced costs

Active Publication Date: 2017-05-31
GUANGZHOU SHIYUAN ELECTRONICS CO LTD +1
View PDF6 Cites 64 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the technical problem that the content of the above-mentioned recorded meeting is incomplete and detailed, and the scene at that time cannot be restored after subsequent viewing of the record, the present invention provides a system and method for converting audio and video data into text records. The technical solution is as follows

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for converting audio/video data into written records
  • System and method for converting audio/video data into written records
  • System and method for converting audio/video data into written records

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0048] Such as figure 1 and figure 2 As shown, the system for converting audio and video data into text records proposed by the present invention includes a data collection part, a data identification part, a data organization part, and a data supplement and correction part.

[0049] The data collection part includes data collection devices such as microphones and cameras.

[0050] The microphone is used to capture the audio data of the participant who is currently speaking. When the participant starts to speak, the microphone collects the audio data of the participant who is currently speaking, and judges whether the participant who is currently speaking is speaking or is paused according to the intensity of the collected audio data. If the pause exceeds a certain time (for example, 3s), it is considered that the participant's speech is over, record the start time and end time of the audio data of the participant who is currently speaking, and record the audio data of the p...

Embodiment 2

[0085] The present invention also proposes a method for converting audio and video data into text records, the method flow chart is as follows image 3 shown, including the following steps:

[0086] Step S21, data collection:

[0087] When a participant starts to speak, the microphone collects the audio data of the participant who is currently speaking, and judges whether the participant who is currently speaking is speaking or pauses according to the intensity of the collected audio data. If the pause exceeds a certain period of time (for example, 3s), the participant is considered After the participant finishes speaking, record the start time and end time of the audio data of the participant who is currently speaking, and send the audio data of the participant who is currently speaking together with the start time (or end time) and the device identifier of the microphone to the data identification step. The function of transmitting the device identifier of the microphone i...

Embodiment 3

[0123] The present invention also proposes a method for converting audio and video data into text records, the method flow chart is as follows Figure 4 shown, including the following steps:

[0124] Step S30, preparatory work:

[0125] Start the microphone and camera, create a list of participants, and create a file address to save the text, where the list of participants includes the unique identity tag of the participant, and also includes the voiceprint feature data and facial feature data of the participant to be collected later;

[0126] Each participant is assigned a unique identity tag. For example, in a one-party conference, "Participant A", "Participant B", and "Participant C" can be used as ID tags to assign participants; in a multi-party conference, you can Use "Participant A1", "Participant B2", "Participant C1" as identity tags to assign to participants, where the first characters "A", "B" and "C" in the tags represent each conference party, The second characte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a system and method for converting audio/video data into written records. The system comprises a data acquisition part, a data recognition part and a data organization part. The data acquisition part comprises an audio acquisition module and a video acquisition module. The data recognition part comprises a voice and voiceprint recognition module and a face and expression recognition module. The data organization part generates the written records according to text information, recognition starting time, the identity tag of the current speaker and the emotion of the current speaker. The whole audio/video data process is saved more meticulously and more completely, so as to be closer to reality. Audio/video data are converted into texts to be saved, so that storage cost and transmission cost are reduced greatly, follow-up checking of records can be achieved conveniently, and conference content can be browsed and positioned more quickly.

Description

technical field [0001] The invention relates to a data processing technology, in particular to a system and method for converting audio and video data into text records. Background technique [0002] When an audio and video conference is held, in order to record the content of the meeting, the camera is usually used to collect video data and the microphone to collect audio data or only the microphone is used to collect audio data, and the audio and video data or audio data are saved as multimedia files and stored in the storage device; through Play multimedia files, you can watch or listen to the conference content. Alternatively, a dedicated meeting recorder can record the content of the meeting through input devices such as computers or by handwriting. [0003] Using cameras, microphones and other equipment to record audio and video data requires storing audio and video files in storage devices, which requires a large storage space and high cost. In the later stage, the c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/25G10L17/22
CPCG10L15/25G10L17/22
Inventor 李纯冬
Owner GUANGZHOU SHIYUAN ELECTRONICS CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products