Method of video speech recognition and search
A video voice and voice recognition technology, applied in voice recognition, voice analysis, special data processing applications, etc., can solve the problems of lack of detail, large amount of video search data, unapplied search, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0032] Such as figure 2 As shown, in public security, while recording the video content of the camera, obtain the sound file and use the speech recognition technology for corresponding processing, and store it in the cloud, or only save the text file in the cloud, and store the actual data in other convenient large data storage bodies. When searching for audio and video files, single text retrieval or text, video clips and screenshots of corresponding time slices can be used as the retrieval results for two situations.
Embodiment 2
[0034] Such as image 3 As shown, in personal applications, it is also possible to search for video files similar to network video media. For special applications, for example, when sorting out items, record the storage location and report the corresponding item name. When searching, enter the corresponding item name to find the storage location of the item. Prevent the difficulty of finding due to forgetting or searching by non-organizers, such as taking pictures and saying when tidying up the room: put summer clothes here, dad’s shirts here, mom’s coats here, brother’s pencils here , my sister’s cosmetics are put here. When searching, enter a shirt, and then retrieve multiple shirt results. According to the screenshot, determine the time slice of the target shirt or find the location directly. This family application can greatly avoid conflicts caused by reasons such as missing items or family members misunderstanding different memories of the same thing, and is especially c...
Embodiment 3
[0036] Such as Figure 4 As shown, for the search of online video media, the cloud analyzes the video and converts the sound, and marks it according to the Time Line in a subtitle-like manner. (Converted into text by speech recognition technology) can list the corresponding subtitle text and video clips and screenshots of the corresponding Time Line. For example, when the user only remembers part of the lines of a certain movie, this technology can be used to perform video retrieval for this part of the lines.
[0037] In summary, due to the adoption of the above technology, the present invention can perform extensive and targeted search on videos, and at the same time, this technology can also be used for rapid positioning in terms of public security and private item search.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 