Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Generation method of video indexing data and system

A technology for data generation and video indexing, which is applied in the search field and can solve problems such as not being found

Inactive Publication Date: 2013-06-19
SHENZHEN RAISOUND TECH +2
View PDF5 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In fact, for many videos, we are interested in some specific content, such as a news video (30 minutes of news broadcast), its corresponding file name and the content of the web page (such as news titles, and important news content) It is only a small part of the content of the news video, and if the content to be searched (such as "China Merchants Bank", which is a specific name mentioned in a certain financial news) does not appear in the content of the webpage, but appears in the video or audio, then it will be impossible to find

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generation method of video indexing data and system
  • Generation method of video indexing data and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0080] like figure 1 As shown, it is a flow chart of the steps of the method for generating video index data of an embodiment, including the following steps:

[0081] Step S101, acquiring video content and text content related to the video content. In a preferred embodiment of the present invention, step S101 is to use a web crawler to grab webpage information with the video content, and extract the video content and text content related to the video content in the webpage respectively .

[0082] Step S102, extracting characteristic parameters of the text through preset keywords, and performing text classification on the text content to obtain classification information in the text.

[0083] Step S103, according to the classification information in the text, select the corresponding pinyin language model and word language model from the preset language model library.

[0084] Step S104, extract audio data from the video content, and divide the audio data into multiple audio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a generation method of video indexing data and a device. The generation method of the video indexing data comprises the following steps: obtaining video content and text content which is relevant to the video content, classifying the text content, selecting a proper pinyin language model and a proper word language model according to a classification result, segmenting voice data of the video content and classifying speakers, selecting a proper acoustic model according to a speaker classification result, generating a pinyin gridding according to the selected acoustic model and the selected word language model and a first pronunciation dictionary according to the text content, obtaining a word gridding according to the pinyin gridding, the word language mode corresponding to the text content, and a second pronunciation dictionary, recalculating a confidence coefficient of the word gridding to obtain a new work gridding according to the pinyin gridding and the word gridding, and finally combining the new gridding with original video content to obtain the video indexing data. According to the video indexing data, a user can conveniently and accurately retrieve the relevant video content through text keywords.

Description

【Technical field】 [0001] The invention relates to the field of search technology, in particular to a method and system for generating video index data. 【Background technique】 [0002] With the development of network technology, the search function has become an indispensable tool for users. Text-based search engines have become very common. Before searching, it is necessary to establish index data for the search target content, which is used to match the text entered by the user to realize the search function. [0003] Video retrieval technology has also been applied in many search engines. The search engines of Baidu and Google basically search according to the name and label of the video file, and retrieve the corresponding text content of the web page where each audio file is located. However, the video and the audio content (Content) in the video are not formally processed, and these contents are used for effective retrieval. [0004] In fact, for many videos, we are...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 黄石磊刘轶程刚曹文晓
Owner SHENZHEN RAISOUND TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products