Multilayer index voice document searching method and system thereof

A technology of document retrieval and multi-layer indexing, which is applied in the field of information retrieval, and can solve problems such as the inability to find documents, the problem of out-of-set words, and the inaccurate return of documents, etc.

Inactive Publication Date: 2009-08-19
PEKING UNIV
View PDF0 Cites 70 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006]In the current audio document retrieval system, most of the words are used as the index unit to construct the index, but there is a problem of out-of-set words when using the word to construct the index, so that when the search word The corresponding document cannot be found when it is an out-of-set word
Recently, some scholars hav

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multilayer index voice document searching method and system thereof
  • Multilayer index voice document searching method and system thereof
  • Multilayer index voice document searching method and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] 1. Automatic speech recognition module

[0045] Automatic Speech Recognition frameworks such as image 3 As shown, each module of the system will be introduced in detail below.

[0046] 1. Feature extraction

[0047] The purpose of feature extraction is to extract useful information that better reflects the stability of speech as features for automatic speech recognition. A basic characteristic of speech signal is short-term stationary characteristic, and short-term analysis is the basis of speech signal feature extraction. Before extracting features, it is generally necessary to pre-emphasize the speech signal to enhance the high-frequency components of the speech to reduce the attenuation of the high-frequency components of the speech signal by the channel. Subsequently, the speech signal is divided into frames (usually with a frame length of 25 milliseconds and a frame shift of 10 milliseconds), and a Hamming window is added for smoothing.

[0048] The commonly u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multilayer indexing voice document retrieval method and a system thereof, and belongs to the technical field of information retrieval. The multilayer indexing voice document retrieval method comprises the following steps: (1) feature extraction of a multimedia stream is implemented, thus obtaining a voice feature sequence; (2) a voice identifying decoder is used for searching the voice feature sequences, thus obtaining a word lattice and an optimal identification result; (3) according to the word lattice and the optimal identification result, a word and syllable double-layer indexing database is constructed; and (4) relevant documents of a given query term are searched in the indexing database and returned to users. The multilayer indexing voice document retrieval system comprises an automatic voice identifying module that is used for automatically identifying characters in voice documents; an automatic voice document index constructing module that is used for constructing double indexes of the voice identification result, and a voice document retrieval module that is used for searching the relevant documents of given query terms in the indexing database and returning the documents to users. Compared with the prior art, the multilayer indexing voice document retrieval method and the system can realize quick and accurate searching of multimedia data.

Description

technical field [0001] The invention relates to a multi-layer index voice document retrieval method and system thereof, which uses voice information in multimedia materials for automatic cataloging and retrieval, belongs to the technical field of information retrieval, and can be applied to television stations, radio stations, and multimedia websites. Background technique [0002] With the rapid development of digital multimedia technology, the multimedia information faced by people is growing explosively. As the speed of the Internet continues to increase, more and more multimedia data exists on the Internet. At the same time, the construction of the digital library is becoming more and more perfect. Therefore, how to quickly and accurately obtain the required knowledge from massive multimedia data is an urgent problem to be solved. [0003] In recent years, information retrieval technology has become an important means for people to obtain information, and has profoundly...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G10L15/08
Inventor 吴玺宏迟惠生曲天书万广鲁
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products