Voice retrieval device, voice retrieval method
A sound signal technology applied in the field of voice retrieval devices; it addresses problems such as poor retrieval accuracy.
Examples
Embodiment 1
[0035] As shown in Figure 1, the voice search device 100 of Embodiment 1 physically comprises a ROM (Read Only Memory) 1, a RAM (Random Access Memory) 2, an external storage device 3, an input device 4, an output device 5, a CPU (Central Processing Unit) 6, and a bus 7.
[0036] The ROM 1 stores a voice search program. The RAM 2 is used as a work area for the CPU 6.
[0037] The external storage device 3 is constituted by, for example, a hard disk, and stores as data the audio signal to be analyzed, together with a monophone model, a triphone model, and phoneme time lengths, which are described later.
[0038] The input device 4 is constituted by, for example, a keyboard or a voice recognition device. The input device 4 supplies the CPU 6 with the search word input by the user as text data. The output device 5 includes, for example, a screen such as a liquid crystal display, a speaker, and the like. The output device 5 displays text data ou...
Embodiment 2
[0111] Next, Embodiment 2 of the present invention will be described.
[0112] The voice search device 100 according to Embodiment 1 calculates the output probabilities used for acquiring the likelihood after the search character string has been acquired by the search character string acquisition unit 111. However, the present invention is not limited to this. The voice search device according to Embodiment 2 performs in advance the output probability calculations using the monophone model, which require a large amount of computation when selecting candidate sections corresponding to the search character string, and thereby shortens the search time. That is, the output probabilities corresponding to the search words are obtained in advance for all sections of the audio signal to be searched, and are stored as a search index. Then, at the time of retrieval, the likelihood of the likelihood acquisition section is obtained by adding the output probabilities corresponding ...
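The precompute-then-add idea in this paragraph can be illustrated with a minimal sketch. It is not the patent's implementation: the function names, the use of log-domain probabilities (so that adding is meaningful), and the assumption that each phoneme occupies a fixed number of frames are all hypothetical simplifications made for illustration.

```python
import math


def build_index(frame_log_probs):
    """Store per-frame log output probabilities once, ahead of any search.

    frame_log_probs: one dict per frame, mapping phoneme -> log probability.
    In this sketch the "search index" is simply a copy of that table.
    """
    return [dict(frame) for frame in frame_log_probs]


def section_likelihood(index, start, phonemes, frames_per_phoneme):
    """Add the stored values for a section that begins at frame `start`.

    Assumption (hypothetical): every phoneme spans the same number of frames.
    """
    total = 0.0
    frame = start
    for phoneme in phonemes:
        for _ in range(frames_per_phoneme):
            total += index[frame][phoneme]
            frame += 1
    return total


def best_section(index, phonemes, frames_per_phoneme):
    """Scan every candidate start frame and keep the highest-scoring one."""
    length = len(phonemes) * frames_per_phoneme
    best_start, best_score = None, -math.inf
    for start in range(len(index) - length + 1):
        score = section_likelihood(index, start, phonemes, frames_per_phoneme)
        if score > best_score:
            best_start, best_score = start, score
    return best_start, best_score
```

At search time only table lookups and additions remain, which is where the speed-up described above would come from; the acoustic model is evaluated only while building the index.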
Modification 1
[0123] As described in Embodiment 1 with reference to Figure 7, when the selection unit 121 selects the time length with the highest likelihood, x (10) likelihoods are added for each time length in descending order of likelihood, and the likelihood acquisition sections based on the time length for which the added value is largest are selected. However, the selection method is not limited to this. As illustrated in Figure 11A and Figure 11B, Modification 1 determines which speech rate gives the better likelihood acquisition sections by comparing added values of corrected likelihoods, where each likelihood is multiplied by a weighting coefficient that is larger the higher the likelihood ranks.
[0124] Figure 11B shows an example of the weighting coefficients: the higher a likelihood ranks, the larger its weighting coefficient is set. Figure 11A shows an example of comparing the likelihood of the likelihood acquisition section...
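As a rough sketch of the difference between the plain top-x sum of Embodiment 1 and the rank-weighted sum of Modification 1, consider the following. The function names, the speech-rate labels, and the weight values are hypothetical (the actual coefficients of Figure 11B are not reproduced in this text), and the sketch assumes non-negative likelihood scores for which a larger weight means a larger contribution.

```python
def top_x_sum(likelihoods, x=10):
    """Embodiment 1 style: add the x highest likelihoods for one time length."""
    return sum(sorted(likelihoods, reverse=True)[:x])


def weighted_top_x_sum(likelihoods, weights):
    """Modification 1 style: multiply by rank-dependent weights, then add.

    weights[0] applies to the highest likelihood, weights[1] to the second
    highest, and so on; the weights are expected to be non-increasing.
    """
    ranked = sorted(likelihoods, reverse=True)[:len(weights)]
    return sum(w * score for w, score in zip(weights, ranked))


def select_time_length(likelihoods_by_length, weights):
    """Return the time-length key whose weighted sum is largest."""
    return max(
        likelihoods_by_length,
        key=lambda key: weighted_top_x_sum(likelihoods_by_length[key], weights),
    )


# Hypothetical data: likelihoods of sections for two speech rates.
scores = {"fast": [0.9, 0.2, 0.1], "slow": [0.5, 0.5, 0.4]}
weights = [3, 2, 1]  # hypothetical; larger weight for higher rank
chosen = select_time_length(scores, weights)
```

With this made-up data the unweighted sum favors "slow", while the weighted sum favors "fast", because the weighting emphasizes the single strongest candidate section.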