Method for quickly forming voice data base for key word checkout task

A keyword and database technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of small-scale, uncontrollable, and uncertain voice databases, and achieve good recording quality, simple control, and simple labeling Effect

Inactive Publication Date: 2008-05-21
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] (1) It is difficult to record and organize the voice database
[0006] At present, the voice database used in the keyword detection system is actually recorded and collected under specific requirements. Each recording needs to involve many speakers and operators, and the workload of the voice database is very huge; The database performs post-processing such as labeling, and the workload is several times that of the recording process itself; at the same time, factors such as accent, spoken language, and noise are uncertain and uncontrollable, and the recording work is difficult and inefficient
[0007] (2) Poor flexibility of speech database
[0008] After each collection of the voice database, some characteristics of the database, such as the length of sentences and the distribution of the number of occurrences of keywords, are relatively fixed; even if a certain change can be achieved by selecting a subset of the database, such changes are Very limited, and the size of the database is also reduced, because once the voice data is collected, each word (whether defined as a keyword or a non-keyword) is fixed in the database
[0009] (3) The scale of the database is small, and it is not yet comprehensive for the performance of the inspection system
[0010] Due to the huge workload of actually collecting the voice database, the scale of the voice database used in the keyword detection system is relatively small; at the same time, due to the particularity of the test of the keyword detection system, the real "keyword" in the continuous voice stream The probability of occurrence is generally small, and the occurrence of many keywords cannot appear more times, thus affecting the effective test of system performance
[0011] (4) The characteristics of keyword appearance are uncontrollable
[0012] For the speech database of the test keyword system, it is generally hoped that its characteristics such as the frequency of keywords, the position of the keywords in the sentence, and the distribution of the number of times the keywords appear in each sentence meet certain requirements; It is extremely difficult to completely meet the preset settings in the word speech database, and once a specific setting is satisfied, it is impossible to change it

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for quickly forming voice data base for key word checkout task
  • Method for quickly forming voice data base for key word checkout task
  • Method for quickly forming voice data base for key word checkout task

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] For the existing keyword detection system, when we want to evaluate its system performance, we can construct a speech database for keyword detection tasks according to the following method:

[0068] Assume that there is a keyword detection system X that can handle up to 100 keywords and test its system performance. It is hoped that the database meets the following conditions:

[0069] 100 keywords, each keyword appears 20 times;

[0070] The total number of sentences is 10,000, of which 1,000 sentences appear once, 200 sentences appear twice, and 200 sentences appear three times;

[0071] The average length of a sentence is 15 words.

[0072] Such as figure 1 Shown, be the fast construction of the present invention and be used for the voice database method flowchart of keyword detection system, comprise the following steps:

[0073] Step 1. Utilize or record the isolated word speech database:

[0074] For example, the existing isolated word speech database D contain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method for quickly structuring voice databank used in key word detection task includes recording voice databank of isolated word, confirming key word table and nonkey word table according to requirement of key word detection system, confirming parameters such as total sentence number and time length as well as key word occurrence frequency for key word detection test, connecting key word and nonkey word in accordance with requirement of key word detection system to be sentence by pasting - up means for generating out databank used in key word detection task.

Description

technical field [0001] The invention relates to a construction method of a speech database, in particular to a construction method of a speech database used for keyword detection tasks. Background technique [0002] The fundamental purpose of speech recognition research is to realize the interaction between human beings and machines with natural language, so that the machine has the same auditory function as human beings, can directly receive human speech, understand human intentions and make corresponding responses. Speech is the most natural, most convenient, and most commonly used way of information exchange for human beings. Keyword detection is the identification of a given set of words—called keywords—from a continuous, unrestricted stream of natural speech. Keyword recognition is a branch and an important research direction of speech recognition. Keyword detection technology has demonstrated great value in many application systems. It has brought speech recognition ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 黄石磊谢湘匡镜明
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products