Acoustic model conditioning on sound features
An acoustic model and sound technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of accuracy, noise, background sound or music, etc. in speech recognition, so as to improve market competitiveness and overall profitability. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0036] Various design choices for relevant aspects of the conditional acoustics model are described below. Design choices for the various aspects are independent of each other and work together in any combination, except where noted.
[0037] acoustic model
[0038] An ASR's acoustic model takes input including a speech audio segment and produces an output of inferred probabilities for one or more phonemes. Some models can infer senone probabilities, which are a type of phoneme probabilities. In some applications, the output of an acoustic model is a SoftMax collection of probabilities over a recognized phoneme or senone.
[0039] Some ASR applications run acoustic models on spectral components computed from frames of audio. The spectral components are, for example, mel-frequency cepstral coefficients (MFCCs) calculated over a window of 25 milliseconds of audio samples. For example, acoustic model inference may repeat at intervals of every 10 milliseconds.
[0040] The spec...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


