Method and system for optimizing voice recognition acoustic model
An acoustic model and speech recognition technology, applied in the computer field, can solve the problem of low efficiency in optimizing the acoustic model of speech recognition, and achieve the effect of improving optimization efficiency and quality
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0022] Embodiment 1. This embodiment provides a method for optimizing the acoustic model of speech recognition, which is applied to but not limited to voice search or voice input systems. See figure 1 shown, including the following steps:
[0023] S11. Recognize the input speech segment by using the speech recognition acoustic model to obtain a recognition result, and obtain an annotation script of the input speech segment.
[0024] In this embodiment, the user continuously inputs voice to perform a voice search operation, which includes several voice segments, and each voice segment includes voice data representing voice components and mute data representing noise (silence) components.
[0025] In this embodiment, take the processing of one voice segment as an example, other voice segments can perform the same processing, and will not go into details, for example: the user voice inputs a query sentence "how to change the WeChat interface", and the server receives and stores t...
Embodiment 2
[0033] Embodiment 2. This embodiment provides a method for optimizing the acoustic model of speech recognition, which is applied to but not limited to voice search or voice input systems. See figure 2 shown, including the following steps:
[0034] S21. Recognize the input speech segment by using the speech recognition acoustic model to obtain a recognition result, and obtain an annotation script of the input speech segment.
[0035] The specific description is consistent with S11 and will not be repeated here.
[0036] S22. Comparing the recognition result with the tagged script to obtain the wrongly recognized speech segment.
[0037] The specific description is consistent with that of S12 and will not be repeated here.
[0038] S23. Update the training data of the speech recognition acoustic model with the wrongly recognized speech segment.
[0039] In this embodiment, the speech segment acquired in step S22 is further filtered, and the training data of the speech recogn...
Embodiment 3
[0051] Embodiment 3. This embodiment provides a system for optimizing the acoustic model of speech recognition, which is applied to but not limited to the field of voice search or voice input. See Figure 4 As shown, it includes: an acquisition unit 31 , a comparison unit 32 , an update unit 33 and a training unit 34 .
[0052] Wherein, the obtaining unit 31 is configured to use the speech recognition acoustic model to recognize the input speech segment to obtain a recognition result, and obtain the annotation script of the input speech segment.
[0053] In this embodiment, the user continuously inputs voice to the system for optimizing the acoustic model of voice recognition to perform a voice search operation, which includes several voice segments, and each voice segment includes voice data representing audio components and voice data representing noise (silence) components. Silent data.
[0054] In this embodiment, take the processing of one voice segment as an example, ot...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 