Voice recognition method and device, electronic equipment and medium
A technology of speech recognition and electronic equipment, applied in speech recognition, speech analysis, instruments, etc., can solve the problem that real-time monitoring of the accuracy of the user's pronunciation can not be realized, and achieve the effect of improving the efficiency of speech correction
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] figure 1 It is a flow chart of a speech recognition method provided by Embodiment 1 of the present disclosure. This embodiment is applicable to the situation of recognizing whether the user's pronunciation is accurate. / or implemented in the form of hardware.
[0031] Such as figure 1 Described, the method of the present embodiment comprises:
[0032] S110. Acquire the user's pronunciation of the target speech text read by the user, and generate a target speech text image of the target speech text.
[0033] Among them, the target voice text can be written texts such as books, newspapers or magazines that the user reads in daily life, or text reading materials displayed on the web page, or practice questions for language learning. The language of the target voice text can be It is made up of Chinese, other languages, or a combination of multiple languages. In the embodiment of the present invention, the type and language type of the target voice text are only explaine...
Embodiment 2
[0044] As a preferred embodiment of the above embodiment, figure 2 It is a flow chart of a voice recognition method provided in Embodiment 2 of the present disclosure.
[0045] Such as figure 2 As shown, the method includes:
[0046] S210. Acquire the user's pronunciation of the target speech text read by the user, and generate a target speech text image of the target speech text.
[0047] S220. Determine sentences in the target voice text by performing character recognition on the target voice text image.
[0048] S230. Determine whether the sentence in the target phonetic text contains polyphonic characters, and if so, acquire multiple pronunciations of the polyphonic characters.
[0049] Specifically, each word contained in the sentence in the target speech text needs to be queried for its corresponding pinyin in real time. The pinyin query is supported by a conventional dictionary, a network dictionary or a dictionary database. , then acquire multiple pronunciations ...
Embodiment 3
[0059] image 3 A schematic structural diagram of a speech recognition device provided by Embodiment 3 of the present disclosure, the device includes: a target speech text image generation module 310 , a sentence correct pronunciation acquisition module 320 and a speech recognition result determination module 330 .
[0060] Target voice text image generation module 310, for obtaining the user's pronunciation of the user's reciting target voice text, and generating the target voice text image of the target voice text;
[0061] The correct pronunciation acquisition module 320 of the sentence is used to determine the sentence in the target speech text according to the target speech text image, and obtain the correct pronunciation of the sentence in the target speech text;
[0062] A voice recognition result determining module 330, configured to determine the user's voice recognition result according to the user's pronunciation and the correct pronunciation of the sentence in the ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


