Voice recognition method, device and apparatus and storage medium

A speech recognition technology based on speech features, applied in speech recognition, speech analysis, instruments, etc., that addresses the problem that the speech recognition rate cannot be further improved

Inactive Publication Date: 2018-07-31
GUANGDONG XIAOTIANCAI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] The most critical thing in the above process is the accuracy of speech recognition results, and rel



Examples


Embodiment 1

[0056] Figure 1 is a flow chart of a speech recognition method provided by Embodiment 1 of the present invention. This embodiment is applicable to improving the speech recognition rate. The method can be executed by a speech recognition device, which can be implemented in software and/or hardware and deployed in a terminal, typically a mobile phone, a tablet computer, or the like. As shown in Figure 1, the method specifically includes the following steps:

[0057] S110. When a sounding event is triggered, receive the voice signal and the image signal including the lips, which are collected from the user during the sounding event and sent by the microphone;

[0058] In a specific embodiment of the present invention, the sounding event may represent a sound generated during learning activities or entertainment activities using the functions of the terminal, such as reading a text or singing a song. And usually in the process of carrying out the above act...

Embodiment 2

[0078] Figure 2 is a flow chart of a speech recognition method provided by Embodiment 2 of the present invention. This embodiment is applicable to improving the speech recognition rate. The method can be executed by a speech recognition device, which can be implemented in software and/or hardware and deployed in a server. As shown in Figure 2, the method specifically includes the following steps:

[0079] S210. Receive the voice feature signal and the lip language feature signal sent by the terminal;

[0080] In a specific embodiment of the present invention, the server can receive the voice feature signal and the lip language feature signal sent by the terminal, so that the server can further identify and analyze the voice feature signal and the lip language feature signal.

[0081] S220. Perform matching analysis on the voice feature signal and the preset voice signal to generate a voice recognition result;

[0082...
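
The matching analysis in S220 is not spelled out in the visible excerpt. As a minimal sketch only, assuming that feature signals are fixed-length numeric vectors and that matching means picking the preset with the highest cosine similarity (both are assumptions, not the patent's stated method), it could look like this. The names `cosine_similarity`, `match_against_presets` and `PRESET_VOICE` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector matches nothing
    return dot / (norm_a * norm_b)


def match_against_presets(feature, presets):
    """Return (label, score) of the preset template closest to `feature`.

    `presets` maps a label to a template vector. Returns (None, -1.0)
    when `presets` is empty.
    """
    best_label, best_score = None, -1.0
    for label, template in presets.items():
        score = cosine_similarity(feature, template)
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


# Hypothetical preset voice signals, for illustration only.
PRESET_VOICE = {
    "hello": [1.0, 0.0, 0.0],
    "apple": [0.0, 1.0, 0.0],
}

voice_result, voice_score = match_against_presets([0.9, 0.1, 0.0], PRESET_VOICE)
```

The same function could be reused with a second table of preset lip language signals to produce the lip language recognition result.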

Embodiment 3

[0108] Figure 3 is a flow chart of a voice recognition method provided by Embodiment 3 of the present invention. This embodiment is applicable to improving the voice recognition rate. The method can be executed by a voice recognition device, which can be implemented in software and/or hardware and deployed in a device, typically a mobile phone, a tablet computer, a server, or the like. As shown in Figure 3, the method specifically includes the following steps:

[0109] The microphone and the terminal establish a communication connection;

[0110] When the sounding event is triggered, the microphone collects the voice signal and the image signal including the lips during the execution of the sounding event, and sends the voice signal and the image signal including the lips to the terminal;

[0111] The terminal performs feature extraction on the voice signal to generate a voice feature signal, and performs feature extractio...
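
The excerpt does not say which features the terminal extracts. The sketch below uses two simple stand-ins chosen purely for illustration: per-frame log energy for the voice signal, and a height-to-width lip opening ratio for the image signal. Both feature choices, and the names `frame_log_energy` and `lip_opening_ratio`, are assumptions rather than the patent's method.

```python
import math


def frame_log_energy(samples, frame_size=4):
    """Split `samples` into non-overlapping frames and return the log
    energy of each full frame (a trailing partial frame is dropped)."""
    features = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energy = sum(s * s for s in frame)
        features.append(math.log(energy + 1e-10))  # offset avoids log(0)
    return features


def lip_opening_ratio(lip_boxes):
    """Given one (width, height) lip bounding box per image frame,
    return height / width per frame (0.0 when the width is zero)."""
    return [h / w if w > 0 else 0.0 for (w, h) in lip_boxes]


voice_features = frame_log_energy([1, 1, 1, 1, 0, 0, 0, 0])
lip_features = lip_opening_ratio([(4, 2), (0, 1)])
```

A real terminal would more likely use established acoustic features and a lip landmark detector; the point here is only that each raw signal is reduced to a compact feature signal before being sent to the server.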



Abstract

The invention discloses a voice recognition method, device and apparatus and a storage medium. The method includes: when a sounding event is triggered, receiving a voice signal and an image signal including the lips, which are collected from the user during the sounding event and sent by a microphone; performing feature extraction on the voice signal to generate a voice feature signal, and performing feature extraction on the image signal including the lips to generate a lip-language feature signal; sending the voice feature signal and the lip-language feature signal to a server to instruct the server to match the voice feature signal with a preset voice signal to generate a voice recognition result and to match the lip-language feature signal with a preset lip-language signal to generate a lip-language recognition result; and, if the similarity between the voice recognition result and the lip-language recognition result is greater than or equal to a similarity threshold, generating a recognition feedback result according to the voice recognition result and sending the recognition feedback result to the terminal. The embodiments of the invention improve the voice recognition rate.
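
The final cross-check between the two recognition results can be sketched as follows. This is a hedged illustration: it assumes both results are text strings, uses `difflib.SequenceMatcher` from the Python standard library as a stand-in similarity measure, and picks an arbitrary threshold of 0.8. None of these choices come from the patent. The abstract also does not say what happens below the threshold, so the sketch simply returns `None` in that case.

```python
from difflib import SequenceMatcher

SIMILARITY_THRESHOLD = 0.8  # hypothetical value, not taken from the patent


def result_similarity(voice_result, lip_result):
    """Similarity in [0, 1] between the two recognition results."""
    return SequenceMatcher(None, voice_result, lip_result).ratio()


def make_feedback(voice_result, lip_result, threshold=SIMILARITY_THRESHOLD):
    """Build the recognition feedback result from the voice recognition
    result when the lip-language result agrees closely enough."""
    if result_similarity(voice_result, lip_result) >= threshold:
        return {"recognized_text": voice_result}
    return None  # below-threshold behaviour is not given in the excerpt


accepted = make_feedback("good morning", "good morning")
rejected = make_feedback("good morning", "xyz")
```

Using the lip-language result only as a consistency check, while the feedback itself is built from the voice recognition result, mirrors the wording of the abstract.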

Description

Technical field

[0001] Embodiments of the present invention relate to speech recognition technology, and in particular to a speech recognition method, device, equipment and storage medium.

Background

[0002] With the advent of the electronic information age, mobile terminals and their external devices, such as children's tablet computers and microphones, are becoming more and more popular. In addition, the functions that these devices can realize are becoming more and more abundant. For example, a microphone can be connected to a mobile terminal so that the user can practice a language or sing songs according to the content displayed on the mobile terminal. In this process, the microphone records the user's voice in real time and uploads the sound to the mobile terminal, which performs the corresponding speech recognition to obtain the speech recognition result, and then give the evaluation ...


Application Information

IPC(8): G10L15/02, G10L15/25, G10L15/30
CPC: G10L15/02, G10L15/25, G10L15/30
Inventor 李滨何
Owner GUANGDONG XIAOTIANCAI TECH CO LTD