Voice control method and device

A voice control and voice information technology, applied in voice analysis, voice recognition, and telephone communication, that addresses voice delay, audio and video falling out of sync, and standard recording programs being unusable on the robot, and that avoids recording delay.

Status: Inactive. Publication Date: 2017-05-17
UBTECH ROBOTICS CORP LTD

AI Technical Summary

Problems solved by technology

[0003] 1. All ordinary speech must pass through voice recognition before it is input. This adds a large delay, so audio and video easily fall out of sync.
[0004] 2. The video call or voice recording program must be customized, because sound has to be imported through the API provided by the voice engine. Ordinary third-party video calling or voice recording programs that call the standard Android AudioRecord cannot be used on the robot.

Examples

Embodiment 1

[0047] The voice control method shown in Figure 1 is applied to a system provided with a first audio unit and a second audio unit. A voice call or recording function occupies the audio input unit, which leaves the speech recognition engine unable to use an audio input unit such as a microphone for voice command recognition. To address this shortcoming, an additional audio input unit is introduced in the hardware, and the sound source of the speech recognition engine is designated as this additional audio input unit, so that voice commands can be recognized in parallel during calls or recordings.

[0048] Specifically, one more microphone source is introduced into the hardware. It can be connected through the I2S (Inter-IC Sound) bus, which is dedicated to data transmission between audio devices and is widely used in multimedia systems. I2S transmits the clock and data signals along separate wires. By separating the data and clock signals, it avoids the dis...
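The parallel arrangement described in this embodiment can be illustrated with a minimal sketch. It assumes each audio unit is modeled as an independent queue of frames, with one thread standing in for the speech recognition engine and another for the call or recording program. The names `consume` and `run_parallel`, and the use of strings as audio frames, are hypothetical and are not taken from the patent.

```python
import queue
import threading


def consume(source: queue.Queue, sink: list) -> None:
    """Drain one audio source until the None sentinel arrives."""
    while True:
        frame = source.get()
        if frame is None:
            break
        sink.append(frame)


def run_parallel(first_frames, second_frames):
    """Read two independent audio sources on separate threads.

    The first source stands in for the unit feeding the speech
    recognition engine, the second for the unit feeding a call or
    recording program (both are assumptions for illustration).
    """
    first_source, second_source = queue.Queue(), queue.Queue()
    recognized, recorded = [], []
    workers = [
        threading.Thread(target=consume, args=(first_source, recognized)),
        threading.Thread(target=consume, args=(second_source, recorded)),
    ]
    for worker in workers:
        worker.start()
    for frame in first_frames:
        first_source.put(frame)
    for frame in second_frames:
        second_source.put(frame)
    # One sentinel per source tells its consumer to finish.
    first_source.put(None)
    second_source.put(None)
    for worker in workers:
        worker.join()
    return recognized, recorded
```

Because each consumer owns its own source, neither reader blocks the other, which is the property the additional audio input unit is meant to provide.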

Embodiment 2

[0062] The voice control method shown in Figure 2 is applied to a system provided with a first audio unit and a second audio unit, and includes the following steps:

[0063] S201. Allocate the first audio unit as the input source of a speech recognition engine. The terms "first" and "second" in the present invention only distinguish different components and do not imply any order. The first audio unit may be allocated as the input source of the speech recognition engine; alternatively, another audio unit, such as the second audio unit, may be allocated instead.

[0064] Specifically, the allocation may be implemented through means such as an application programming interface (API).
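As a rough illustration of step S201, the sketch below models allocation as a single call that points the engine at one of the known audio units. The classes `SpeechRecognitionEngine` and `AudioRouter` and their methods are hypothetical; the source does not describe the actual API.

```python
class SpeechRecognitionEngine:
    """Hypothetical engine that reads from whichever unit is allocated."""

    def __init__(self):
        self.input_source = None


class AudioRouter:
    """Hypothetical allocation API for a system with several audio units."""

    def __init__(self, units):
        self.units = set(units)
        self.engine = SpeechRecognitionEngine()

    def allocate(self, unit):
        """Make `unit` the input source of the speech recognition engine."""
        if unit not in self.units:
            raise ValueError(f"unknown audio unit: {unit}")
        self.engine.input_source = unit

    def free_units(self):
        """Units still available to other programs such as call recording."""
        return sorted(self.units - {self.engine.input_source})
```

Calling `allocate("second")` after `allocate("first")` simply swaps the roles, matching the statement that either unit may serve as the engine's input source.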

[0065] By allocating the input source of the speech recognition engine, it is convenient to arrange or adjust the positions of the first audio...

Embodiment 3

[0078] The voice control device shown in Figure 3 includes:

[0079] 111. A first acquiring unit, configured to acquire first voice information input by the first audio unit.

[0080] 112. A second acquiring unit, configured to acquire second voice information input by the second audio unit.

[0081] Typically, the first audio unit and the second audio unit each include a microphone, a microphone array, a microphone interface, a microphone array interface, or a wireless audio input device.

[0082] 101. An allocating unit, configured to allocate the first audio unit as an input source of a speech recognition engine.

[0083] 102. A receiving unit, configured to receive a wake-up instruction for waking up the first audio unit.

[0084] 103. A second judging unit, configured to judge whether the first audio unit is allowed to be woken up and, if so, to wake up the first audio unit.

[0085] 120. A recognition unit, configured to recog...
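The interaction between the receiving unit (102) and the second judging unit (103) can be sketched as a simple gate. The source does not say what determines whether waking is allowed, so the `wake_allowed` flag below is an assumption, as are the names `AudioUnit` and `handle_wake_instruction`.

```python
from dataclasses import dataclass


@dataclass
class AudioUnit:
    name: str
    awake: bool = False
    # Assumption: the permission policy is reduced to a single flag.
    wake_allowed: bool = True


def handle_wake_instruction(unit: AudioUnit) -> bool:
    """Wake the unit only if the judging step permits it.

    Returns True when the unit was woken, False when the request
    was refused.
    """
    if not unit.wake_allowed:
        return False
    unit.awake = True
    return True
```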

Abstract

The invention discloses a voice control method applied to a system provided with a first audio unit and a second audio unit. The method includes the steps of acquiring first voice information input by the first audio unit, recognizing a voice command in the first voice information, judging according to the voice command whether to stop acquiring second voice information input by the second audio unit, and, if so, stopping acquisition of the second voice information. In this system, the first audio unit serves as the audio input source of a voice recognition engine and the second audio unit serves as the input source of other applications such as call recording, so voice commands can be recognized in parallel during a call or voice recording. This solves the general problem in the industry that voice commands cannot be processed in parallel during an audio or video call.
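The control flow in the abstract can be sketched as one step that recognizes a command from the first audio unit and decides whether to stop capture on the second. This is an illustrative sketch only: text strings stand in for audio, and `STOP_COMMANDS`, `recognize_command`, `SecondAudioCapture`, and `control_step` are hypothetical names, not part of the patent.

```python
# Assumption: the set of commands that end second-unit capture.
STOP_COMMANDS = {"stop recording", "hang up"}


def recognize_command(first_voice_info: str) -> str:
    """Stand-in recognizer: normalizes text that represents audio."""
    return first_voice_info.strip().lower()


class SecondAudioCapture:
    """Stand-in for acquisition of second voice information."""

    def __init__(self):
        self.active = True

    def stop(self):
        self.active = False


def control_step(first_voice_info: str, capture: SecondAudioCapture) -> str:
    """Recognize a command and stop second-unit capture if it asks for that."""
    command = recognize_command(first_voice_info)
    if command in STOP_COMMANDS:
        capture.stop()
    return command
```

Ordinary speech on the first unit leaves the second unit's capture untouched; only a recognized stop command ends it.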

Description

Technical field

[0001] The invention relates to the field of voice recognition, and in particular to a voice control method and device.

Background

[0002] At this stage, electronic devices with voice control functions generally have only one microphone or pickup as the audio input unit in hardware. During a voice call or sound recording, this microphone is occupied, and the voice recognition engine cannot use this single microphone for voice command recognition. In the existing technology, the voice engine and the video call or voice input are usually written into one application, so that speech is first recognized by the voice engine; if it is recognized as not being an instruction, it is passed through transparently to the video call or voice input logic. This approach has two disadvantages:

[0003] 1. All ordinary speech must pass through voice recognition before it is input. This adds a large delay, so audio and video easily fall out of sync. ...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): H04M1/725, G10L15/22, G10L15/30, G10L15/34, G10L17/10, H04M1/72403, H04M1/72433
CPC: G10L15/22, G10L15/30, G10L15/34, G10L17/10, H04M1/72433, H04M1/72403
Inventor: 王嘉晋, 熊友军
Owner: UBTECH ROBOTICS CORP LTD