Speech enhanced interaction method, system, storage medium and electronic device

A technology of speech enhancement and interaction method, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of poor axial pickup effect, deterioration of effect, and large complexity.

Active Publication Date: 2018-11-23
FUZHOU ROCKCHIP SEMICON
View PDF8 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The single-microphone voice enhancement technology has a small algorithm complexity, but the suppression effect on non-stationary noise is poor, and it is easy to cause different degrees of voice distortion
Due to the limitations of the linear microphone array speech enhancement method, the effect of picking up sound in the normal direction of the array is better, but the effect of picking up sound in the axial direction is poor; at the same time, although adaptive beamforming has real-time tracking of noise, However, when the noise environment is complex and the reverberation is large, it is difficult to guarantee the accuracy of sound source localization, and the accuracy of adaptive tracking in the null direction is also difficult to guarantee, resulting in different degrees of voice distortion.
In order to avoid the deterioration of the effect of adaptive beams in complex environments, it is usually necessary to design complex adaptive algorithms and complex sound source localization methods, which are difficult to meet the real-time application requirements of embedded systems
In general, the sound source localization method and the adaptive beam method usually use different design methods, such as the GCC and GSC methods, and there are almost no multiplexing modules in the two, resulting in greater complexity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhanced interaction method, system, storage medium and electronic device
  • Speech enhanced interaction method, system, storage medium and electronic device
  • Speech enhanced interaction method, system, storage medium and electronic device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.

[0064] It should be noted that the diagrams provided in the following embodiments are only schematically illustrating the basic ideas of the present invention, and only the components related to the present invention are shown in the diagrams rather than the number, shape and shape of the compo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a speech enhanced interaction method, a speech enhanced interaction system, a storage medium and an electronic device. The method includes the following steps that: the time-domain signals of microphones in an annular microphone array are converted into the frequency-domain signals of the microphones, and reverberation suppression and stationary noise suppression are performed on the frequency-domain signals of the microphones; wake-up direction sound source positioning is performed based on the reverberation and stationary noise-removed frequency-domain signals of the microphones, so that a wake-up direction is obtained; main direction beam time-domain signals and wake-up direction beam time-domain signals are obtained in a main direction and the wake-up direction on the basis of the reverberation and stationary noise-removed frequency-domain signals of the microphones; speech recognition is performed on the main direction beam time-domain signals; and wake-up word recognition is performed on the wake-up direction beam time-domain signals, and if the signals are identified as wake-up words, the main direction is changed to the obtained wake-up direction. With the speech enhanced interaction method, the speech enhanced interaction system, the storage medium and the electronic device of the invention adopted, the stability and reliability of speech interaction can be improved effectively.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to a speech enhancement interaction method and system, a storage medium and electronic equipment. Background technique [0002] With the development of information technology, artificial intelligence technology has increasingly entered people's lives. Among the many human-computer interactions, voice interaction is the most natural and most in line with human behavior. The continuous development of voice recognition technology has also made voice interaction a reality. During use, a specific wake-up word is usually used to trigger the voice interaction system. However, in real life scenarios, the voice interaction environment is relatively complex and is easily affected by environmental noise, reverberation, and human voice interference, which makes the signal-to-noise ratio of the voice signal collected by the microphone poor, which seriously affects the accuracy of voi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0216G10L21/0224G10L21/0232G10L15/22
CPCG10L15/22G10L21/0216G10L21/0224G10L21/0232G10L2021/02166
Inventor 金剑张益萍
Owner FUZHOU ROCKCHIP SEMICON
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products