Voice activity detection method, related device and equipment

A voice activity detection technology, applied in the computer field, that addresses problems such as the strong influence of threshold settings, the failure to effectively reflect the characteristics of voice frames, and the inability to accurately detect voice segments, thereby improving detection accuracy.

Active Publication Date: 2022-07-26
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the embodiments of the present invention is to provide a voice activity detection method, a voice activity detection apparatus, a voice activity detection device, and a computer-readable storage medium that solve the following problems in the prior art: the threshold setting of schemes based on short-term energy and spectral entropy is strongly affected by the recording environment, and schemes based on the spectral entropy energy product cannot effectively reflect the characteristics of speech frames, so speech segments cannot be accurately detected.

Examples

Embodiment Construction

[0023] The technical solutions in the embodiments of the present invention will be described below with reference to the accompanying drawings in the embodiments of the present invention.

[0024] It is also to be understood that the terminology used in this specification of the present invention is for the purpose of describing particular embodiments only and is not intended to limit the present invention.

[0025] It should also be further understood that, as used in this specification and the appended claims, the term "and/or" refers to and includes any and all possible combinations of one or more of the associated listed items.

[0026] In specific implementations, the terminals described in the embodiments of the present invention include, but are not limited to, portable devices such as mobile phones, laptops, or tablet computers with touch-sensitive surfaces (e.g., touchscreen displays and/or touchpads). It should also be understood that, in some embodiments, the...

Abstract

The invention discloses a voice activity detection method, comprising: receiving voice data that includes multiple frames of voice signals; calculating the energy and the spectral entropy of a frame of the voice signal; taking the root of the energy and/or the root of the spectral entropy; calculating the spectral entropy energy root of the voice signal from the rooted energy or the rooted spectral entropy; and judging the voice signal to be a non-voice frame when its spectral entropy energy root is less than a first preset threshold, or a voice frame when its spectral entropy energy root is not less than the first preset threshold. The invention also discloses a voice activity detection apparatus and a voice activity detection device. The invention solves the technical problem in the prior art that the threshold setting of schemes based on short-term energy and spectral entropy is strongly affected by the recording environment, or that schemes based on the spectral entropy energy product cannot effectively reflect the characteristics of speech frames, so that speech segments cannot be accurately detected.
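
The per-frame decision described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented formula: the abstract does not state how the rooted quantities are combined, so the sketch assumes a square root applied to the energy only, multiplied by the spectral entropy. The function names, the naive DFT, the test frames, and the threshold value of 1.0 are all hypothetical choices made for the example.

```python
import cmath
import math


def frame_energy(frame):
    """Short-term energy of one frame: the sum of squared samples."""
    return sum(s * s for s in frame)


def spectral_entropy(frame, eps=1e-12):
    """Shannon entropy of the frame's normalized power spectrum.

    A naive O(n^2) DFT keeps the sketch dependency-free; a real
    implementation would use an FFT.
    """
    n = len(frame)
    power = []
    for k in range(n // 2 + 1):
        coeff = sum(
            frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        power.append(abs(coeff) ** 2)
    total = sum(power)
    if total < eps:
        return 0.0  # silent frame: no spectral distribution to measure
    probs = [p / total for p in power]
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def spectral_entropy_energy_root(frame):
    """ASSUMED combination: sqrt(energy) * spectral entropy.

    The source only says the value is computed from the rooted energy
    or the rooted spectral entropy; this product is one possible reading.
    """
    return math.sqrt(frame_energy(frame)) * spectral_entropy(frame)


def is_voice_frame(frame, threshold):
    """Voice frame if the value is not less than the preset threshold."""
    return spectral_entropy_energy_root(frame) >= threshold


# Hypothetical test frames: a three-tone mix stands in for a loud frame,
# and a heavily attenuated copy stands in for a near-silent frame.
N = 64
loud = [
    sum(math.sin(2 * math.pi * k * t / N) for k in (3, 7, 12))
    for t in range(N)
]
quiet = [0.001 * s for s in loud]

print(is_voice_frame(loud, threshold=1.0))
print(is_voice_frame(quiet, threshold=1.0))
```

Because the spectral entropy does not change when a frame is scaled, the two frames differ only through the rooted energy term in this sketch. The threshold comparison follows the abstract: a value below the threshold marks a non-voice frame, and any other value marks a voice frame.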

Description

Technical field

[0001] The invention relates to the field of computers, and in particular to a voice activity detection method, a voice activity detection apparatus, and a voice activity detection device.

Background

[0002] Speech recognition is an interdisciplinary subject. Over the past two decades, speech recognition technology has made significant progress and has begun to move from the laboratory to the market. As it develops, speech recognition technology will enter fields such as industry, home appliances, communications, automotive electronics, medical care, home services, and consumer electronics.

[0003] Voice activity detection (VAD), also known as voice endpoint detection or voice boundary detection, is a speech processing technology used to detect the presence of voice signals. VAD is a standard component of speech recognition technology.

[0004] In the prior art, the VAD algorith...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/04, G10L25/78
CPC: G10L15/04, G10L25/78, G10L2025/783, G10L25/03, G10L2025/786, G10L25/21, G10L15/22, G10L25/18, G10L25/84, G10L25/93, G10L2025/937
Inventor: 刘继忠
Owner: TENCENT TECH (SHENZHEN) CO LTD