Voice discrimination method, device, electronic device and storage medium

A voice discrimination method, applied in voice recognition, voice analysis, character and pattern recognition, etc., that addresses the low accuracy of distinguishing generated voice from real voice and the lack of universal applicability of existing methods for distinguishing them, thereby improving accuracy.

Active Publication Date: 2022-04-01

INST OF AUTOMATION CHINESE ACAD OF SCI


Problems solved by technology

[0002] With the development of deep learning technology, generated speech obtained through speech synthesis and speech conversion has become far more similar to the real speech of a real person, to the point that it can pass for genuine speech, and it is widely used in medical, entertainment and other fields. This also provides the technical conditions for using generated speech to carry out network fraud, which poses a great threat to people's safety and social stability. Therefore, technology for identifying generated speech has become an urgent need in today's society.

Existing technology often uses acoustic features to distinguish generated speech from real speech, but acoustic features alone cannot distinguish the two well.

[0003] In the process of realizing the concept of the present disclosure, the inventors found at least the following technical problems in the related art: the accuracy of distinguishing generated speech from real speech is low, and the methods for distinguishing them are not universally applicable.

Embodiment Construction

[0025] Hereinafter, the present disclosure will be described in detail with reference to the accompanying drawings and embodiments. It should be noted that, in the case of no conflict, the embodiments in the present disclosure and the features in the embodiments can be combined with each other.

[0026] It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence.

[0027] The method embodiments provided by the present disclosure may be executed in a computer terminal or a similar computing device. Taking execution on a computer terminal as an example, Figure 1 schematically shows a block diagram of the hardware structure of a computer terminal for a speech discrimination method according to an embodiment of the present disclosure. As shown in Figure 1, the computer te...

Abstract

The present disclosure relates to a voice discrimination method, device, electronic equipment, and storage medium. The method includes: acquiring the speech to be discriminated; extracting the acoustic features and language style features of the speech to be discriminated; splicing the acoustic features and the language style features to obtain fusion features; and inputting the fusion features into a speech discriminator to determine whether the speech to be discriminated is real speech or generated speech. These technical means address the problems in the prior art that the accuracy of distinguishing generated speech from real speech is low and that methods for distinguishing them are not universally applicable.
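The steps in the abstract (extract two feature sets, splice them, pass the result to a discriminator) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how the features are extracted or what form the discriminator takes, so the feature values are placeholders, splicing is taken to be plain vector concatenation, and `splice_features`, `discriminate`, the weights, and the 0.5 threshold are all hypothetical.

```python
import math
from typing import List, Sequence


def splice_features(acoustic: Sequence[float], style: Sequence[float]) -> List[float]:
    """Concatenate acoustic and language style features into one fused vector."""
    return list(acoustic) + list(style)


def discriminate(fused: Sequence[float], weights: Sequence[float], bias: float = 0.0) -> str:
    """Hypothetical linear discriminator (an assumption, not from the patent).

    A sigmoid score at or above 0.5 is labeled "generated", otherwise "real".
    """
    if len(fused) != len(weights):
        raise ValueError("weights must match the fused feature length")
    logit = sum(w * x for w, x in zip(weights, fused)) + bias
    score = 1.0 / (1.0 + math.exp(-logit))
    return "generated" if score >= 0.5 else "real"


# Placeholder numbers standing in for features extracted from one utterance.
acoustic_features = [0.2, -1.3, 0.7]
style_features = [0.5, 0.1]

fused = splice_features(acoustic_features, style_features)
# The weights are illustrative only; a real discriminator would be trained.
label = discriminate(fused, weights=[0.4, -0.2, 0.1, 0.3, -0.5])
print(len(fused), label)
```

In practice the discriminator would be a trained model rather than fixed weights; the point of the sketch is only that the decision is made on the fused vector, not on the acoustic features alone.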

Description

Technical field

[0001] The present disclosure relates to the field of voice recognition, in particular to a voice discrimination method, device, electronic equipment and storage medium.

Background

[0002] With the development of deep learning technology, generated speech obtained through speech synthesis and speech conversion has become far more similar to the real speech of a real person, to the point that it can pass for genuine speech. It has a wide range of applications in medical, entertainment and other fields. This also provides the technical conditions for using generated speech to carry out network fraud, which poses a great threat to people's safety and social stability. Therefore, technology for identifying generated speech has become an urgent need in today's society. In the prior art, acoustic features are often used to identify generated speech and real speech, but only acoustic features are used, and the generated speech and real sp...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L15/02, G10L15/06, G10L15/16, G10L15/26, G10L25/51, G06K9/62

CPC: G10L15/02, G10L15/063, G10L15/16, G10L15/26, G10L25/51, G06F18/253

Inventor: 陶建华, 遆敬苗, 易江燕, 傅睿博

Owner: INST OF AUTOMATION CHINESE ACAD OF SCI
