Voice discrimination method, device, electronic device and storage medium

A voice discrimination method in the field of voice technology, applicable to voice recognition, voice analysis, character and pattern recognition, etc. It addresses three problems: the low accuracy of identifying generated voice versus real voice, the inability to distinguish generated voice from real voice, and the lack of universal applicability of existing methods for telling the two apart. The effect achieved is improved discrimination accuracy.
CN113724693B · Active · Publication Date: 2022-04-01 · INST OF AUTOMATION CHINESE ACAD OF SCI

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
INST OF AUTOMATION CHINESE ACAD OF SCI
Publication Date
2022-04-01

Abstract

The present disclosure relates to a voice discrimination method, device, electronic equipment, and storage medium. The method includes: acquiring the voice to be discriminated; extracting the acoustic features and language style features of the voice to be discriminated; splicing the acoustic features and the language style features to obtain fusion features; and inputting the fusion features into a speech discriminator to determine whether the voice to be discriminated is real speech or generated speech. These technical means address two problems in the prior art: the low accuracy of distinguishing generated speech from real speech, and the lack of universal applicability of existing methods for distinguishing the two.
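
The splicing and discrimination steps in the abstract can be illustrated with a minimal sketch. It assumes that each feature set is a fixed-length vector and that "splicing" means simple concatenation. The function names, the placeholder feature values, and the logistic scorer are hypothetical stand-ins for illustration only; the abstract does not specify the feature extractors or the discriminator architecture.

```python
import numpy as np


def fuse_features(acoustic: np.ndarray, style: np.ndarray) -> np.ndarray:
    """Splice (concatenate) acoustic and language style vectors into one fusion vector."""
    return np.concatenate([acoustic, style], axis=-1)


def discriminate(fusion: np.ndarray, weights: np.ndarray, bias: float,
                 threshold: float = 0.5) -> str:
    """Hypothetical discriminator: a logistic score over the fusion features.

    A real system would use a trained model; this only shows the data flow.
    """
    score = 1.0 / (1.0 + np.exp(-(fusion @ weights + bias)))
    return "real" if score >= threshold else "generated"


# Placeholder feature vectors (assumed values, not taken from the patent).
acoustic = np.array([0.2, -1.0, 0.5])   # e.g. output of an acoustic front end
style = np.array([1.5, 0.3])            # e.g. output of a language style encoder

fusion = fuse_features(acoustic, style)

# Placeholder discriminator parameters (assumed, untrained).
weights = np.array([0.1, 0.2, -0.3, 0.4, 0.5])
label = discriminate(fusion, weights, bias=0.0)
print(fusion.shape, label)
```

The point of the sketch is only that the discriminator sees both kinds of evidence in a single input vector, rather than acoustic features alone.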

Description

Technical field

[0001] The present disclosure relates to the field of voice recognition, in particular to a voice discrimination method, device, electronic equipment and storage medium.

Background art

[0002] With the development of deep learning technology, the similarity between generated speech, obtained through speech synthesis and speech conversion technology, and the real speech of a real person has greatly improved, to the point that generated speech can pass for the real thing. This technology has a wide range of applications in medical, entertainment, and other fields, but it also provides the technical conditions for using generated voice to commit online fraud, which poses a great threat to people's safety and social stability. Identification technology for generated voice has therefore become an urgent need in today's society. In the prior art, acoustic features are often used to identify generated speech and real speech, but only acoustic features are used, and the generated speech and real sp...
