Unlock instant, AI-driven research and patent intelligence for your innovation.

Method, system and device for voiceprint identification based on video and spectrogram

An identification method and spectrogram technology, applied in the field of speech recognition, can solve the problem of low accuracy of identification results

Active Publication Date: 2021-01-19
SPEAKIN TECH CO LTD
View PDF20 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The object of the present invention is to provide a voiceprint identification method, system, device and computer-readable storage medium based on video and spectrogram, so as to solve the problem in the prior art by using the voice corresponding to the spectrogram to identify, and the accuracy of the identification result. not high problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, system and device for voiceprint identification based on video and spectrogram
  • Method, system and device for voiceprint identification based on video and spectrogram
  • Method, system and device for voiceprint identification based on video and spectrogram

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0061] As a specific embodiment, the acquisition module is specifically:

[0062] Spectral parameters in the audio file are acquired, where the spectral parameters include bandwidth, dynamic range, attenuation coefficient, high-frequency boost coefficient, and windowing type, so as to construct a module of a spectrogram corresponding to the audio file.

[0063] As a specific implementation manner, the building module is specifically:

[0064] A module for a callback function to time is established on the video file and the spectrogram respectively.

[0065] As a specific implementation manner, the verification module is specifically:

[0066] selecting syllables in the audio file for analysis;

[0067] A module for identifying the corresponding formant of the syllable and video.

[0068] The voiceprint identification system based on video and spectrogram provided by this embodiment obtains the spectrogram of the audio file corresponding to the video file, and then establish...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a video and speech spectrum based voiceprint identification method. The video and speech spectrum based voiceprint identification method comprises: acquiring a speech spectrumcorresponding to an audio file, wherein the audio file corresponds to a video file; establishing associations with time on the video file and the speech spectrum so as to position the same time pointin the other end of the video file and the speech spectrum while any time point is selected from any one of the video file and the speech spectrum, and obtaining a corresponding video and formant according to the associations; and identifying a voiceprint according to the video and the formant so as to determine the identity of a person to be identified. The speech spectrum of the audio file corresponding to the video file is obtained, the associations are established in the video file and the speech spectrum, and then when any time point is selected from any one of the video file and the speech spectrum, the same time point is positioned in the other one of the video file and the speech spectrum; and the video and the formant are obtained, and mouth shapes and expression when the person to be identified speaks are observed to increase the identification basis. The invention also provides a system and device having the above advantages, and a computer readable storage media.

Description

technical field [0001] The invention relates to the field of speech recognition, in particular to a voiceprint identification method, system, device and computer-readable storage medium based on video and spectrogram. Background technique [0002] The voice of each person is different, and the voice of a person is like a fingerprint of a person, with the characteristics of "everyone is different". Especially when a person is an adult, the pronunciation organs have matured, language habits have been formed, and the pronunciation is stable except under special circumstances such as the influence of diseases. And because everyone's physiological structure, living environment and other factors are different, people's voice is specific. Therefore, it is an important science and technology to carry out personal identification through voiceprint identification. [0003] The existing identification method uses the shape and trend of the formant on the spectrogram as the most impor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/00G10L17/06
CPCG10L17/00G10L17/06
Inventor 黎智勇
Owner SPEAKIN TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More