Automatic voiceprint modeling warehousing method, device and equipment

Voiceprint and automation technology, applied in special data processing applications, instruments, electrical digital data processing, etc.; it addresses the difficulty of labeling speakers in massive corpora.

Active Publication Date: 2020-07-14
合肥讯飞数码科技有限公司

AI Technical Summary

Problems solved by technology

However, labeling the speaker in a corpus (that is, classifying the corpus by speaker) is more difficult than labeling for other recognition tasks such as language recognition and continuous speech recognition.
At present, even when corpus information such as the speaker and background knowledge is known, labeling a small amount of corpus still carries a certain error rate. The difficulty is far greater when facing a massive data set with unfamiliar speakers, unknown backgrounds, and unrestricted scenes (the present invention refers to this as a large corpus without a theme): it is hard to separate the different speakers and carry out the corresponding voiceprint modeling and storage operation. This is one of the key obstacles to the promotion and application of speaker recognition technology.



Examples


Embodiment approach

[0126] Based on the above embodiments and their preferred solutions, those skilled in the art will understand that, in practice, the present invention can be implemented in various forms; the following carriers are given as schematic illustrations:

[0127] (1) An automatic voiceprint modeling storage device, which may include:

[0128] one or more processors, a memory, and one or more computer programs, wherein the one or more computer programs are stored in the memory and include instructions that, when executed by the device, cause the device to perform the steps/functions of the foregoing embodiments or equivalent implementations.

[0129] Figure 4 is a schematic structural diagram of an embodiment of the automatic voiceprint modeling storage device of the present invention, wherein the device may be an electronic device or a circuit device built in the abo...


Abstract

The invention discloses an automatic voiceprint modeling warehousing method, device and equipment. The concept of the invention is to provide, for the needs of speaker voiceprint modeling, a data partitioning approach for massive corpora, so that the massive corpus is organized by theme and by level and the task of voiceprint modeling and storage using the massive corpus can be completed. Specifically, the method first applies a reduction approach, performing initial filtering and separation of the mass data based on multi-dimensional information; it then applies multi-stage superimposed consensus clustering to classify and purify the reduced corpus data progressively, stage by stage; finally it obtains corpora that can be used for voiceprint modeling. The invention does not require a large investment of labor for labeling. It also avoids the problem that arises when voiceprint collision clustering is applied directly and indiscriminately to mass data: errors may propagate from stage to stage and degrade the purity of the modeling corpus, so that voiceprint modeling and storage ultimately cannot be achieved.
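The two phases named in the abstract (reduction by filtering, then consensus clustering for purification) can be illustrated with a minimal Python sketch. It is not the patented procedure: the `Segment` fields, the duration and SNR thresholds, the use of two embedding "views", and the single-linkage clustering rule are all assumptions chosen for illustration. Here "consensus" simply means that segments are kept together only if every view places them in the same cluster, and groups smaller than `min_size` are discarded as impure.

```python
from collections import defaultdict
from dataclasses import dataclass
from math import sqrt


@dataclass
class Segment:
    # All fields are hypothetical; the source does not specify a data model.
    seg_id: str
    duration_s: float
    snr_db: float
    views: tuple  # one speaker embedding per (assumed) feature view


def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def prefilter(segments, min_duration_s=2.0, min_snr_db=10.0):
    """Reduction step: drop segments that are too short or too noisy."""
    return [s for s in segments
            if s.duration_s >= min_duration_s and s.snr_db >= min_snr_db]


def threshold_clusters(embeddings, threshold):
    """Single-linkage clustering via union-find; returns one label per item."""
    parent = list(range(len(embeddings)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(embeddings)):
        for j in range(i + 1, len(embeddings)):
            if cosine(embeddings[i], embeddings[j]) >= threshold:
                parent[find(i)] = find(j)
    return [find(i) for i in range(len(embeddings))]


def consensus_groups(segments, threshold=0.9, min_size=2):
    """Keep only groups whose members are co-clustered in every view."""
    if not segments:
        return []
    n_views = len(segments[0].views)
    runs = [threshold_clusters([s.views[v] for s in segments], threshold)
            for v in range(n_views)]
    groups = defaultdict(list)
    for idx, seg in enumerate(segments):
        key = tuple(run[idx] for run in runs)  # cluster label in each view
        groups[key].append(seg.seg_id)
    return sorted(sorted(g) for g in groups.values() if len(g) >= min_size)


# Toy data: s5 is clustered inconsistently across views, s6 is too short,
# and s7 is too noisy.
demo = [
    Segment("s1", 5.0, 20.0, ((1.0, 0.0), (1.0, 0.0))),
    Segment("s2", 4.0, 18.0, ((0.99, 0.1), (0.98, 0.1))),
    Segment("s3", 6.0, 25.0, ((0.0, 1.0), (0.0, 1.0))),
    Segment("s4", 5.0, 22.0, ((0.1, 0.99), (0.05, 1.0))),
    Segment("s5", 5.0, 20.0, ((1.0, 0.05), (0.0, 1.0))),
    Segment("s6", 0.5, 20.0, ((1.0, 0.0), (1.0, 0.0))),
    Segment("s7", 5.0, 3.0, ((0.0, 1.0), (0.0, 1.0))),
]

kept = prefilter(demo)
print(consensus_groups(kept))
```

In this toy run the filter removes the short and noisy segments before any pairwise comparison, and the consensus step then drops the one segment on which the two views disagree. A real system would replace the quadratic pairwise loop and the fixed thresholds with whatever the full specification prescribes.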

Description

Technical field

[0001] The invention relates to the technical field of speech processing, in particular to a method, device and equipment for automatic voiceprint modeling and storage.

Background technique

[0002] Voiceprint modeling from known target corpus data is a very important link in speaker recognition technology. Generally speaking, the voiceprint of the target speaker is modeled from the target speaker's corpus using a voiceprint recognition algorithm, and the resulting voiceprint information is stored in a voiceprint database. This process is voiceprint modeling and storage. The quantity and quality of the corpus used for voiceprint modeling have a great influence on subsequent recognition performance, so providing a modeling corpus of sufficient quantity and quality is particularly important.

[0003] Therefore, it is necessary to prepare the speaker's clean corpus in advance in the process of voiceprint ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/63, G06F16/65, G06F16/683
CPC: G06F16/63, G06F16/65, G06F16/683
Inventor: 方磊, 宣璇, 夏翔, 方昕
Owner: 合肥讯飞数码科技有限公司