Speaker tag alignment method and device, electronic equipment and computer readable storage medium

A speaker tag alignment technology, applied to a speaker tag alignment method, device, electronic equipment and computer-readable storage medium, with the effect of improving the accuracy of speaker logs.

Pending Publication Date: 2022-05-10
SHANGHAI ZHENGDA XIMALAYA NETWORK TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a speaker tag alignment method, device, electronic equipment and computer-readable storage medium that solve the problem of fusing multi-channel speaker log tags and thereby improve the accuracy of speaker logs.

Embodiment Construction

[0046] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. The components of the embodiments generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations.

[0047] Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art wi...

Abstract

The embodiment of the invention provides a speaker tag alignment method and device, electronic equipment and a computer-readable storage medium. The method comprises the steps of: obtaining N speaker logs of N sound channels; clustering each speaker log to obtain N clustered speaker tag sets; taking a first target speaker tag set among the N speaker tag sets as a reference tag set; and, based on the reference tag set, performing alignment processing on the N-1 second target speaker tag sets other than the first target speaker tag set. With this method, the speaker tags corresponding to the multi-channel speaker logs can be aligned, so that the speaker tag sets of the multi-channel speaker logs carry absolute tags rather than relative tags, which further improves the accuracy of the speaker logs.
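
The alignment step in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how alignment is computed, so everything here is an assumption made for illustration: each channel's speaker log is represented as a list of per-frame integer speaker tags of equal length, the tags of a non-reference channel are remapped to the reference tags by maximizing frame-level agreement, and the best mapping is found by brute force over permutations (practical only for a small number of speakers). The function names `align_to_reference` and `align_channels` are hypothetical and do not come from the patent.

```python
from collections import Counter
from itertools import permutations


def align_to_reference(reference, other):
    """Relabel `other` so its speaker tags agree with `reference` on as many frames as possible.

    Both arguments are equal-length lists of integer speaker tags, one per frame
    (an assumed representation, not taken from the patent text).
    """
    if len(reference) != len(other):
        raise ValueError("frame sequences must have equal length")

    ref_tags = sorted(set(reference))
    other_tags = sorted(set(other))

    # Count how often each (other tag, reference tag) pair occurs on the same frame.
    overlap = Counter(zip(other, reference))

    # If `other` contains more speakers than the reference, pad with fresh tag ids.
    next_id = max(ref_tags, default=-1) + 1
    extra = max(0, len(other_tags) - len(ref_tags))
    targets = ref_tags + [next_id + i for i in range(extra)]

    # Brute-force search for the one-to-one mapping with the highest agreement.
    best_map, best_score = {}, -1
    for perm in permutations(targets, len(other_tags)):
        score = sum(overlap[(o, r)] for o, r in zip(other_tags, perm))
        if score > best_score:
            best_score, best_map = score, dict(zip(other_tags, perm))

    return [best_map[tag] for tag in other]


def align_channels(channels, ref_index=0):
    """Align every channel's tag sequence to the channel chosen as reference."""
    reference = channels[ref_index]
    return [
        list(ch) if i == ref_index else align_to_reference(reference, ch)
        for i, ch in enumerate(channels)
    ]


if __name__ == "__main__":
    channels = [
        [0, 0, 1, 1, 2, 2],  # reference channel
        [1, 1, 2, 2, 0, 0],  # same speakers, different relative tags
        [2, 2, 0, 0, 1, 1],
    ]
    for aligned in align_channels(channels):
        print(aligned)
```

After alignment, all three example channels carry the same tag for the same speaker, which is the sense in which relative tags become absolute ones. For larger speaker counts, an assignment solver would be a more scalable substitute for the permutation search.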

Description

Technical field

[0001] The present invention relates to the technical field of speech recognition, in particular to a speaker tag alignment method, device, electronic equipment and computer-readable storage medium.

Background technique

[0002] With the continuous development of deep learning, the accuracy of speech recognition keeps increasing and its applications are becoming more widespread. For single-speaker near-field scenarios, speech recognition already achieves high accuracy. Far-field scenarios with multiple speakers, however, remain difficult. One important difficulty lies in speaker log technology (speaker diarization): the speaking time of each speaker must be identified first, and only then can speech recognition be applied.

[0003] For the far-field voice scene of multiple people, we often use multi-microphone devices to obtain...

Application Information

Patent Type & Authority: Application (China)
IPC(8): G10L25/03, G10L17/06, G10L17/00, G10L15/28, G06K9/62, G06F17/16
CPC: G10L25/03, G10L17/00, G10L17/06, G10L15/28, G06F17/16, G06F18/23
Inventor: 吕翔, 印晶晶, 卢恒
Owner SHANGHAI ZHENGDA XIMALAYA NETWORK TECH CO LTD