
A method for real-time speech separation and speech transcription

A real-time speech separation and transcription technology, applied in the computer field, that addresses the inability of existing pickup devices to capture the voices of all parties in a conversation at the same time, so as to reduce interference and achieve effective speech transcription.

Active Publication Date: 2022-03-15
北京睿科伦智能科技有限公司
6 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0003] Existing desktop directional sound pickup products can only pick up the voice of the user in the near field and cannot capture the voice of the other party in the conversation at the same time. If several people are talking in the scene, multiple near-field pickup devices must be deployed, each placed very close to its speaker, usually within 20 cm.


Detailed Description of the Embodiments

[0032] The technical solutions of the present invention are described clearly and completely below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0033] As shown in the process steps of Figure 1 and Figure 2, a method for real-time speech separation and speech transcription includes: collecting the voices of multiple speakers through a hardware acquisition module to obtain the digital signals of the multi-channel microphones; separating out the speech signal of each individual speaker; and feeding each of these speech signals into the speech transcription module, where it is transcribed into text content...
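The three stages named in [0033] can be sketched as a simple chain. This is a minimal illustration under stated assumptions, not the patent's implementation: `run_pipeline`, `toy_separate`, and `toy_transcribe` are hypothetical names, the separator merely treats each input channel as one speaker, and the transcriber reports signal energy in place of real speech recognition.

```python
from typing import Callable, Dict, List

Frame = List[List[float]]  # frame[channel][sample]: one block of multi-channel audio
Separator = Callable[[Frame], Dict[str, List[float]]]
Transcriber = Callable[[List[float]], str]


def run_pipeline(frames: List[Frame], separate: Separator,
                 transcribe: Transcriber) -> Dict[str, List[str]]:
    """Pass each captured frame through separation, then transcribe every speaker stream."""
    transcript: Dict[str, List[str]] = {}
    for frame in frames:
        for speaker, signal in separate(frame).items():
            text = transcribe(signal)
            if text:  # skip frames in which this speaker was silent
                transcript.setdefault(speaker, []).append(text)
    return transcript


def toy_separate(frame: Frame) -> Dict[str, List[float]]:
    # Placeholder (assumption): channel 0 is the agent, channel 1 the customer.
    return {"agent": frame[0], "customer": frame[1]}


def toy_transcribe(signal: List[float]) -> str:
    # Placeholder (assumption): report energy instead of recognized words.
    energy = sum(s * s for s in signal)
    return f"[speech, energy={energy:.2f}]" if energy > 0.1 else ""


frames = [
    [[0.5, -0.5, 0.5], [0.0, 0.0, 0.0]],  # only the agent is active
    [[0.0, 0.0, 0.0], [1.0, -1.0, 0.0]],  # only the customer is active
]
result = run_pipeline(frames, toy_separate, toy_transcribe)
print(result)
```

Keeping the separator and transcriber as plain callables lets either stage be swapped for a real implementation without changing the loop.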

Abstract

The present invention is a method for real-time speech separation and speech transcription, comprising a hardware acquisition module, a speech separation module, and a speech transcription module. The hardware acquisition module collects digital speech signals; the speech separation module separates the digital speech signals collected from different directions; and the speech transcription module transcribes each separated signal into text. The microphone pickup module of the hardware acquisition module collects the speech signals and only requires the angle parameters to be configured. The speech separation module can separate a multi-person conversation by speaker in real time so that each speaker can be transcribed in real time. It also reduces the interference of environmental noise and transcribes the sound source in a fixed direction, so that overlapping dialogue speech is separated and transcribed effectively.
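The abstract does not say which algorithm separates sound by direction. One standard technique for picking out a source at a configured angle is delay-and-sum beamforming, sketched below as an assumption rather than as the patented method. The linear array geometry, the 16 kHz sample rate, the speed-of-sound constant, and all function names are hypothetical choices made for the example.

```python
import math
from typing import List, Sequence

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature


def arrival_offsets(mic_x: Sequence[float], angle_deg: float,
                    sample_rate: int) -> List[int]:
    """Arrival offset in samples at each microphone for a far-field source.

    angle_deg is measured from the array axis. At 0 degrees the source lies
    along +x, so microphones with larger x hear it earlier (negative offset).
    """
    cos_a = math.cos(math.radians(angle_deg))
    return [round(-x * cos_a / SPEED_OF_SOUND * sample_rate) for x in mic_x]


def shift(signal: Sequence[float], k: int) -> List[float]:
    """Delay a signal by k samples (advance it if k < 0), padding with zeros."""
    n = len(signal)
    return [signal[i - k] if 0 <= i - k < n else 0.0 for i in range(n)]


def delay_and_sum(channels: Sequence[Sequence[float]],
                  offsets: Sequence[int]) -> List[float]:
    """Undo each channel's arrival offset, then average the aligned channels."""
    aligned = [shift(ch, -off) for ch, off in zip(channels, offsets)]
    return [sum(col) / len(aligned) for col in zip(*aligned)]


SAMPLE_RATE = 16000
# Spacing chosen so adjacent microphones differ by exactly 2 samples at 0 degrees.
SPACING = 2 * SPEED_OF_SOUND / SAMPLE_RATE
mic_x = [m * SPACING for m in range(4)]

# Simulate a short burst arriving from 0 degrees at a 4-microphone array.
source = [0.0] * 8 + [1.0, -1.0, 0.5] + [0.0] * 8
channels = [shift(source, off) for off in arrival_offsets(mic_x, 0.0, SAMPLE_RATE)]

on_target = delay_and_sum(channels, arrival_offsets(mic_x, 0.0, SAMPLE_RATE))
off_target = delay_and_sum(channels, arrival_offsets(mic_x, 180.0, SAMPLE_RATE))
print(max(abs(v) for v in on_target), max(abs(v) for v in off_target))
```

Steering at the true direction adds the four channels coherently and recovers the burst, while steering at the opposite direction spreads it out and lowers its peak. Integer-sample delays keep the sketch short; a practical system would need fractional delays or frequency-domain steering.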

Description

Technical field

[0001] The invention relates to the field of computers, in particular to a method for real-time speech separation and speech transcription.

Background art

[0002] In fixed-position service dialogue scenarios such as insurance and bank counters, the dialogue between the two parties must be recorded effectively in a slightly noisy environment.

[0003] Existing desktop directional sound pickup products can only pick up the voice of the user in the near field and cannot capture the voice of the other party in the conversation at the same time. If several people are talking in the scene, multiple near-field pickup devices must be deployed, each placed very close to its speaker, usually within 20 cm. This method can judge and separate multiple human voices in different directions in real time through the microphone array pickup placed on the desktop, and output corresponding text information acc...
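In a fixed-position counter scenario like the one in [0002], one plausible way to attribute a detected direction to a speaker is to compare it against configured angle sectors. The sketch below is an assumption for illustration only: the source does not state how directions are mapped to speakers, and the sector bounds, speaker labels, and function names are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Hypothetical configuration: speaker label -> (start_deg, end_deg), measured
# clockwise. A sector may wrap through 0 degrees, as the agent's sector does.
SECTORS: Dict[str, Tuple[float, float]] = {
    "agent": (315.0, 45.0),
    "customer": (135.0, 225.0),
}


def in_sector(angle: float, start: float, end: float) -> bool:
    """Return True if angle lies in the sector from start to end (inclusive)."""
    angle, start, end = angle % 360.0, start % 360.0, end % 360.0
    if start <= end:
        return start <= angle <= end
    return angle >= start or angle <= end  # sector wraps through 0 degrees


def speaker_for_angle(angle: float,
                      sectors: Dict[str, Tuple[float, float]]) -> Optional[str]:
    """Return the first configured speaker whose sector contains the angle."""
    for label, (start, end) in sectors.items():
        if in_sector(angle, start, end):
            return label
    return None  # sound from an unconfigured direction, e.g. background noise


for estimate in (10.0, 350.0, 180.0, 90.0):
    print(estimate, speaker_for_angle(estimate, SECTORS))
```

Sound arriving from outside every configured sector maps to `None`, which gives a caller a simple way to discard sources that belong to neither party.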

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L21/0272, G10L25/51, G10L15/26
CPC: G10L21/0272, G10L15/26, G10L25/51, G10L2021/02166
Inventors: 赵建平, 荆榆, 程栋梁, 沈忱, 石松涛, 高博, 许乾坤, 张宇韬
Owner: 北京睿科伦智能科技有限公司