Voice recognition method and system for multi-person speaking scene

A speech recognition technology for multi-person speaking scenes, applied in speech recognition, speech analysis, instruments, etc., that can achieve effects such as reducing the workload of speech recognition

Active Publication Date: 2019-12-17
BEIJING UNISOUND INFORMATION TECH

AI Technical Summary

Problems solved by technology

The speech recognition method and system for a multi-person speaking scene differ from prior-art speech recognition technology, which in a multi-person speaking scene can only perform recognition sequentially, in the order in which the speech signals are received. The method and system of the present invention not only receive and recognize speech signals sequentially in a multi-person speaking scene, but also recognize the speech collection time stamp of each speech signal. Although there is still a time difference in the output of the corresponding speech recognition results, the method and system can identify the speaking time points of different speak...




Embodiment Construction

[0052] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0053] Refer to Figure 1, which is a schematic flowchart of a speech recognition method for a multi-person speaking scene provided by an embodiment of the present invention. The method comprises the following steps:

[0054] In step (1), in each of a plurality of preset recognition cycles, the voice signal and the voice collection time stamp of each of several speaking terminals are recognized.
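
Step (1) can be pictured with a short Python sketch. This is only an illustration under stated assumptions, not the patent's implementation: the `Utterance` record, the `recognize_cycle` function, the `CycleInput` layout, and the stand-in recognizer `fake_recognize` are all hypothetical names, and time stamps are assumed to be seconds stored as floats.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Utterance:
    terminal_id: str     # which speaking terminal produced the audio
    collected_at: float  # voice collection time stamp (assumed: seconds)
    text: str            # recognized text


# Hypothetical input for one recognition cycle:
# terminal id -> list of (collection time stamp, raw audio bytes).
CycleInput = Dict[str, List[Tuple[float, bytes]]]


def recognize_cycle(cycle: CycleInput,
                    recognize: Callable[[bytes], str]) -> List[Utterance]:
    """Recognize every segment captured in one cycle, keeping its time stamp."""
    results = []
    for terminal_id, segments in cycle.items():
        for collected_at, audio in segments:
            results.append(Utterance(terminal_id, collected_at, recognize(audio)))
    return results


def fake_recognize(audio: bytes) -> str:
    # Stand-in for a real speech recognizer (assumption for this sketch).
    return audio.decode("utf-8")


cycle = {"A": [(10.2, b"hello")], "B": [(9.8, b"hi there")]}
cycle_results = recognize_cycle(cycle, fake_recognize)
```

Each result carries its collection time stamp alongside the recognized text, so later processing does not depend on the order in which the terminals were visited.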

[0055] Prefe...


Abstract

The invention provides a voice recognition method and system for a multi-person speaking scene. The method and system can sequentially receive and recognize voice signals in the multi-person speaking scene and, at the same time, recognize the voice collection time stamp of each voice signal. This effectively overcomes the defect of the original voice recognition technology, which cannot correctly restore the speaking sequence of different speaking terminals. Because the method and system can accurately restore the speaking sequence of the different speaking terminals, the accuracy of the final voice recognition text is guaranteed, the time needed to sort subsequent voice recognition results is saved, and voice recognition efficiency is improved.
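
The ordering idea in the abstract can be sketched in a few lines of Python. This is a minimal illustration, assuming each recognition result is a `(time stamp, terminal id, text)` tuple and that results may arrive out of speaking order; the function name `restore_speaking_order` and the tie-break on terminal id are assumptions of this sketch, not details taken from the patent.

```python
from typing import List, Tuple

# (voice collection time stamp in seconds, terminal id, recognized text)
Result = Tuple[float, str, str]


def restore_speaking_order(results: List[Result]) -> List[str]:
    """Sort results by collection time stamp and format a transcript."""
    # Tie-break on terminal id so the output is deterministic (assumption).
    ordered = sorted(results, key=lambda r: (r[0], r[1]))
    return [f"[{ts:.1f}s] {terminal}: {text}" for ts, terminal, text in ordered]


# Results listed in the order the recognizer happened to output them.
arrived = [
    (12.5, "B", "I agree."),
    (10.0, "A", "Shall we start?"),
    (11.2, "C", "Yes, go ahead."),
]
transcript = restore_speaking_order(arrived)
```

Sorting on the collection time stamp rather than on output order is what lets the transcript reflect who spoke first, even when recognition results are emitted with a delay.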

Description

Technical field

[0001] The present invention relates to the technical field of voice recognition, in particular to a voice recognition method and system for a multi-person speaking scene.

Background art

[0002] At present, speech recognition technology is widely used in the field of human-computer interaction. Existing speech recognition technology can accurately and quickly identify the speaker corresponding to a speech signal and the meaning of the speech signal itself, which greatly promotes the application and development of human-computer interaction.

[0003] However, the advantages of existing speech recognition technology are limited to scenes in which a single person speaks. In a scene with multiple speakers, speech recognition not only needs to identify the speakers and meanings corresponding to different speech signals, but also needs to recognize and distinguish the sequence of speeches between the s...


Application Information

IPC(8): G10L15/26, G10L15/28, G10L25/51
CPC: G10L15/26, G10L15/28, G10L25/51
Inventor: 何世阳, 王善彬
Owner: BEIJING UNISOUND INFORMATION TECH