Voice recognition method and system for multi-person speaking scene

A speech recognition technology for multi-person speaking scenes, applied in speech recognition, speech analysis, instruments, etc., that reduces the workload of speech recognition.

Active Publication Date: 2019-12-17

BEIJING UNISOUND INFORMATION TECH

Cites: 6; Cited by: 1

AI Technical Summary

Problems solved by technology

The speech recognition method and system for a multi-person speaking scene differ from prior-art speech recognition technology, which in such a scene can only process speech signals sequentially in the order in which they are received. The method and system of the present invention not only receive and recognize speech signals sequentially in a multi-person speaking scene, but also recognize the voice collection time stamp of each speech signal. Although the corresponding recognition results are still output with a time difference, the method and system can identify the speaking time points of the different speaking ends from the voice collection time stamps. This effectively overcomes the defect that the original speech recognition technology cannot correctly restore the speaking order of different speaking ends, so the speaking order between different speaking terminals is restored accurately and the accuracy of the final recognized text is ensured. In addition, the method and system construct a sequential buffer queue and process the stored speech signals in order, which reduces the workload of speech recognition, ensures that each speech signal is accurately recognized and processed, saves the time spent on subsequently sorting recognition results, and improves the efficiency of speech recognition.
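The idea described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the `Utterance` record, the `recognize` placeholder, and the function name `transcribe_in_speaking_order` are all hypothetical, and the time stamps are assumed to be comparable numbers in seconds.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Utterance:
    terminal_id: str     # hypothetical identifier of a speaking end
    collected_at: float  # voice collection time stamp (assumed: seconds)
    audio: bytes         # the speech signal itself


def recognize(audio: bytes) -> str:
    """Placeholder recognizer (assumption); a real system would call an ASR engine."""
    return audio.decode("utf-8")


def transcribe_in_speaking_order(received):
    # Sequential buffer queue: stored signals are processed in arrival order.
    buffer = deque(received)
    results = []
    while buffer:
        utt = buffer.popleft()
        results.append((utt.collected_at, utt.terminal_id, recognize(utt.audio)))
    # Restore the speaking order from the voice collection time stamps.
    results.sort(key=lambda r: r[0])
    return [(terminal, text) for _, terminal, text in results]


# Signals arrive out of speaking order: terminal B's reply is received first.
received = [
    Utterance("B", 2.0, b"fine, thanks"),
    Utterance("A", 1.0, b"how are you"),
    Utterance("A", 3.0, b"good to hear"),
]
transcript = transcribe_in_speaking_order(received)
```

Sorting once after the queue has drained keeps recognition itself strictly sequential, which matches the description; a system that must emit text incrementally would need a different ordering strategy.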

Embodiment Construction

[0052] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0053] Refer to Figure 1, which is a schematic flowchart of a speech recognition method for a multi-person speaking scene provided by an embodiment of the present invention. The method comprises the following steps:

[0054] In step (1), in each of a plurality of preset recognition cycles, the voice signal and the voice collection time stamp of each of several speaking terminals are recognized.
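Step (1) could look roughly like the following Python sketch. The source does not say how a recognition cycle is timed or how a terminal exposes its signal, so the `Terminal` class, its `poll` method, and `run_recognition_cycles` are hypothetical names introduced only for illustration.

```python
from collections import deque


class Terminal:
    """Hypothetical speaking terminal holding (time stamp, signal) pairs it has captured."""

    def __init__(self, terminal_id, pending):
        self.terminal_id = terminal_id
        self._pending = deque(pending)

    def poll(self):
        # Return the next captured (collected_at, signal) pair, or None if idle.
        return self._pending.popleft() if self._pending else None


def run_recognition_cycles(terminals, num_cycles):
    """In each cycle, read one signal and its collection time stamp per terminal."""
    records = []
    for cycle in range(num_cycles):
        for terminal in terminals:
            sample = terminal.poll()
            if sample is None:
                continue  # this terminal had nothing to say in this cycle
            collected_at, signal = sample
            records.append({
                "cycle": cycle,
                "terminal": terminal.terminal_id,
                "collected_at": collected_at,
                "signal": signal,
            })
    return records


terminals = [
    Terminal("A", [(0.5, "a1"), (2.1, "a2")]),
    Terminal("B", [(1.2, "b1")]),
]
records = run_recognition_cycles(terminals, num_cycles=2)
```

Each record keeps the time stamp next to the signal, so later steps can order the recognized text by when it was spoken rather than by when it was received.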

[0055] Prefe...

Abstract

The invention provides a voice recognition method and system for a multi-person speaking scene. The method and system sequentially receive and recognize voice signals in the multi-person speaking scene and also recognize the voice collection time stamp of each voice signal, which effectively overcomes the defect that the original voice recognition technology cannot correctly restore the speaking order of different speaking ends. Because the method and system accurately restore the speaking order of the different speaking ends, the accuracy of the final recognized text is guaranteed, the time spent on subsequently sorting voice recognition results is saved, and voice recognition efficiency is improved.

Description

Technical field

[0001] The present invention relates to the technical field of voice recognition, in particular to a voice recognition method and system for a multi-person speaking scene.

Background technique

[0002] At present, speech recognition technology is widely used in the field of human-computer interaction. Existing speech recognition technology can accurately and quickly identify the speech object corresponding to a speech signal and the meaning of the speech signal itself, which greatly promotes the application and development of human-computer interaction.

[0003] However, the advantages of existing speech recognition technology are limited to scenes in which a single person speaks. In a scene with multiple speakers, speech recognition must not only identify the speech objects and speech meanings corresponding to different speech signals, but also recognize and distinguish between different speech objects. The sequence of speeches between the s...

Application Information

IPC(8): G10L15/26, G10L15/28, G10L25/51

CPC: G10L15/26, G10L15/28, G10L25/51

Inventor: 何世阳, 王善彬

Owner: BEIJING UNISOUND INFORMATION TECH
