Microphone selection and multi-talker segmentation with ambient automated speech recognition (ASR)

Microphone and audio technology, applied in speech recognition, speech analysis, instruments, and related fields, that addresses problems such as unreliable TDOA estimation and the lack of microphone selection.

Active Publication Date: 2019-08-27
NUANCE COMM INC

AI Technical Summary

Problems solved by technology

[0006] Although time difference of arrival (TDOA) has been used in many different fields, it has not been used for microphone selection, because TDOA estimates can be made excessively noisy by ambient noise, reverberation, and/or head motion of a human speaker. In addition, microphones located far from the audio source often produce unreliable TDOA estimates.

Embodiment Construction

[0029] In the following description of various illustrative embodiments, reference is made to the accompanying drawings which form a part hereof, and which show, by way of illustration, various embodiments in which aspects of the invention may be practiced. It is to be understood that other embodiments may be utilized and structural or functional modifications may be made without departing from the scope of the present invention.

[0030] It is to be understood that the phraseology and terminology used herein are for the purpose of description and should not be regarded as limiting. Instead, phrases and terms used herein are to be given their broadest interpretations and meanings. The use of "including" and "comprising" and variations thereof are intended to cover the items listed thereafter and equivalents thereof as well as other items and equivalents thereof. Use of the terms "mounted," "connected," "coupled," "positioned," "engaged" and similar terms is meant to include b...

Abstract

Disclosed methods and systems are directed to determining a best microphone pair and segmenting sound signals. The methods and systems may include receiving a collection of sound signals comprising speech from one or more audio sources (e.g., meeting participants) and/or background noise. The methods and systems may include calculating a TDOA and determining, based on the TDOA and via robust statistics, the best pair of microphones. The methods and systems may also include segmenting sound signals from multiple sources.
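The selection step described in the abstract can be illustrated with a small sketch. The text above does not say which TDOA estimator or which robust statistic is used, so both choices here are assumptions: TDOA is estimated per frame by plain time-domain cross-correlation, and the "best" pair is taken to be the one whose per-frame TDOAs have the smallest median absolute deviation (MAD). All function names and parameters (`estimate_tdoa`, `select_best_pair`, `frame_len`, `max_lag`, and so on) are hypothetical and not taken from the patent.

```python
import random
from itertools import combinations
from statistics import median


def estimate_tdoa(x, y, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation of x and y.

    A positive result means y is a delayed copy of x.
    Assumption: plain cross-correlation; the source does not name an estimator.
    """
    best_lag, best_score = 0, float("-inf")
    n = len(x)
    for lag in range(-max_lag, max_lag + 1):
        lo, hi = max(0, -lag), min(n, n - lag)
        score = sum(x[i] * y[i + lag] for i in range(lo, hi))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def frame_tdoas(x, y, frame_len, max_lag):
    """Estimate one TDOA per non-overlapping frame."""
    return [
        estimate_tdoa(x[s:s + frame_len], y[s:s + frame_len], max_lag)
        for s in range(0, len(x) - frame_len + 1, frame_len)
    ]


def mad(values):
    """Median absolute deviation, used here as the (assumed) robust spread measure."""
    values = list(values)
    m = median(values)
    return median(abs(v - m) for v in values)


def select_best_pair(channels, frame_len=128, max_lag=8):
    """Score every microphone pair by the MAD of its frame-wise TDOAs.

    Returns (best_pair, scores); a lower MAD is treated as a more reliable pair.
    """
    scores = {}
    for i, j in combinations(range(len(channels)), 2):
        scores[(i, j)] = mad(frame_tdoas(channels[i], channels[j], frame_len, max_lag))
    return min(scores, key=scores.get), scores


def make_demo_channels(n=1024, delay=3, seed=0):
    """Synthetic data: mic 1 is mic 0 delayed by `delay` samples; mic 2 is unrelated noise."""
    rng = random.Random(seed)
    source = [rng.gauss(0.0, 1.0) for _ in range(n)]
    delayed = [0.0] * delay + source[:n - delay]
    far_mic = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return [source, delayed, far_mic]


if __name__ == "__main__":
    best, scores = select_best_pair(make_demo_channels())
    print("best pair:", best)
    print("MAD per pair:", scores)
```

In this toy setup the pair whose TDOA stays constant across frames gets a MAD of zero, while pairs that include the unrelated channel produce scattered lags. A real system would work on recorded audio, and would likely use a frequency-domain estimator and handle speech activity, none of which is shown here.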

Description

[0001] Cross References to Related Applications

[0002] This patent application claims priority to U.S. Nonprovisional Patent Application Serial No. 15/403,481, filed January 11, 2017, entitled "METHOD FOR MICROPHONE SELECTION AND MULTI-TALKER SEGMENTATION WITH AMBIENT AUTOMATED SPEECH RECOGNITION (ASR)," and to U.S. Provisional Patent Application Serial No. 62/394,286, filed September 14, 2016, entitled "MICROPHONE SELECTION AND MULTI-TALKER SEGMENTATION WITH APPLICATION TO AMBIENT AUTOMATED SPEECH RECOGNITION (ASR)," the entire contents of both patent applications being incorporated herein by reference.

Technical Field

[0003] Aspects described herein relate generally to computers, computer systems, and automatic speech recognition. More specifically, aspects described herein are used to perform microphone selection and multi-talker segmentation to select an appropriate input stream for performing automatic speech recognition (ASR).

Background

[0004] Speaker...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L17/06, G10L25/03
CPC: G10L15/00, G10L17/06, G10L25/03, G10L25/84, G10L21/0216, H04R3/005, H04R2410/01, G10L15/04, G10L17/16, G10L21/0232, G10L21/028, G10L2021/02166, H04R1/406
Inventors: Pablo Peso Parada, Dushyant Sharma, Patrick Naylor
Owner: NUANCE COMM INC