Geotagged environmental audio for enhanced speech recognition accuracy

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A geo-tagging and geographic location technology, applied in the field of speech recognition, can solve problems such as difficulty in accurately identifying spoken utterances, and achieve the effects of improving the accuracy of speech recognition, increasing computing efficiency, and improving process optimization.

Active Publication Date: 2013-02-06

GOOGLE LLC

View PDF5 Cites 22 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Ambient audio may partially obscure the user's speech making it difficult for an automated speech recognition ("ASR") engine to accurately recognize spoken words

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0018] figure 1 is a diagram of an example system 100 that uses geotagged ambient audio to enhance speech recognition accuracy. figure 1 Also illustrated is the flow of data within the system 100 during state (a) through state (i) and the user interface 158 displayed on the mobile device 104 during state (i).

[0019] More specifically, system 100 includes server 106 and ASR engine 108 in communication with mobile client communication devices including mobile device 102 and mobile device 104 over one or more networks 110 . Server 106 may be a search engine, a dictation engine, a dialogue system, or any other engine or system that uses transcribed speech. The network 110 may include a wireless cellular network, a wireless local area network (WLAN) or a Wi-Fi network, a third generation (3G) or fourth generation (4G) mobile telecommunications network, a private network (such as an intranet), a public network (such as the Internet) or any suitable combination thereof.

[002...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving geotagged audio signals that correspond to environmental audio recorded by multiple mobile devices in multiple geographic locations, receiving an audio signal that corresponds to an utterance recorded by a particular mobile device, determining a particular geographic location associated with the particular mobile device, generating a noise model for the particular geographic location using a subset of the geotagged audio signals, where noise compensation is performed on the audio signal that corresponds to the utterance using the noise model that has been generated for the particular geographic location.

Description

[0001] Cross References to Related Applications [0002] This application claims priority to US Application No. 12 / 760,147, filed April 14, 2010, and entitled GEOTAGGED ENVIRONMENTAL AUDIO FOR ENHANCED SPEECH RECOGNITION ACCURACY, the disclosure of which is incorporated herein by reference. technical field [0003] This specification deals with speech recognition. Background technique [0004] As used in this specification, a "search query" includes one or more query terms that a user submits to a search engine when the user requests the search engine to perform a search query, where a "search term" or "query term" includes one or more A full or partial word, character, or string. The "results" of a search query (or "search results") include, among other things, a Uniform Resource Identifier (URI) that references a resource that the search engine determined was responsive to the search query. The search results may include other things such as titles, preview images, user...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L21/0208

CPCG10L21/0208G10L15/20

InventorT·克里斯特詹森M·I·洛伊德

OwnerGOOGLE LLC

Geotagged environmental audio for enhanced speech recognition accuracy

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology