Supercharge Your Innovation With Domain-Expert AI Agents!

Voice recognition and encoding and decoding method and device, electronic equipment and storage medium

A speech recognition and coding technology, applied in speech recognition, speech analysis, instruments, etc., to achieve the effect of improving accuracy and recognition efficiency

Active Publication Date: 2022-01-04
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there is no better implementation in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition and encoding and decoding method and device, electronic equipment and storage medium
  • Voice recognition and encoding and decoding method and device, electronic equipment and storage medium
  • Voice recognition and encoding and decoding method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0041] In addition, it should be understood that the term "and / or" in this article is only an association relationship describing associated objects, which means that there may be three relationships, for example, A and / or B may mean: A exists alone, and A exists at the same time. and B, there are three cases of B alone. In addition, the character " / " in this article g...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice recognition and encoding and decoding method and a device thereof, electronic equipment and a storage medium, and relates to the field of intelligent voice, deep learning, natural language processing and other artificial intelligence, and the voice recognition method can comprise the steps: obtaining the audio features of a to-be-recognized voice; encoding the obtained audio features to obtain encoded features; performing truncation processing on the obtained coding features to obtain N continuous feature fragments, wherein N is a positive integer greater than one; and for any feature segment, obtaining corresponding historical feature abstract information, encoding the feature segment in combination with the historical feature abstract information, and decoding an encoding result to obtain an identification result corresponding to the feature segment, wherein the historical feature abstract information is information obtained by performing feature abstraction on the identified historical feature segment. By applying the scheme of the invention, the accuracy of an identification result and the identification efficiency can be improved.

Description

technical field [0001] The present disclosure relates to the field of artificial intelligence technology, and in particular to speech recognition and codec methods, devices, electronic equipment, and storage media in the fields of intelligent speech, deep learning, and natural language processing. Background technique [0002] Automatic speech recognition refers to the process of automatically converting the input speech into corresponding text by computer. With the in-depth research of deep learning technology in the field of speech recognition, especially the proposal of end-to-end speech recognition technology, the performance of speech recognition system has been improved. a great improvement. Moreover, with the continuous popularization of various smart devices, large-scale vocabulary speech recognition products have been widely used in fields such as smart customer service, car navigation, and smart speakers. [0003] In speech recognition with a large vocabulary, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/16G10L15/22G10L19/00
CPCG10L15/22G10L15/02G10L19/0018G10L15/16G06F16/683G10L15/187G10L15/26
Inventor 付晓寅陈志杰梁鸣心杨明顺贾磊王海峰
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More