Voice content processing method and device, equipment and readable storage medium

A voice content processing technology, applied in voice analysis, voice recognition, instruments, etc. It addresses the problems of high implementation cost, extensive modification, high maintenance cost, and increased memory usage, and achieves flexible application, reduced memory usage, and a reduced data footprint.

Pending Publication Date: 2021-07-02
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, the implementation cost of the above method is relatively high. The TensorFlow and PyTorch frameworks lack speech-decoder technology. Even if they are integrated into the speech framework Kaldi, combining the two frameworks inevitably increases memory usage, and the modification and maintenance costs are also high.



Examples


Embodiment Construction

[0035] In order to make the purpose, technical solution and advantages of the present application clearer, the implementation manners of the present application will be further described in detail below in conjunction with the accompanying drawings.

[0036] First, a brief introduction to the terms involved in the embodiments of this application:

[0037] Artificial Intelligence (AI): the theory, methods, technology, and application systems that use digital computers, or machines controlled by digital computers, to simulate, extend, and expand human intelligence, perceive the environment, acquire knowledge, and use that knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive branch of computer science that attempts to understand the nature of intelligence and to produce a new kind of intelligent machine that can respond in a way similar to human intelligence. Artificial intelligence is to study the design principles and implementation methods ...



Abstract

The invention discloses a voice content processing method, apparatus, device, and readable storage medium, and relates to the field of machine learning. The method comprises the following steps: acquiring voice content; performing feature extraction on the voice content to obtain an audio feature in a first data format; dynamically quantizing the audio feature to obtain a quantized feature in a second data format, wherein the first data format uses more data bits than the second data format; and inputting the quantized feature into a voice processing model, which outputs a content processing result. The model parameters of the voice processing model are quantized from the first data format to the second data format, and the audio feature is quantized into the second data format before it is processed. Because the first data format uses more data bits than the second, the overall data footprint of the voice processing model is reduced, which lowers the model's use of external storage (such as flash) and memory on the mobile device.
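The dynamic quantization step can be illustrated with a minimal sketch. The abstract does not name the two data formats or the quantization scheme, so everything specific here is an assumption: 32-bit floats as the first format, 8-bit signed integers as the second, and a symmetric scale derived from the maximum absolute value of each input. The names `dynamic_quantize` and `dequantize` are hypothetical.

```python
from array import array


def dynamic_quantize(features):
    """Quantize float32 features to int8 (assumed formats).

    The scale is computed from the current input rather than fixed in
    advance, which is what makes the quantization "dynamic" in this sketch.
    """
    max_abs = max((abs(x) for x in features), default=0.0)
    scale = max_abs / 127.0 if max_abs > 0.0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range.
    quantized = array(
        "b", (max(-127, min(127, round(x / scale))) for x in features)
    )
    return quantized, scale


def dequantize(quantized, scale):
    """Map int8 values back to approximate float32 values."""
    return array("f", (q * scale for q in quantized))


# Hypothetical audio feature values stored as 32-bit floats.
audio_features = array("f", [0.12, -1.5, 0.73, 2.54, -0.01, 0.0])

quantized, scale = dynamic_quantize(audio_features)
restored = dequantize(quantized, scale)

float_bytes = len(audio_features) * audio_features.itemsize
int8_bytes = len(quantized) * quantized.itemsize
max_error = max(abs(a - b) for a, b in zip(audio_features, restored))

print("bytes before:", float_bytes, "bytes after:", int8_bytes)
print("largest round-trip error:", max_error)
```

Under these assumptions each value shrinks from four bytes to one, and the round-trip error stays within half a quantization step. A real model would consume the int8 values together with the scale rather than dequantizing them first.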

Description

Technical field

[0001] The embodiments of the present application relate to the field of machine learning, and in particular to a method, apparatus, device, and readable storage medium for processing voice content.

Background technique

[0002] With the rapid development of the field of machine learning, there is an increasing demand for using offline neural networks on mobile devices, for example running neural network models on mobile devices in offline scenarios. The development of network algorithms has led to an increasing demand for computing and memory by neural networks, so that the computing power and memory space of mobile devices cannot bear it.

[0003] In related technologies, taking the speech recognition scenario as an example, the acoustic model is usually trained with the neural network frameworks TensorFlow and PyTorch and embedded in the speech framework Kaldi, so as to realize the quantization of the acoustic model.

[0004] However, the implementation cost of ...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G10L25/03, G10L25/30, G10L15/02, G10L15/16, G10L15/18
CPC: G10L25/03, G10L25/30, G10L15/02, G10L15/16, G10L15/18
Inventor: 李晋, 马龙, 张力, 张晓明
Owner TENCENT TECH (SHENZHEN) CO LTD