Audio processing method and device and mobile terminal

A technology of audio processing and audio data, which is applied in the field of information processing, can solve the problems of low recognition accuracy and the trouble of finding someone's speech, and achieve the effect of improving accuracy

Inactive Publication Date: 2018-05-25
VIVO MOBILE COMM CO LTD
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide an audio processing method, device, and mobile terminal to solve the problem in the prior art that it is troublesome to find someone's speech content in the audio of multiple people's speeches, and the recognition accuracy is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio processing method and device and mobile terminal
  • Audio processing method and device and mobile terminal
  • Audio processing method and device and mobile terminal

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0027] The embodiment of the present invention provides an audio processing method, the audio can be the recorded audio obtained by recording, or the chat recording audio obtained from chat software such as WeChat, QQ, etc., and there are multiple people speaking in the above audio . The execution subject of the embodiment of the present invention may be a mobile terminal, of course, may also be a server.

[0028] In a specific embodiment, if the audi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An embodiment of the invention discloses an audio processing method and device and a mobile terminal. The method comprises identifying each spokesperson in audio data to be processed; according to audio parameters of the audio data to be processed, splitting the audio data to be processed into a plurality of subaudio data, wherein the subaudio data is corresponding to unit statements; and markingthe spokesperson and speaking time information corresponding to each subaudio data. The audio processing method and device help users to search speeches released by some spokesperson in the audio datato be processed, and prevent influence of speech overlap on speech recognition accuracy and improve speech recognition accuracy.

Description

technical field [0001] The present invention relates to the technical field of information processing, in particular to an audio processing method, device and mobile terminal. Background technique [0002] With the rapid development of mobile terminals, audio applications such as recording or voice chat of mobile terminals have been widely developed, and audio-related functions have also been improved and developed. For example, voice-to-text, voiceprint recognition, etc. [0003] When using a mobile terminal for recording or voice chatting, there are often multiple people speaking in a piece of audio. At this time, if you want to find someone's speech content from the audio, you need to play the audio content or find the position of the person's speech by fast-forwarding, which is troublesome to find; There is voice overlap in the audio due to someone interrupting, and the voice recognition of the overlapping part is more difficult and error-prone, and the recognition acc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/00G10L17/02G06F17/30
CPCG06F16/686G10L17/00G10L17/02
Inventor 王亚运
Owner VIVO MOBILE COMM CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products