Voice processing method and device thereof
A technology of speech processing and speech fragments, applied in the field of communication, can solve the problem that mixed speech cannot be separated quickly and effectively, and achieve the effect of quickly separating specific target speech
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0065] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking running on a mobile terminal as an example, figure 1It is a hardware structural block diagram of a mobile terminal of a voice processing method in an embodiment of the present invention, as figure 1 As shown, the mobile terminal 10 may include one or more ( figure 1 Only one is shown in the figure) a processor 102 (the processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data. Optionally, the above-mentioned mobile terminal also A transmission device 106 for communication functions as well as input and output devices 108 may be included. Those of ordinary skill in the art can understand that, figure 1 The shown structure is only for illustration, and does not limit the structure of the above mo...
Embodiment 2
[0088] In this embodiment, a voice processing device is also provided, which is used to implement the above embodiments and preferred implementation modes, and those that have been explained will not be repeated here. As used below, the term "module" may be a combination of software and / or hardware that realizes a predetermined function. Although the devices described in the following embodiments are preferably implemented in software, implementations in hardware, or a combination of software and hardware are also possible and contemplated.
[0089] image 3 is a block diagram of a speech processing device according to an embodiment of the present invention, such as image 3 shown, including:
[0090] Segmentation module 32, is used for dividing mixed speech into N speech segments by endpoint detection, wherein, said N is a natural number greater than or equal to 2;
[0091] The detection module 34 is configured to perform Bayesian information criterion BIC detection on any...
Embodiment 3
[0111] An embodiment of the present invention also provides a storage medium, in which a computer program is stored, wherein the computer program is set to execute the steps in any one of the above method embodiments when running.
[0112] Optionally, in this embodiment, the above-mentioned storage medium may be configured to store a computer program for performing the following steps:
[0113] S11, segmenting the mixed speech into N speech segments by endpoint detection, wherein the N is a natural number greater than or equal to 2;
[0114] S12. Perform Bayesian information criterion BIC detection on any two adjacent speech segments among the N speech segments, and discard the abnormal speech segment in the BIC detection to obtain a valid speech segment of the target object.
[0115] Optionally, in this embodiment, the above-mentioned storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, ROM for short), random access memory (Random Access ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com