Voice processing method and device based on artificial intelligence

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of artificial intelligence and speech processing, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of easy error in segmentation or recognition process, high labor cost, low efficiency of labeling data, etc.

Active Publication Date: 2018-02-02

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF5 Cites 71 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] For this reason, the first object of the present invention is to propose a speech processing method based on artificial intelligence, to realize automatic segmentation and labeling of speech, and to form labeling data with high accuracy for training the speech synthesis model , which is used to solve the problems of low efficiency of labeling data generation in the existing manual labeling method, prone to errors in the process of segmentation or recognition, and high labor costs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0042] The artificial intelligence-based speech processing method and device thereof according to the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0043] figure 1 It is a schematic flowchart of an artificial intelligence-based speech processing method provided by an embodiment of the present invention. Such as figure 1 Shown, this speech processing method based on artificial intelligence comprises the following steps:

[0044] S101. Collect the voice and segment it...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention proposes a voice processing method and device based on artificial intelligence, and the method comprises the steps: voice for segmentation, forming a plurality of voice segments, recognizing each voice segment, obtaining a recognition text segment of each voice segment, determining an original text segment of a current recognition text segment from an original text corresponding to the current recognition text segment according to the sequence of recognition text segments, splicing the original text segment and the voice segments corresponding to an original text segment, obtaining a sentence text and sentence voice corresponding to the sentence text, generating the pinyin of the sentence text, forming a phone sequence according to the pinyin, enabling the phone sequence andthe sentence voice to be aligned, obtaining a phone boundary, and forming target data for the training of the voice synthesis model through the sentence text, sentence voice, pinyin and phone boundary. Therefore, the method achieves the automatic segmentation and marking of the voice, and forms the marking data which is higher in accuracy and is used for training the voice synthesis model.

Description

technical field [0001] The invention relates to the field of artificial intelligence, in particular to an artificial intelligence-based speech processing method and device thereof. Background technique [0002] Artificial Intelligence (Artificial Intelligence), the English abbreviation is AI. It is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that responds in a manner similar to human intelligence. Research in this field includes robotics, speech recognition, image recognition, natural language processing and expert systems, etc. [0003] At present, in the field of speech synthesis, most of the speech segmentation is performed manually, and then the original text corresponding to each speech segment is man...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/04G10L15/05G10L15/06G10L15/14

CPCG10L15/04G10L15/05G10L15/063G10L15/142G10L2015/0631

Inventor孔德威

OwnerBAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Voice processing method and device based on artificial intelligence

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology