Unlock instant, AI-driven research and patent intelligence for your innovation.

Automatic labeling method for Yi language voice data

An automatic tagging and voice data technology, applied in the field of intelligent recognition, can solve problems such as uneven tagging quality, low data tagging accuracy, and high cost, and achieve the effects of promoting research progress, saving labor costs, and improving tagging efficiency

Active Publication Date: 2021-04-09
KUNMING UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing data labeling methods mainly include: first, the artificial voice data labeling method, which is costly and the quality of the labeling is uneven, which will affect the effect of later data use; second, the automatic labeling method in the field of images, etc. The automatic labeling method of image data has achieved great success in the field, but it does not have the versatility to be extended to other fields
Third, the automatic voice data labeling method based on single-modal features uses existing voice recognition technology principles to achieve labeling, but because only a certain aspect of feature factors is considered, the accuracy of data labeling is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic labeling method for Yi language voice data
  • Automatic labeling method for Yi language voice data
  • Automatic labeling method for Yi language voice data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the present invention are clearly and completely described below. Apparently, the described embodiments are part of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0056] The invention provides a method for automatically labeling voice data in Yi language, the method comprising the following steps:

[0057] Step 1: Process the Yi language samples into Yi language text samples and corresponding Yi language voice samples; then the Yi language text samples are shared and expressed at the language layer to obtain language layer shared identification samples; the Yi language voice samples are Perform shared representation on the a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an automatic labeling method for Yi language voice data. The automatic labeling method comprises the following steps: 1, respectively processing Yi language samples into Yi language text samples and corresponding Yi language voice samples; performing shared representation on the Yi language text sample in a language layer to obtain a language layer shared identification sample; performing shared representation on the Yi language voice sample in an acoustic layer to obtain an acoustic layer shared identification sample; 2, performing alignment fusion on the language layer shared identification sample and the acoustic layer shared identification sample to complete preprocessing so as to obtain Yi language sample data after alignment fusion of the language layer and the acoustic layer; 3, performing annotation parameter calculation on the aligned and fused Yi language sample data by utilizing an extended Gaussian mixture model and a hidden Markov model GMM-HMM; and 4, taking the annotation parameter calculation result as a basis for annotation, and finally finishing automatic annotation of the Yi language sample data. According to the method, the accuracy of automatic labeling can be improved.

Description

technical field [0001] The invention relates to the technical field of intelligent recognition, in particular to an automatic labeling method for Yi speech data. Background technique [0002] The Yi nationality is one of the most populous ethnic groups in southwestern my country, with a population of about 8.7 million, of which 5.1 million are in Yunnan alone. The Yi language belongs to the Yi branch of the Tibeto-Burman group of the Sino-Tibetan language family. It covers a wide geographical distribution, with many dialects, sub-dialects and native languages. It is divided into six major dialect regions: eastern, southern, western, northern, southeastern and central. The use of Yi language varies from place to place, and more than half of the residents in most areas where the Yi nationality gathers do not understand or basically understand Chinese. The Yi language has many characteristics at the acoustic level (phonemes, accents, consonants, etc.) and linguistics (vocabula...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284G06F40/117G06F40/30G10L15/22G10L17/04G10L25/24
CPCG06F40/284G06F40/30G10L25/24G10L15/22G10L17/04G06F40/117Y02D10/00
Inventor 何俊张彩庆周义方邹目权张云飞
Owner KUNMING UNIV