Method for processing and acquiring human voice data

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of data and human voice, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as difficult to collect voice data, voice recognition performance, voice cannot be reused, and voice commands are difficult to recognize, so as to be suitable for popularization and implementation. Man-machine dialogue, ingeniously designed effects

Pending Publication Date: 2020-06-23

NANJING SILICON INTELLIGENCE TECH CO LTD

View PDF6 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, in this method, since the pitch of the speaker who actually uses the speech recognition function is often different from the pitch corresponding to the collected speech data, it is difficult to collect a large amount of speech data and ensure the speech recognition performance

Therefore, since an acoustic model is generally generated by learning voice data of an adult male, it is difficult to recognize voice commands of an adult female, an elderly person, or a child having a different voice pitch, and the recognized voice cannot be reused, let alone Carry out intelligent man-machine dialogue on the recognized voice, for this reason, the present invention proposes a method for processing and obtaining human voice data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0021] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments.

[0022] Reference figure 1 , A method for processing and acquiring human voice data, including the following steps,

[0023] S1: Acquire the sound signal collected by the collector of the mobile terminal; the sound signal is subjected to band-pass filtering processing of the pre-processor to obtain sampled data within a predetermined frequency range;

[0024] S2, collecting voice data of sampled data from a voice-based device;

[0025] S3, accumulate the voice data of the sampled data in the first memory;

[0026] S4, learning the accumulated voice data of the sample data;

[0027] S5, generating a personal acoustic model of the sampled data based ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for processing and acquiring human voice data. The method comprises the following steps: acquiring a voice signal acquired by a collector of a mobile terminal; collecting speech data of the sampled data from a speech-based device; storing a general acoustic model and a personal acoustic model of the sampled data in a second memory, wherein the second memory is connected with a tone conversion unit for converting the tone of the sampled data into other required tones, the tones being selecting from a database, the database being stored in a second memory; when avoice recognition request is received from the sampled data, extracting a feature vector from the voice data of the sampled data; selecting any one of the general acoustic model and a personal acoustic model of the sampled data based on a cumulant of speech data of the sampled data; and recognizing a voice command by using the extracted feature vector and the selected acoustic model. The method is ingenious in design, reasonable in method, capable of reasonably processing human voice and suitable for application and popularization.

Description

Technical field [0001] The present invention relates to the technical field of methods for processing and acquiring human voice data, and in particular to a method for processing and acquiring human voice data. Background technique [0002] According to the conventional voice recognition method, voice recognition is performed using an acoustic model that has been stored in a voice recognition device in advance. The acoustic model is used to represent the attributes of the speaker's speech. For example, phonemes, diphones, triphones, pentaphones, syllables, and characters are used as the basic units of the acoustic model. If phonemes are used as the basic model of an acoustic model, since the number of acoustic models is reduced, context-sensitive acoustic models such as diphones, triphones, or pentaphones are widely used to reflect the synergy caused by changes between adjacent phonemes The phenomenon of coarticulation. Large amounts of data are needed to learn context-sensiti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/26G10L17/04G10L17/22

CPCG10L15/06G10L15/02G10L15/08G10L17/04G10L17/22

Inventor司马华鹏胡红燕陆放茅玥琪司马德一

OwnerNANJING SILICON INTELLIGENCE TECH CO LTD

Method for processing and acquiring human voice data

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology