A method and device for distinguishing between machine voice and natural voice

A speech technology, applied in speech analysis, speech recognition, instruments, etc., that addresses complex or personalized problems that automated speech systems cannot solve

Active Publication Date: 2020-10-09
武汉恩特拉信息技术有限公司
6 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

For example, in scenarios such as telephone customer service, high-accuracy speech recognition technology can play a supporting role and can solve simple, common problems raised by users, but it cannot solve complex or personalized problems.

Examples

Embodiment 1

[0067] In this embodiment, performing morpheme segmentation on the collected speech to obtain morpheme speech includes:

[0068] Performing speech recognition on the collected speech and obtaining the pronunciation interval between the syllables of the collected speech;

[0069] Treating one or more syllables whose pronunciation interval in the recognized speech is smaller than the preset interval value as one morpheme speech.

[0070] Syllables are defined according to the characteristics of the language. For example, in Chinese, the pronunciation of one character generally corresponds to one syllable, and a syllable usually comprises an initial, a final, and a tone. The pronunciation of a word can correspond to one or more syllables.

[0071] The preset interval value only needs to be chosen so that individual syllables can be distinguished.

[0072] In one embodiment, the preset interval value can be adjusted according to the speech rate of the speech producer, and the preset...
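
The segmentation rule in [0068] and [0069] can be sketched as follows. This is a minimal illustration, not the patent's implementation: the `Syllable` record, the `segment_morphemes` function, the timestamps, and the 0.15 s threshold are all hypothetical, and the sketch assumes the recognizer already supplies onset and offset times for each syllable and that only consecutive syllables are merged.

```python
from dataclasses import dataclass


@dataclass
class Syllable:
    text: str     # syllable label (assumed recognizer output)
    start: float  # onset time in seconds
    end: float    # offset time in seconds


def segment_morphemes(syllables, preset_interval):
    """Group consecutive syllables whose gap is below preset_interval."""
    morphemes = []
    current = []
    for syl in syllables:
        # A gap at or above the preset interval closes the current morpheme.
        if current and syl.start - current[-1].end >= preset_interval:
            morphemes.append(current)
            current = []
        current.append(syl)
    if current:
        morphemes.append(current)
    return morphemes


# Hypothetical timestamps for four syllables.
syllables = [
    Syllable("ni", 0.00, 0.20),
    Syllable("hao", 0.25, 0.50),
    Syllable("qing", 0.90, 1.10),
    Syllable("wen", 1.14, 1.35),
]
groups = segment_morphemes(syllables, preset_interval=0.15)
labels = ["".join(s.text for s in g) for g in groups]
print(labels)  # ['nihao', 'qingwen']
```

A speech-rate adjustment such as the one mentioned in [0072] could be modeled by scaling `preset_interval` before the call; the sketch leaves that out because the source text is truncated at that point.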

Embodiment 2

[0075] In this embodiment, obtaining, from the segmented morpheme speech, the number of morpheme speeches belonging to the same type of morpheme speech includes:

[0076] Obtaining the sound feature value of each morpheme speech obtained by segmentation;

[0077] Assigning morpheme speeches with close sound feature values to the same type of morpheme speech;

[0078] Counting the morpheme speeches belonging to the same type of morpheme speech.

[0079] In one embodiment, the sound feature values include feature values obtained based on syllables.

[0080] In one embodiment, morpheme speeches of the same type can be placed in the same morpheme speech set, and the number of morpheme speeches belonging to that type can be obtained by counting the morpheme speeches in the set.

[0081] In one embodiment, if the quantity in the same morpheme voice set exceeds the third preset value, the morpheme voices in the set can be further classified according to semantics, and the morpheme voi...
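
The grouping and counting steps in [0076] to [0080] might look like the sketch below. The source does not say how "close" feature values are measured, so the Euclidean distance, the `tolerance` parameter, the greedy first-match strategy, and the sample feature vectors are all assumptions made for illustration.

```python
import math


def group_by_feature(features, tolerance):
    """Greedily place each feature vector in the first set whose first
    member lies within `tolerance` (Euclidean distance); otherwise
    start a new set. Returns lists of indices into `features`."""
    groups = []
    for idx, vec in enumerate(features):
        for group in groups:
            if math.dist(vec, features[group[0]]) <= tolerance:
                group.append(idx)
                break
        else:
            # No existing set is close enough, so open a new one.
            groups.append([idx])
    return groups


# Hypothetical 2-D sound feature values for five morpheme speeches.
features = [(1.0, 2.0), (1.1, 2.1), (5.0, 5.0), (0.9, 1.9), (5.2, 4.9)]
groups = group_by_feature(features, tolerance=0.5)
counts = [len(g) for g in groups]
print(groups)  # [[0, 1, 3], [2, 4]]
print(counts)  # [3, 2]
```

Each inner list plays the role of one morpheme speech set from [0080], and its length is the count that is later compared against the first preset value.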

Embodiment 3

[0084] In this embodiment, comparing the counted morpheme speeches belonging to the same type of morpheme speech includes:

[0085] Obtaining, from the collected speech, the pronunciation length and/or intonation of the morpheme speeches belonging to the same type of morpheme speech;

[0086] Comparing the pronunciation length and/or intonation of the morpheme speeches belonging to the same type of morpheme speech.

[0087] In one embodiment, the method further includes obtaining, from the collected speech, the pronunciation intervals of the morpheme speeches belonging to the same type of morpheme speech. The pronunciation interval can reflect the speaking habits of the speech producer, especially a real person.

[0088] The technical solution for distinguishing between machine speech and natural speech proposed by the embodiments of the present invention helps to capture the voices of different speech producers (especially real people) by acquiring the pronunciation length and / or inton...
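
One way to turn the comparison in [0085] and [0086] into a number is sketched below. The source does not define the similarity measure, so the coefficient-of-variation score, the `similarity` function, and the sample durations are assumptions. The same function could be applied to intonation, for example a mean pitch per morpheme speech.

```python
from statistics import mean, pstdev


def similarity(values):
    """Return a score in [0, 1] for positive measurements; 1.0 means
    every repetition is identical (assumed metric, not from the source)."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("expected a non-empty list of positive values")
    # Coefficient of variation: spread relative to the average.
    cv = pstdev(values) / mean(values)
    return max(0.0, 1.0 - cv)


# Hypothetical pronunciation lengths (seconds) of one morpheme type.
machine_lengths = [0.30, 0.30, 0.30, 0.30]
human_lengths = [0.28, 0.35, 0.31, 0.40]

machine_score = similarity(machine_lengths)
human_score = similarity(human_lengths)
print(machine_score)
print(human_score)
```

Under this assumed metric, repetitions that are exactly alike score 1.0, while the natural variation of a human speaker lowers the score.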

Abstract

The invention relates to a method and a device for distinguishing machine voice from natural voice. The method comprises the following steps: collecting voice from the same voice-generating party; performing morpheme division on the collected voice to obtain morpheme voices; obtaining, from the morpheme voices produced by the division, the number of morpheme voices belonging to the same kind of morpheme voice; when the number for a given kind reaches a first preset value, comparing those morpheme voices of the same kind with one another; determining the degree of similarity of the morpheme voices belonging to the same kind; and, when that degree of similarity exceeds a second preset value, determining that the collected voice is machine voice. The method and device provided by the embodiment can quickly and accurately determine whether a voice is machine voice or natural voice.
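
The decision rule in the abstract, with its two preset values, can be sketched as below. The function names, the range-based similarity measure, the dictionary layout, and the threshold values are hypothetical. Returning `False` when no kind of morpheme voice reaches the first preset value is also an assumption, since the abstract does not say what happens in that case.

```python
def range_similarity(samples):
    """Assumed similarity for positive values: 1.0 when all are equal."""
    return 1.0 - (max(samples) - min(samples)) / max(samples)


def is_machine_voice(groups, first_preset, second_preset):
    """groups maps each kind of morpheme voice to its measurements,
    for example pronunciation lengths in seconds."""
    for samples in groups.values():
        # Only kinds with enough repetitions are compared.
        if len(samples) >= first_preset:
            if range_similarity(samples) > second_preset:
                return True
    return False


# Hypothetical measurements from two callers.
machine_call = {"nihao": [0.45, 0.45, 0.45], "xiexie": [0.50, 0.62]}
human_call = {"nihao": [0.40, 0.52, 0.45]}

print(is_machine_voice(machine_call, first_preset=3, second_preset=0.95))  # True
print(is_machine_voice(human_call, first_preset=3, second_preset=0.95))    # False
```

The first preset value guards against judging from too few repetitions, and the second sets how uniform the repetitions must be before the voice is treated as machine voice.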

Description

Technical field

[0001] The invention belongs to the technical field of speech recognition, and in particular relates to a method and a device for distinguishing machine speech and natural speech.

Background technique

[0002] Speech recognition technology is already a relatively mature technology, and many solutions have been developed to meet the needs of different scenarios. For example, in scenarios such as telephone customer service, speech recognition technology with high accuracy can play a certain auxiliary role, and can solve simple and common problems raised by users, but cannot solve complex or personalized problems. Therefore, for complex or personalized problems, users still hope that the customer service is a real person, so as to effectively solve the problems that users need to solve. How to quickly and accurately distinguish the nature of the voice generator has become a technical problem to be solved in this application.

Contents of the invention

[0003...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G10L15/02, G10L15/10
Inventor: Not disclosed
Owner: 武汉恩特拉信息技术有限公司