Language recognition method and device, and device for language recognition

A language recognition technique operating on speech segments, applied in the computer field. It addresses problems such as the difficulty of guaranteeing sufficiently long speech segments in practice, the system performance degradation caused by insufficient representation of short test speech segments, and the resulting difficulty of improving the accuracy of a language recognition system. It also achieves the effect of avoiding recognition delay.

Status: Inactive
Publication Date: 2020-03-27
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

AI Technical Summary

Problems solved by technology

However, sufficiently long speech segments are difficult to guarantee in practical applications. For example, in the language recognition system of a mobile phone, longer speech segments (such as more than 30 seconds) can be used in the training phase, while in the testing phase a speech segment is generally only 3 to 5 seconds long.
Because extracting i-Vector features requires estimating sufficient statistics of the acoustic features, short test speech segments are insufficiently represented. This degrades system performance and makes it difficult to improve the accuracy of the language recognition system.
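The dependence on segment length can be illustrated with a minimal sketch of the zeroth- and first-order sufficient statistics that i-Vector extraction accumulates over a Gaussian mixture model. Everything concrete here is an assumption for illustration and is not taken from the patent: the function name `baum_welch_stats`, the 8-component diagonal-covariance mixture, the 13-dimensional features, the rate of 100 frames per second, and the random data.

```python
import numpy as np

def baum_welch_stats(frames, weights, means, variances):
    """Zeroth- and first-order statistics of frames under a diagonal GMM.

    frames: (T, D); weights: (C,); means and variances: (C, D).
    Returns n with shape (C,) and f with shape (C, D).
    """
    diff = frames[:, None, :] - means[None, :, :]                  # (T, C, D)
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_post = np.log(weights) + log_gauss                         # (T, C)
    log_post -= log_post.max(axis=1, keepdims=True)                # numerical stability
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)                        # per-frame posteriors
    n = post.sum(axis=0)                                           # soft frame counts
    f = post.T @ frames                                            # posterior-weighted sums
    return n, f

# Hypothetical toy mixture and random "acoustic features".
rng = np.random.default_rng(0)
C, D = 8, 13
weights = np.full(C, 1.0 / C)
means = rng.normal(size=(C, D))
variances = np.ones((C, D))

short_segment = rng.normal(size=(300, D))    # about 3 s at an assumed 100 frames/s
long_segment = rng.normal(size=(3000, D))    # about 30 s at the same assumed rate

n_short, f_short = baum_welch_stats(short_segment, weights, means, variances)
n_long, f_long = baum_welch_stats(long_segment, weights, means, variances)

print("mean frames per component, short:", n_short.sum() / C)
print("mean frames per component, long: ", n_long.sum() / C)
```

The soft counts in `n` always sum to the number of frames, so a 3-second segment leaves each mixture component with roughly a tenth of the evidence that a 30-second segment provides, which is why the statistics become unreliable for short test segments.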



Examples


Embodiment Construction

[0031] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0032] Method embodiment

[0033] Refer to Figure 1, which shows a flow chart of the steps of an embodiment of a language recognition method of the present invention. The method may specifically include the following steps:

[0034] Step 101, performing acoustic feature extraction on the speech segment to obtain an acoustic feature based on the frame sequence;

[0035] Step 102, inputting the acoustic features into a deep neural network, the deep neural netw...


Abstract

The embodiment of the invention provides a language recognition method, a language recognition device, and a device for language recognition. The language recognition method specifically comprises the following steps: performing acoustic feature extraction on a speech segment to obtain acoustic features based on a frame sequence; inputting the acoustic features into a deep neural network, the deep neural network comprising a bottleneck layer and a time recursion layer, with the output of the bottleneck layer connected to the time recursion layer; extracting a bottleneck feature sequence from the bottleneck layer and inputting it into the time recursion layer, which outputs a high-level feature sequence; and determining the language type corresponding to the speech segment according to the high-level feature sequence. According to the embodiment of the invention, the accuracy of language recognition can be improved.
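The data flow described in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the log-spectrum front end, the 16 kHz sampling rate, the layer sizes, the plain tanh recurrent layer standing in for the time recursion layer, the mean pooling over time before the softmax, and the random untrained weights are all hypothetical choices, as are the names `frame_signal`, `log_spectrum`, and `BottleneckRecurrentNet`.

```python
import numpy as np

def frame_signal(signal, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping frames (assumed 25 ms / 10 ms at 16 kHz)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx]

def log_spectrum(frames, n_bins=40):
    """Stand-in acoustic feature: log magnitude of the first n_bins FFT bins."""
    windowed = frames * np.hanning(frames.shape[1])
    spec = np.abs(np.fft.rfft(windowed, axis=1))[:, :n_bins]
    return np.log(spec + 1e-8)

class BottleneckRecurrentNet:
    """Feed-forward layers ending in a bottleneck, followed by a recurrent layer."""

    def __init__(self, in_dim, hidden_dim, bn_dim, rnn_dim, n_languages, seed=0):
        rng = np.random.default_rng(seed)
        init = lambda *shape: rng.normal(0.0, 0.1, shape)   # untrained weights
        self.w1, self.b1 = init(in_dim, hidden_dim), np.zeros(hidden_dim)
        self.wb, self.bb = init(hidden_dim, bn_dim), np.zeros(bn_dim)
        self.wx, self.wh = init(bn_dim, rnn_dim), init(rnn_dim, rnn_dim)
        self.bh = np.zeros(rnn_dim)
        self.wo, self.bo = init(rnn_dim, n_languages), np.zeros(n_languages)

    def bottleneck(self, feats):
        """Frame-wise bottleneck feature sequence, shape (T, bn_dim)."""
        hidden = np.tanh(feats @ self.w1 + self.b1)
        return hidden @ self.wb + self.bb

    def recurrent(self, bn_seq):
        """High-level feature sequence from a simple tanh recurrence, shape (T, rnn_dim)."""
        h = np.zeros(self.wh.shape[0])
        outputs = []
        for x in bn_seq:
            h = np.tanh(x @ self.wx + h @ self.wh + self.bh)
            outputs.append(h)
        return np.stack(outputs)

    def classify(self, high_seq):
        """Assumed decision rule: average over time, then softmax over languages."""
        logits = high_seq.mean(axis=0) @ self.wo + self.bo
        e = np.exp(logits - logits.max())
        return e / e.sum()

# A 3-second random signal stands in for a short test speech segment.
signal = np.random.default_rng(1).normal(size=16000 * 3)
feats = log_spectrum(frame_signal(signal))               # frame-based acoustic features
net = BottleneckRecurrentNet(in_dim=40, hidden_dim=64, bn_dim=16, rnn_dim=32, n_languages=3)
bn_seq = net.bottleneck(feats)                           # bottleneck feature sequence
high_seq = net.recurrent(bn_seq)                         # high-level feature sequence
probs = net.classify(high_seq)                           # language posteriors
print(feats.shape, bn_seq.shape, high_seq.shape, probs.argmax())
```

Because every stage operates frame by frame, this structure needs no segment-level sufficient statistics, which is consistent with the motivation given for short test segments. A trained system would learn the weights from labelled speech, and the actual recurrent unit and decision rule may differ from the ones assumed here.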

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a language recognition method, a language recognition device, and a device for language recognition.

Background art

[0002] Language recognition refers to the technology of automatically processing a piece of speech by computer and determining its language type. Language recognition technology is mainly used in the front end of multilingual speech processing systems. Automatically classifying speech through language recognition can save resources, avoid tedious manual classification, and greatly improve work efficiency.

[0003] At present, language recognition systems usually use i-Vector features, which represent a segment of speech with a fixed-length, low-dimensional vector. Extracting i-Vector features requires calculating sufficient statistics of the acoustic features; therefore, the effective data of each speech segment must be long enough to ensure t...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/00, G10L15/02, G10L25/51
CPC: G10L15/005, G10L15/02, G10L25/51
Inventor: 陈艳妮, 潘逸倩, 于泓
Owner: BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD