Method for compressing dictionary data

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
a dictionary and data compression technology, applied in the field of speech recognition, can solve the problems of inability to represent pronunciation by general pronunciation rules, inability to correctly inability to properly generate pronunciation of some words, etc., to achieve effective compression, reduce the entropy of the dictionary, and improve the effect of compression

Inactive Publication Date: 2007-03-29

NOKIA CORP

View PDF13 Cites 164 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Benefits of technology

"The invention provides a more efficient method for compressing a pronunciation dictionary. The method involves pre-processing the dictionary by aligning and interleaving each entry with a statistical algorithm, such as HMM-Viterbi, to reduce the entropy of the dictionary and improve compression. The pre-processed dictionary is then used to convert speech or text into a sequence of phoneme units, which can be further processed and stored in a compressed format. The invention also includes an electronic device that performs the pre-processing and conversion steps. The technical effects of the invention include improved compression ratio, reduced memory requirement, and improved speech recognition accuracy."

Problems solved by technology

Although in many languages pronunciation of many words can be represented by rules, or even models, the pronunciation of some words can still not be correctly generated by these rules or models.

However, in many languages, the pronunciation cannot be represented by general pronunciation rules, but each word has a specific pronunciation.

In mobile phones the memory size is often limited due to reasons of cost and hardware size.

This imposes limitations also on speech recognition applications.

However, the problem with the statistical based method is that it requires a large working memory (buffer) during the decompression process.

Therefore this solution is not suitable for use in small portable electronic devices such as mobile terminals.

Although the existing compression methods are good in general, the compression of the pronunciation dictionaries is not efficient enough for portable devices.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023]FIG. 1 illustrates a data processing device (TE) only for the parts relevant to a preferred embodiment of the invention. The data processing device (TE) can be, for example, a personal computer (PC) or a mobile terminal. The data processing unit (TE) comprises I / O means (I / O), a central processing unit (CPU) and memory (MEM). The memory (MEM) comprises a read-only memory ROM portion and a rewriteable portion, such as a random access memory RAM and FLASH memory. The information used to communicate with different external parties, e.g. a CD-rom, other devices and the user, is transmitted through the I / O means (I / O) to / from the central processing unit (CPU). The central processing unit (CPU) provides a pre-processing block (PRE) and a compression block (COM). The functionality of these blocks is typically implemented by executing a software code in a processor, but it can also be implemented with a hardware solution (e.g. an ASIC) or as a combination of these two. The pre-process...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS [0001] This is a continuation application of application Ser. No. 10 / 292,122, filed Nov. 11, 2002, the content of which is incorporated herein by reference in its entirety.BACKGROUND OF THE INVENTION [0002] The invention relates to speaker-independent speech recognition, and more precisely to the compression of a pronunciation dictionary. [0003] Different speech recognition applications have been developed during recent years for instance for car user interfaces and mobile terminals, such as mobile phones, PDA devices and portable computers. Known methods for mobile terminals include methods for calling a particular person by saying aloud his / her name into the microphone of the mobile terminal and by setting up a call to the number according to the name said by the user. However, present speaker-dependent methods usually require that the speech recognition system is trained to recognize the pronunciation for each name. Speaker-independent spee...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(United States)

IPC IPC(8): G10L15/04G10L15/06G10L15/02G10L15/12G10L15/14G10L25/90H03M7/30

CPCG10L15/12H03M7/30G10L2015/025

InventorTIAN, JILEI

OwnerNOKIA CORP

Method for compressing dictionary data

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology