Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice edition device, voice edition method, and voice edition program

By using pattern matching processing to edit voice data in a voice recognition device, the problem of complex and low-efficiency editing voice data operations on mobile terminals is solved, and the effects of editing voice data and expanding the recognition dictionary on mobile terminals are achieved conveniently and cheaply.

Inactive Publication Date: 2008-05-21
PANASONIC INTELLECTUAL PROPERTY CORP OF AMERICA
View PDF6 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, for example, when editing a component part of a large amount of voice data, since the operation of inputting a large amount of voice data from its beginning results in very low efficiency, a technique for easily editing voice data is required

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

no. 1 approach

[0117] FIG. 1 is a block diagram of a speech recognition device (which uses a speech editing device for speech recognition according to the present invention) according to an embodiment of the present invention.

[0118] The speech recognition device includes a sound analysis unit 10, a feature parameter extraction unit 12, a changed part specifying unit 14 (including a pattern matching unit 16 for specifying a changed part), a standard pattern generating unit 18, a standard pattern database updating unit 20, a pattern matching unit (speech recognition unit of speech recognition device for speech recognition) 22 and standard pattern database (speech recognition dictionary file) 24. The type of data stored in the standard pattern database 24 may be "feature parameter (cepstrum)", "speech converted into text form (dictionary data as character string)", or "speech data (waveform data)". In the following description, it is assumed that “feature parameters (cepstrum)” are stored as...

no. 2 approach

[0139] The second embodiment describes the structure and operation of a speech recognition device and a sequence for generating standard patterns. In this embodiment, various standard patterns are used to identify announcements broadcast in trains or subways.

[0140]For example, a commuter commuting by train or subway may miss the station (eg, Shibuya station) where he is supposed to get off. In this case, when a commuter passenger carries a mobile terminal equipped with a voice recognition device, the mobile terminal can recognize a notice "This station is Shibuya" broadcast in a train or subway, and activate vibration when it recognizes the notice device to remind commuters, thereby providing convenience. Therefore, commuter passengers can be prevented from forgetting to get off. In the case where commuters often get off at "Yokohama", the mobile terminal can be configured to activate the vibrator when it recognizes that "this station is Yokohama".

[0141] In the case w...

no. 3 approach

[0164] The third embodiment describes a sequence of generating a new standard pattern to control settings of a mobile terminal equipped with a voice recognition device (for example, settings when e-mail is received) by a user's voice.

[0165] A user can change a screen displayed on a display unit of his mobile terminal or a ringing sound when an e-mail is received, and select a folder in which e-mails are accumulated.

[0166] In general, the screen or ringtone when mail is received is changed by operating the enter key. However, since the operation keys of the mobile terminal are small, it is inconvenient for the user to operate such keys. Therefore, it is convenient to change the screen or ringtone by inputting voice instead of keys.

[0167] The term "display settings" includes display settings of the standby screen of the phone and display settings of downloaded games in addition to the display settings of e-mails. Generally, when changing the setting of the mobile term...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

It is possible to easily enrich the standard patterns registered in an audio recognition device and effectively extend vocabulary which can be recognized in audio. Instead of creating a novel standard pattern from zero, a novel standard pattern is effectively created by partially modifying an existing standard pattern. A pattern matching unit (16) in a modification part specification unit (14) performs matching to identify a part of the existing standard pattern to be modified. A standard pattern creation unit (18) cuts out the audio data on the part of the standard pattern to be modified, deletes it, and replaces it with another audio data or combines it with another audio data to create a novel standard pattern. A standard pattern database update unit (20) adds the new standard pattern to a standard pattern database (24).

Description

technical field [0001] The invention relates to a voice editing device, a voice editing method and a voice editing program. Background technique [0002] Generally, when an editor edits recorded voice data, the editor designates and cuts an editing point while listening to the played voice. [0003] In Patent Document 5, when an editor creates a voice card (which is generated by recording the voice on the card and pasting a picture on the card), the editor uses an advanced voice editing program to express the voice on the editing window on the computer screen , and use tools such as the mouse to delete, cut, or combine parts of speech. [0004] In addition, the voice recognition apparatus uses a voice standard mode (hereinafter referred to as "standard mode") as a voice recognition dictionary to recognize voices. However, the standard model needs to be extended to increase the number of words that can be voice-recognized. In such cases, elements of existing standard patte...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/10G10L13/06
CPCG10L2015/0631G10L15/06
Owner PANASONIC INTELLECTUAL PROPERTY CORP OF AMERICA