Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition text segmentation method and device

A speech recognition and text segmentation technology, applied in speech recognition, speech analysis, semantic analysis, etc., can solve the problems of heavy workload and low efficiency of chapter structure

Active Publication Date: 2017-10-31
IFLYTEK CO LTD
View PDF13 Cites 46 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a speech recognition text segmentation method and device to solve the problem of large workload and low efficiency in the prior art by manually adjusting the chapter structure of the recognized text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition text segmentation method and device
  • Speech recognition text segmentation method and device
  • Speech recognition text segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0104] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0105] Such as figure 1 As shown, it is a flow chart of the speech recognition text segmentation method of the embodiment of the present invention, comprising the following steps:

[0106] In step 101, endpoint detection is performed on the voice data to obtain each voice segment and the start frame number and end frame number of each voice segment.

[0107] The voice data may be recorded according to practical applications, such as meeting recordings, interview recordings, and the like.

[0108] The so-called endpoint detection is to find out the start point and end point of each speech segment from a given speech signal. Specifically, some endpoint detection methods in the prior art may be used, which is not...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a speech recognition text segmentation method and device. The method comprises: performing endpoint detection on speech data, to obtain speech segments and a starting frame serial number and an ending frame serial number of each speech segment; performing speech recognition on each speech segment, to obtain a recognition text corresponding to each speech segment; extracting a segmentation feature of the recognition text corresponding to each speech segment; by using the extracted segmentation feature and a pre-established segmentation model, performing segmented detection on the recognition text corresponding to the voice data, to determine a position where segmentation is needed; and segmenting the recognition text corresponding to the speech data according to a segmented detection result. According to the method and apparatus disclosed by the present invention, the recognition text can be segmented automatically, so that the structure of the recognition text can be clearer.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a text segmentation method and device for speech recognition. Background technique [0002] With the development of speech technology, automatic speech recognition technology has been widely used in various fields of life. Converting speech into text greatly facilitates people's daily needs, such as converting meeting recordings into text and sending them to participants as meeting minutes ; Convert the recordings of interviews with journalists into texts, and then edit them into press releases, etc. However, the recognized text obtained by speech recognition does not have a clear chapter structure like the manually edited text, such as the division of paragraph structure, which makes it difficult for users to find the focus or theme of the entire recognized text when viewing the recognized text, especially When the recognized text is large and involves multiple topics...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G10L15/04G10L15/02
CPCG10L15/02G10L15/04G06F40/211G06F40/30
Inventor 胡尹潘清华王金钖胡国平胡郁
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products