Speech recognition text segmentation method and device

A speech recognition text segmentation technology, applicable to speech recognition, speech analysis, semantic analysis, and related fields, that addresses the heavy workload and low efficiency of manually adjusting the chapter structure of recognized text.

Active Publication Date: 2017-10-31

IFLYTEK CO LTD



Problems solved by technology

[0004] The present invention provides a speech recognition text segmentation method and device to solve the prior-art problem that manually adjusting the chapter structure of recognized text involves a heavy workload and is inefficient.


Embodiment Construction

[0104] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0105] Figure 1 is a flow chart of the speech recognition text segmentation method of the embodiment of the present invention. The method comprises the following steps:

[0106] In step 101, endpoint detection is performed on the voice data to obtain each voice segment and the start frame number and end frame number of each voice segment.

[0107] The voice data may be recorded as the practical application requires, for example meeting recordings or interview recordings.

[0108] Endpoint detection finds the start point and end point of each speech segment in a given speech signal. Specifically, some endpoint detection methods in the prior art may be used, which is not...
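The text above does not say which endpoint detection method is used, so the following is only a minimal sketch of one common prior-art approach, short-time energy thresholding. The function name `detect_endpoints` and the parameters `frame_len`, `energy_threshold`, and `min_silence_frames` are illustrative assumptions, not taken from the patent. The sketch shows how step 101 can yield a start frame number and an end frame number for each voice segment.

```python
from typing import List, Tuple


def detect_endpoints(
    samples: List[float],
    frame_len: int = 160,             # assumed samples per frame
    energy_threshold: float = 0.01,   # assumed mean-energy cutoff for "voiced"
    min_silence_frames: int = 3,      # assumed silence length that closes a segment
) -> List[Tuple[int, int]]:
    """Return inclusive (start_frame, end_frame) pairs, one per voice segment."""
    n_frames = len(samples) // frame_len

    # Mark each non-overlapping frame as voiced when its mean energy is high enough.
    voiced = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        voiced.append(energy > energy_threshold)

    segments: List[Tuple[int, int]] = []
    start = None        # start frame of the segment currently open, if any
    silence_run = 0     # consecutive unvoiced frames seen inside the open segment
    for i, is_voiced in enumerate(voiced):
        if is_voiced:
            if start is None:
                start = i
            silence_run = 0
        elif start is not None:
            silence_run += 1
            if silence_run >= min_silence_frames:
                # The segment ends at the last voiced frame before the silence.
                segments.append((start, i - silence_run))
                start = None
                silence_run = 0

    # Close a segment that is still open when the signal ends.
    if start is not None:
        segments.append((start, n_frames - 1 - silence_run))
    return segments
```

Silences shorter than `min_silence_frames` are absorbed into the surrounding segment, so brief pauses inside an utterance do not split it. A production system would more likely use a trained voice activity detector; the thresholding here is only meant to make the start and end frame numbers concrete.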


Abstract

The present invention discloses a speech recognition text segmentation method and device. The method comprises: performing endpoint detection on speech data to obtain speech segments and the start frame number and end frame number of each speech segment; performing speech recognition on each speech segment to obtain the recognition text corresponding to each speech segment; extracting a segmentation feature of the recognition text corresponding to each speech segment; using the extracted segmentation features and a pre-established segmentation model to perform segmentation detection on the recognition text corresponding to the speech data, so as to determine the positions where segmentation is needed; and segmenting the recognition text corresponding to the speech data according to the segmentation detection result. With the method and device disclosed by the present invention, the recognition text can be segmented automatically, so that the structure of the recognition text is clearer.
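The final steps of the abstract (extract a segmentation feature per segment, run a segmentation model, split the text at the detected positions) can be sketched as follows. This excerpt does not specify the patent's actual features or model, so everything here is an assumption for illustration: `RecognizedSegment`, `pause_before`, and `pause_threshold_model` are hypothetical names, the only feature used is the pause length derived from the frame numbers, and a fixed threshold stands in for the pre-established segmentation model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RecognizedSegment:
    start_frame: int   # from endpoint detection
    end_frame: int     # from endpoint detection (inclusive)
    text: str          # from speech recognition


def pause_before(segments: List[RecognizedSegment], i: int) -> int:
    """Hypothetical feature: silent frames between segment i-1 and segment i."""
    return segments[i].start_frame - segments[i - 1].end_frame - 1


def pause_threshold_model(
    segments: List[RecognizedSegment], i: int, min_pause_frames: int = 50
) -> bool:
    """Stand-in for the segmentation model: a long pause marks a boundary."""
    return pause_before(segments, i) >= min_pause_frames


def segment_text(
    segments: List[RecognizedSegment],
    is_boundary: Callable[[List[RecognizedSegment], int], bool],
) -> List[str]:
    """Join recognized texts into paragraphs, splitting where the model fires."""
    paragraphs: List[str] = []
    current: List[str] = []
    for i, seg in enumerate(segments):
        # A boundary before segment i closes the paragraph built so far.
        if i > 0 and is_boundary(segments, i):
            paragraphs.append(" ".join(current))
            current = []
        current.append(seg.text)
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs
```

Passing the boundary decision in as a callable keeps the splitting logic independent of the model, so a trained classifier over richer features could replace `pause_threshold_model` without changing `segment_text`.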

Description

Technical field

[0001] The invention relates to the field of natural language processing, in particular to a text segmentation method and device for speech recognition.

Background

[0002] With the development of speech technology, automatic speech recognition has been widely used in many areas of daily life. Converting speech into text is convenient for everyday needs, such as converting meeting recordings into text and sending them to participants as meeting minutes, or converting journalists' interview recordings into text and then editing them into press releases. However, the recognized text obtained by speech recognition lacks the clear chapter structure of manually edited text, such as a division into paragraphs. This makes it difficult for users to find the focus or theme of the recognized text when viewing it, especially when the recognized text is large and involves multiple topics...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G06F17/27, G10L15/04, G10L15/02

CPC: G10L15/02, G10L15/04, G06F40/211, G06F40/30

Inventor: 胡尹, 潘清华, 王金钖, 胡国平, 胡郁

Owner: IFLYTEK CO LTD
