Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text segmentation method and device

A text and target text technology, applied in semantic analysis, instruments, electronic digital data processing, etc., can solve the problems of complex text processing process, large amount of calculation, long time consumption, etc., to avoid deviation of text segmentation results, improve efficiency and save The effect of computing resources

Active Publication Date: 2021-03-16
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This technology has a large amount of calculation and takes a long time, which makes the text processing process more complicated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text segmentation method and device
  • Text segmentation method and device
  • Text segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0019] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0020] figure 1 An exemplary system architecture 100 of an embodiment of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text segmentation method and device, and relates to the technical field of cloud computing and text processing. The specific implementation mode comprises the steps: acquiring a target text and performing sentence segmentation processing on the target text, thus obtaining a sentence segmentation result; in response to any sentence segmentation result, determining that thelength of the sentence segmentation result exceeds a preset length threshold, and performing word segmentation processing on the sentence segmentation result to obtain at least three vocabularies; performing vocabulary combination on at least two vocabularies in the at least three vocabularies to obtain word groups; comparing the length of the current word group with a preset length threshold; and in response to the fact that the length of the word group does not exceed a preset length threshold, taking the word group as a segmentation result of the target text. According to the method, the text segmentation process can be simplified, computing resources are saved, and the text processing efficiency is improved. Moreover, the length of the text segmentation result can be controlled, and the problem that the obtained segmentation result is too long to achieve the segmentation purpose is avoided.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to the field of cloud computing and text processing technology, especially to a text segmentation method and device. Background technique [0002] Text processing technology is widely used in various technical scenarios. For example, intelligent search scenarios, man-machine dialogue scenarios, etc. Therefore, the text processing technology is often used in the scene of interacting with the user, and is closely related to the user's direct use experience, and its importance is self-evident. [0003] When processing text, it is often necessary to rely on Natural Language Processing (Natural Language Processing, NLP) technology. This technology requires a large amount of calculation and takes a long time, which makes the text processing process more complicated. Contents of the invention [0004] Provided are a text segmentation method, device, electronic equipment an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/30
CPCG06F40/289G06F40/30
Inventor 常炎隆
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products