A method and device for information word segmentation

A word segmentation method and word segmentation technology, applied in the field of information processing, can solve problems such as poor word segmentation performance and low word segmentation accuracy of word segmentation devices, and achieve the effect of improving efficiency and reducing quantity

Active Publication Date: 2019-01-18
UNION MOBILE PAY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention provides an information word segmentation method and device, which are used to solve the problems of low word segmentation accuracy and poor word segmentation performance of the word segmenter in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for information word segmentation
  • A method and device for information word segmentation
  • A method and device for information word segmentation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some embodiments of the present invention, rather than all embodiments . Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0056] An embodiment of the present invention provides a method for information word segmentation, such as figure 1 shown, including:

[0057] Step 101, obtaining the target text that needs word segmentation;

[0058] Step 102, according to the feature information in the target text, determine the preliminary word segmentation text corresponding to the target text;

[0059] Step 103, performing word segmentation on the preliminary word s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an information word segmentation method and apparatus, and relates to the field of information processing. The method comprises the steps of obtaining a target text required to be subjected to word segmentation; determining a preliminary word segmentation text corresponding to the target text according to characteristic information in the target text; and performing word segmentation on the preliminary word segmentation text according to specific characters to obtain a word segmentation result of the target text, wherein the target text is composed of the characteristic information. Through embodiments of the method and the apparatus, the target text can be correctly subjected to the word segmentation without a dictionary base and a corpus base, so that the word segmentation precision is improved and the word segmentation speed of the target text is increased.

Description

technical field [0001] The present invention relates to the field of information processing, in particular to an information word segmentation method and device. Background technique [0002] Word segmentation refers to dividing a word sequence into meaningful word sequences, also known as word segmentation. The process of automatically converting the word strings that make up the text into word strings by the word segmentation system is called automatic segmentation. Relatively speaking, there are spaces and other symbols between words in Indo-European languages ​​such as English, so segmentation is easier. The Chinese text is a continuous string of Chinese characters, and there is no clear separation mark between words. The word segmentation system needs to automatically identify word boundaries and divide the string of Chinese characters into correct word strings. [0003] The Chinese word breaker belongs to the technical category of natural language processing, and the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27
CPCG06F40/216G06F40/284
Inventor 唐翌飞陈瑛绮吴锋海赵晓庆
Owner UNION MOBILE PAY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products