Word segmentation processing method and device, mobile terminal and computer readable storage medium

A word segmentation processing and word segmentation technology, which is applied in computer parts, computing, electrical digital data processing, etc., can solve problems such as low accuracy and poor user experience, and achieve the effects of improving accuracy, saving storage space, and improving user experience

Active Publication Date: 2018-03-23
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF7 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current word segmentation model is often obtained after training the corpus of a specific language, so when performing word segmentation processing for other languages, the accuracy is low and the user experience is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word segmentation processing method and device, mobile terminal and computer readable storage medium
  • Word segmentation processing method and device, mobile terminal and computer readable storage medium
  • Word segmentation processing method and device, mobile terminal and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0023] Specifically, the various embodiments of the present invention aim at the prior art, usually using a statistical-based word segmentation model or a dictionary-based word segmentation model to perform word segmentation processing on sentences to be segmented. However, the current word segmentation models often use specific language It is obtained after corpus training, so when performing word segmentation processing for other languages, the accuracy is low and the user experience is poor. A word segmentation processing metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a word segmentation processing method and device, a mobile terminal and a computer readable storage medium. The method comprises the following steps of: when a to-be-segmentedstatement is obtained, determining a target language type corresponding to the to-be-segmented statement; respectively first feature vectors corresponding individual characters, second feature vectorscorresponding to two words and third feature vectors corresponding to proper nouns in the to-be-segmented statement; determining current fourth feature vectors of the individual characters accordingto the first feature vectors, the second feature vectors and the third feature vectors; and carrying out word segmentation on the to-be-segmented statement according to a preset Chinese character label transfer matrix and the current fourth feature vectors of the individual characters. According to the method, word segmentation is carried out on to-be-segmented statements according to target language types corresponding to the to-be-segmented statements, so that the correctness of carrying out word segmentation on to-be-segmented statements in various language types is improved; and proper resources can be loaded according to requirements, so that storage spaces of mobile terminals are saved and the user experience is improved.

Description

technical field [0001] The present invention relates to the technical field of word segmentation processing, in particular to a word segmentation processing method, device, mobile terminal and computer-readable storage medium. Background technique [0002] With the continuous development of computer technology, word segmentation technology has been widely used in search engines, machine translation, speech synthesis, automatic summarization and other fields. Among them, the word segmentation technology refers to the technology of dividing a sentence or a paragraph of text into words one by one. [0003] In the prior art, a word segmentation model based on statistics or a word segmentation model based on a dictionary is usually used to perform word segmentation processing on the sentence to be word segmented. However, the current word segmentation model is often obtained after training on corpus of a specific language, so when performing word segmentation processing on other...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/284G06F18/2411
Inventor 肖求根郑利群詹金波邓卓彬何径舟
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products