Chinese word segmentation method and device

A Chinese word segmentation and text technology, applied in the field of search engines, can solve the problem of low accuracy of Chinese word segmentation, and achieve the effect of solving the low accuracy and improving the accuracy of word segmentation

Inactive Publication Date: 2018-11-06
DATAGRAND TECH INC
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The main purpose of this application is to provide a Chinese word segmentation method a

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese word segmentation method and device
  • Chinese word segmentation method and device
  • Chinese word segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0030] In order to make those skilled in the art better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only The embodiments are part of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the scope of protection of the present application.

[0031] It should be noted that the terms "first", "second", etc. in the description and claims of the present application and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable un...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese word segmentation method and device. The method comprises the steps of receiving first target text information sent by a user; carrying out data mapping on the firsttarget text information through a first classifier to obtain corresponding first target category information; and performing preset inquiry operation according to the first target category informationand returning an inquiry result to the user. In a mode that the first target text information sent by the user is subjected to the data mapping through the first classifier, the corresponding first target category information is obtained, so that the purpose of performing preset inquiry operation according to the first target category information is achieved, the technical effect of improving theword segmentation accuracy is achieved, and the problem of low accuracy of Chinese word segmentation in related technologies is solved.

Description

technical field [0001] This application relates to the field of search engines, in particular, to a Chinese word segmentation method and device. Background technique [0002] Search engines are based on a structure called an inverted index. The inverted index is a structure of <key, value>, and the key value in this structure directly affects the accuracy, recall rate, and speed of the entire search engine. Let's take a look at what happens if we don't use Chinese word segmentation. [0003] Assuming that Chinese word segmentation is not used, a single Chinese character index can be used. For example, for Daguan, the word 'Da' is first indexed, and then the word 'Guan' is indexed. Similarly, for an article, first index all Chinese characters separately and record their positions. In the search process, first find all the documents of the word 'Da', then find all the documents of the word 'Guan', and then do the cross 'AND' operation, that is, only the documents con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
CPCG06F40/284
Inventor 王江高翔纪达麒陈运文
Owner DATAGRAND TECH INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products