Supercharge Your Innovation With Domain-Expert AI Agents!

Text word segmentation method and device

A word segmentation method and word segmentation technology, which is applied in the fields of instruments, electronic digital data processing, calculation, etc., can solve the problems of affecting the retrieval hit rate and large granularity of word segmentation, and achieve the effect of improving the retrieval hit rate and moderate granularity

Pending Publication Date: 2022-04-22
BEIJING JINTI TECH CO LTD
View PDF13 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the Chinese word segmentation method used in search engines is word segmentation through the word segmenter, but the word segmenter relies too much on the dictionary. If the dictionary is not fully covered, the word segmentation result will be a single word or longer, that is, the word segmentation granularity is too large or too small , thus affecting the retrieval hit rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text word segmentation method and device
  • Text word segmentation method and device
  • Text word segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] In order to enable those skilled in the art to better understand the technical solutions in the embodiments of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described The embodiments are only some of the embodiments of the present invention, but not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in the embodiments of the present invention shall fall within the protection scope of the embodiments of the present invention.

[0017] The specific implementation of the embodiments of the present invention will be further described below in conjunction with the accompanying drawings of the embodiments of the present invention.

[0018] refer to Figure 1A , shows a flow chart of the steps of the text word segmentatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a text word segmentation method and device, and relates to the technical field of natural language processing. The method comprises the steps of obtaining a coarse-grained word segmentation result and a fine-grained word segmentation result of a text to be subjected to word segmentation; traversing coarse-grained segmented words in the coarse-grained word segmentation result to determine the character length of the coarse-grained segmented words; according to the character length of the coarse-grained word segmentation, correcting the coarse-grained word segmentation result to obtain a corrected coarse-grained word segmentation result; and determining a final word segmentation result of the text according to the fine-grained word segmentation result and the corrected coarse-grained word segmentation result. According to the scheme, the moderate word segmentation granularity of the text can be effectively ensured, so that the word segmentation accuracy of the text is effectively improved.

Description

technical field [0001] The embodiments of the present application relate to the technical field of natural language processing, and in particular to a text word segmentation method, device, electronic equipment, and computer storage medium. Background technique [0002] In the information age with the rapid development of the Internet, search engines are one of the powerful means for people to obtain effective information. The focus of Chinese search engines is on the extraction of Chinese key information, and the difficulty is automatic Chinese word segmentation. A good Chinese word segmentation method can effectively help search engines increase the accuracy and timeliness of information retrieval. [0003] At present, the Chinese word segmentation method used in search engines is word segmentation through the word segmenter, but the word segmenter relies too much on the dictionary. If the dictionary is not fully covered, the word segmentation result will be a single word...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/284
CPCG06F40/284
Inventor 李刚
Owner BEIJING JINTI TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More