Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Word segmentation method and apparatus

A word segmentation method and word segmentation technology, applied in the field of data processing, can solve problems such as unsatisfactory word segmentation effect, achieve the effect of improving word segmentation effect, improving matching degree and accuracy

Inactive Publication Date: 2016-01-20
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing word segmentation method usually uses a third-party word segmentation dictionary to segment POI, but the word segmentation effect is not ideal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word segmentation method and apparatus
  • Word segmentation method and apparatus
  • Word segmentation method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar modules or modules having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention. On the contrary, the embodiments of the present invention include all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0018] figure 1 It is a schematic flow chart of a word segmentation method proposed by an embodiment of the present invention, and the method includes:

[0019] S11: Establish an initial word segmentation dictionary based on existing entries.

[0020] For example, an initial word segmentation dictionary is composed of all existing lexical entries, and each entry in the initia...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a word segmentation method and apparatus. The word segmentation method comprises: establishing an initial word segmentation dictionary according to existing entries; obtaining a first entry set, selecting entries meeting a preset condition from the first entry set, obtaining the word segmentation dictionary, performing word segmentation on the entries meeting the preset condition by adopting the obtained word segmentation dictionary, and updating the obtained word segmentation dictionary by using the entries subjected to the word segmentation, wherein the initial first entry set consists of the existing entries, and the initially obtained word segmentation dictionary is the initial word segmentation dictionary; performing word segmentation on the entries in the first entry set by adopting the updated word segmentation dictionary a second entry set is obtained according to the entries subjected to word segmentation; and when determining that a convergence condition is met, obtaining a word segmentation result according to the second entry set. The method is capable of improving word segmentation effect.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to a word segmentation method and device. Background technique [0002] In map navigation products, it is often necessary to search for information points (PointOfInterest, POI). In a geographic information system, a POI can be a house, a shop, a mailbox, a bus stop, and so on. Since the place name input by the user is usually not a standard POI, it is difficult to obtain the query result required by the user by directly matching the POI in the database. In order to obtain the query results required by the user, post-processing matching is usually performed to obtain a fuzzy approximate result as the query result. POIs in the database need to be segmented during post-processing matching, and the performance of word segmentation directly affects the result of post-processing matching. [0003] The existing word segmentation method usually uses a third-party word seg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F17/30
Inventor 穆向禹彭守业
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products