Chinese word segmentation method based on navigation information retrieval

A Chinese word segmentation and navigation information technology, applied in the navigation field, can solve problems such as inability to recognize typos, achieve good adjustment, optimize speed, and improve understanding

Active Publication Date: 2014-03-26
SHENYANG MXNAVI CO LTD
View PDF3 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such a search method requires a relatively complete name fragment of the facility, and typos cannot be identified

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese word segmentation method based on navigation information retrieval
  • Chinese word segmentation method based on navigation information retrieval

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0038] This embodiment provides a kind of Chinese word segmentation method based on navigation information retrieval, it is characterized in that: Chinese word segmentation is designed as the basis of navigation retrieval, Chinese word segmentation is for search engine, the most important thing is not to find all results, but to The results that are most in line with semantic relevance are preferably ranked first, which is also called relevance ranking; the accuracy of Chinese word segmentation directly affects the relevance ranking of search results; from a qualitative analysis, the word segmentation algorithms of search engines are different , different lexicons will affect the relevance of search results;

[0039] Using statistical methods and rule understanding methods, in a large number of texts that have been segmented, use statistical machine learning models to learn the rules of word segmentation, so as to realize the segmentation of unknown texts; combine the Chinese c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A Chinese word segmentation method based on navigation information retrieval is characterized in that a word segmentation system is obtained through the steps that a dictionary is loaded, and text code conversion is carried out; segmentation processing is carried out, and a source character string is segmented into a plurality of slightly simpler short sentences; atomic word segmentation is carried out to obtain the smallest morpheme units which cannot be segmented in the short sentences; word forming full-match is achieved with a word-by-word traversal matching method; the matching results are screened to generate a plurality of best results; human names, place names and proper nouns are processed; the dictionary is corrected, and mainly, unlisted new words are added, and properties of the existing words are improved; the processing results of all the short sentences are finally combined to be output. The Chinese word segmentation method has the advantages that content input by a user can be formed into words through the Chinese word segmentation technology, the speed can be optimized, wrongly written characters can be corrected with the words as the basis, and a more suitable result can be provided. With the Chinese word segmentation technology, semantics can be understood by an information retrieval engine better, and the provided result set can be fully adjusted.

Description

technical field [0001] The invention relates to the field of navigation, in particular to a Chinese word segmentation method based on navigation information retrieval. Background technique [0002] The dictionary information used in the current navigation name retrieval is based on single-character words, and there is only an association between single-character words in the dictionary, and there is no information such as semantic interpretation. According to the content entered by the user, it is divided into individual words for searching, and the results are sorted by rules, and finally presented to the user. Such a search method requires relatively complete name fragments of facilities, and typos cannot be identified. Contents of the invention [0003] The purpose of the present invention is to improve the semantic understanding of the information retrieval engine, fully adjust the provided result set, and especially provide a Chinese word segmentation method based on...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/36G06F16/374G06F16/951
Inventor 李潍希于航解威朱小莹
Owner SHENYANG MXNAVI CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products