Method and device for establishing chain brand word bank and category word bank

A brand and chain technology, applied in the field of geographic information, can solve problems such as low work efficiency and inability to update the lexicon in time, and achieve the effect of improving efficiency, improving efficiency and speed

Active Publication Date: 2015-03-25
ALIBABA (CHINA) CO LTD
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method of relying on manual summarization to establish a category word thesaurus and chain brand wor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for establishing chain brand word bank and category word bank
  • Method and device for establishing chain brand word bank and category word bank
  • Method and device for establishing chain brand word bank and category word bank

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0045] Example one

[0046] In this embodiment, a chain brand word recognizer is trained based on the POI data in the POI database. The chain brand word recognizer can divide the name stems derived from the POI data into chain brand words and non-chain brand words, and filter the classification results Out is the backbone of the name of the chain brand word, and it is stored in the chain brand word database. See figure 1 , Which is a flowchart of a method for establishing a chain brand word database of the present invention, the method includes the following steps:

[0047] Step 101: Aggregate POI data with the same name backbone in the same city POI database into a POI data group, wherein the POI data group corresponds to the name backbone;

[0048] "Name backbone" refers to the part of the POI data name after removing the branch and address and other subsidiary information. The way of distinguishing the name backbone and subsidiary information is related to the POI data format. I...

Example Embodiment

[0098] Example two

[0099] The difference between the second embodiment and the first embodiment is that after the chain brand word recognizer is obtained, the recognition accuracy of the chain brand word recognizer is further checked. If the recognition accuracy of the chain brand word recognizer is checked, the recognition accuracy of the chain brand word The recognizer is adjusted, and then another test is performed, and the check and adjustment are repeated continuously until the recognition accuracy of the chain brand word recognizer meets the requirements. See figure 2 , Which is a flowchart of another method for establishing a chain brand word database of the present invention, the method includes the following steps:

[0100] Step 201: Aggregate POI data with the same name backbone in the POI database of points of interest in the same city into a POI data group, where the POI data group corresponds to the name backbone;

[0101] Step 202: Extract the identification feature...

Example Embodiment

[0125] Example three

[0126] The first two embodiments are mining chain brand words from the POI database. Since the data in the POI database uses terms, the chain brand words mined are basically standardized names, which may be related to the user's usage habits. Does not match. For example, the standardized name of a chain pharmacy is "**大药房", and the user may be accustomed to call it "**pharmacy". If the user enters the query term "**pharmacy", it will be concluded that the query term is not a chain brand The wrong result of the word. In addition, since the name of POI data is rarely a category word, it is also difficult to mine category words from the POI database.

[0127] In this embodiment, a recognizer that can recognize chain brand words, category words, and common words is trained based on the query words recorded in the user query log and the clicked POI data corresponding to the query words, and the recognizer is used to check the data recorded in the user query lo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a method and device for establishing a chain brand word bank and a category word bank. In one case, training of a chain brand word recognizer is performed based on POI data in the same urban POI data bank, the chain brand word recognizer can be utilized to recognize name trunks of all POI data in the POI data bank, recognized trunks are name trunks of chain brand words and stored in the chain brand word bank; in another case, training of the chain brand word recognizer is performed based on query words recorded in user's query logs and clicked POI data corresponding to the query words, the chain brand word recognizer can be utilized to recognize all query words recorded in the user's query logs, recognized words are query words of the chain brand words and category words and are stored in the chain brand word bank and the category word bank respectively. According to the embodiment of the invention, the working efficiency is improved, and timely word bank update can be further achieved through timely excavation.

Description

technical field [0001] The invention relates to the technical field of geographic information, in particular to a method and device for establishing chain brand word thesaurus and category word thesaurus. Background technique [0002] Before using a navigation engine for route navigation, it is usually necessary to search for a destination. In the process of searching for a destination, the user first inputs a query word to the navigation engine, and the navigation engine searches several POI data matching the query word from the POI (Point of Interest) database. When the user selects a POI After receiving the data, the navigation engine performs route planning and navigation based on the POI data selected by the user. [0003] In some cases, the query term entered by the user may be a classifier that reflects a certain category. For example, "restaurant" is a classifier. Based on different dimensions, "restaurant" can be divided into "Chinese restaurant" and "Western resta...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/9535
Inventor 刘广权
Owner ALIBABA (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products