Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for establishing chain brand word bank and category word bank

A brand and chain technology, applied in the field of geographic information, can solve problems such as low work efficiency and inability to update the lexicon in time, and achieve the effect of improving efficiency, improving efficiency and speed

Active Publication Date: 2015-03-25
ALIBABA (CHINA) CO LTD
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method of relying on manual summarization to establish a category word thesaurus and chain brand word thesaurus is not only low in work efficiency, but also cannot update the thesaurus in time once new words appear.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for establishing chain brand word bank and category word bank
  • Method and device for establishing chain brand word bank and category word bank
  • Method and device for establishing chain brand word bank and category word bank

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0046] This embodiment is based on the POI data in the POI database to carry out the training of the chain brand word recognizer, this chain brand word recognizer can divide the name backbone that comes from the POI data into chain brand words and non-chain brand words, and select from the classification results It is the backbone of the name of the chain brand word, and it is stored in the chain brand word lexicon. see figure 1 , which is a flow chart of a method for establishing a chain brand thesaurus of the present invention, the method may further comprise the steps:

[0047] Step 101: Aggregating POI data with the same name backbone in the POI database of the same city into a POI data group, wherein the POI data group corresponds to the name backbone;

[0048] "Name backbone" refers to the part of the name of POI data after removing subsidiary information such as branches and addresses. The method of distinguishing name backbone and subsidiary information is related to ...

Embodiment 2

[0099] The difference between this embodiment two and embodiment one is that after obtaining the chain brand word recognizer, the recognition accuracy of the chain brand word recognizer is further checked, if its recognition accuracy does not meet the requirements after inspection, the chain brand word recognizer The recognizer is adjusted, and then checked again, and the check and adjustment are repeated until the recognition accuracy of the chain brand word recognizer meets the requirements. see figure 2 , which is a flow chart of another chain brand lexicon building method of the present invention, the method may further comprise the steps:

[0100] Step 201: Aggregating POI data with the same name backbone in the POI database of the same city point of interest into a POI data group, the POI data group corresponds to the name backbone;

[0101] Step 202: extracting the identification features of the POI data sets from each POI data set;

[0102] The identification featur...

Embodiment 3

[0126] First two embodiments all are to mine chain brand words from POI database, because what the data in the POI database adopts is terminology, therefore, the chain brand words that excavate are all standardized names basically, and this may be different from user's usage habits. does not match. For example, the standardized name of a certain chain pharmacy is "****big pharmacy", but users may be used to calling it "****pharmacy". If the user enters the query term "****pharmacy", it will be concluded that the query term is not a chain brand Wrong result of the word. In addition, since the name of POI data is rarely a class word, it is also difficult to mine class words from the POI database. the

[0127] This embodiment is based on the query word recorded in the user query log and the clicked POI data corresponding to the query word to train a recognizer that can identify chain brand words, category words and common words, and utilize the recognizer to record in the user ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a method and device for establishing a chain brand word bank and a category word bank. In one case, training of a chain brand word recognizer is performed based on POI data in the same urban POI data bank, the chain brand word recognizer can be utilized to recognize name trunks of all POI data in the POI data bank, recognized trunks are name trunks of chain brand words and stored in the chain brand word bank; in another case, training of the chain brand word recognizer is performed based on query words recorded in user's query logs and clicked POI data corresponding to the query words, the chain brand word recognizer can be utilized to recognize all query words recorded in the user's query logs, recognized words are query words of the chain brand words and category words and are stored in the chain brand word bank and the category word bank respectively. According to the embodiment of the invention, the working efficiency is improved, and timely word bank update can be further achieved through timely excavation.

Description

technical field [0001] The invention relates to the technical field of geographic information, in particular to a method and device for establishing chain brand word thesaurus and category word thesaurus. Background technique [0002] Before using a navigation engine for route navigation, it is usually necessary to search for a destination. In the process of searching for a destination, the user first inputs a query word to the navigation engine, and the navigation engine searches several POI data matching the query word from the POI (Point of Interest) database. When the user selects a POI After receiving the data, the navigation engine performs route planning and navigation based on the POI data selected by the user. [0003] In some cases, the query term entered by the user may be a classifier that reflects a certain category. For example, "restaurant" is a classifier. Based on different dimensions, "restaurant" can be divided into "Chinese restaurant" and "Western resta...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/9535
Inventor 刘广权
Owner ALIBABA (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products