Supercharge Your Innovation With Domain-Expert AI Agents!

Automatic hotel matching method based on text information extraction

A text information and hotel technology, applied in the information field, can solve the problems of inability to match, difficult to control the accuracy of fuzzy matching, and different ways of expressing address information, so as to achieve the effect of improving recall rate, avoiding interference, and improving robustness.

Active Publication Date: 2017-06-30
北京众荟信息技术股份有限公司
View PDF7 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] ●Fuzzy matching accuracy is difficult to control;
[0004] ●The hotel names are expressed in different ways, resulting in incompatibility;
[0005] ●Address information is expressed in different ways, resulting in inability to match;
[0006] ●The granularity of hotel city expression is different, resulting in incompatibility;
[0007] ●The hotel phone numbers are expressed in different ways, resulting in incompatibility

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic hotel matching method based on text information extraction
  • Automatic hotel matching method based on text information extraction
  • Automatic hotel matching method based on text information extraction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The present invention is further illustrated below by means of examples, but the present invention is not limited to the scope of the examples.

[0043] Because the hotel name and hotel address are processed in basically the same way, they are put together for explanation. Steps 1-3 in the following steps are the general processing methods for hotel name and hotel address. During specific implementation, the hotel name can be processed in steps 1-3 first, then the hotel address can be processed in 1-3, and finally step 4 is performed.

[0044] Step 1 Text normalization

[0045] Text normalization has two meanings. One is to convert different texts with the same meaning into a unified format, and the other is to delete meaningless content in the text that interferes with subsequent processing. Normalized processing reduces the burden on subsequent analysis. The specific standardized contents include:

[0046] 1. Unify Chinese and English punctuation.

[0047] 2. Unif...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic hotel matching method based on text information extraction. The method includes the steps that 1, element extraction is conducted on hotel information of a target hotel, and element extraction is conducted on hotel information of a hotel to be matched; 2, according to extracted elements, a decision-tree algorithm is adopted to calculate the matching degree between the target hotel and the hotel to be matched; the method for conducting the element extraction on a hotel name and a hotel address in the hotel information includes the steps that 1, normalization processing is conducted on a Chinese character sequence, wherein the Chinese character sequence is the hotel name or the hotel address; 2, word segmentation is conducted on normalized text, and a word sequence is obtained; 3, the element extraction is conducted on the wore sequence, and element categories are marked. According to the method, the robustness of the matching is improved, and the disturbance of useless information on the matching process is avoided.

Description

technical field [0001] The invention belongs to the field of information technology, and relates to technical fields such as online travel websites, price comparison platforms, hotel information aggregation, and automatic acquisition of crawler links, and in particular to an automatic hotel matching method based on text information extraction. Background technique [0002] With the rapid development of online travel websites, online hotel booking platforms are gathering, and multiple platforms have launched price comparison functions. In order to compare prices, it is first necessary to determine the matching relationship of hotels on different platforms. In order to reduce the cost of manual matching, most of them use automatic matching methods. However, the traditional matching method using strings has the following disadvantages: [0003] ●Fuzzy matching accuracy is difficult to control; [0004] ●The hotel names are expressed in different ways, resulting in incompatibil...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/3332G06F16/3344G06F16/9537
Inventor 张猛杨洪伟林小俊陈文哲
Owner 北京众荟信息技术股份有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More