Homonymous cell distinguishing method and system based on text similarity

A text similarity and cell technology, applied in the field of information processing, can solve the problems of high misjudgment frequency and reduce the discrimination accuracy, and achieve the effect of improving the discrimination accuracy.

Pending Publication Date: 2020-06-16
青梧桐有限责任公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, in the identification method of the same-named community above, when there is an alias with a textual similarity of less than 90% in a certain community, or when the textual similarity of the names of two different communities exceeds 90%, there will be a high frequency of misjudgment. Greatly reduced the accuracy of discrimination

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Homonymous cell distinguishing method and system based on text similarity
  • Homonymous cell distinguishing method and system based on text similarity
  • Homonymous cell distinguishing method and system based on text similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] Various exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings. It should be noted that the relative arrangements of components and steps, numerical expressions and numerical values ​​set forth in these embodiments do not limit the scope of the present invention unless specifically stated otherwise.

[0049] The following description of at least one exemplary embodiment is merely illustrative in nature and in no way taken as limiting the invention, its application or uses.

[0050] Techniques, methods and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods and devices should be considered part of the description.

[0051] In all examples shown and discussed herein, any specific values ​​should be construed as exemplary only, and not as limitations. Therefore, other instances of the exemplary embodiment may have diffe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a homonymous cell distinguishing method and system based on text similarity. The method comprises the following steps: acquiring a first name of a first cell to be distinguished and a second name of a second cell to be distinguished; acquiring first attribute information of the first cell to be distinguished and second attribute information of the second cell to be distinguished; when the first basic information is the same as the second basic information, determining the distance between the first to-be-distinguished cell and the second to-be-distinguished cell according to the first longitude and latitude information and the second longitude and latitude information; when the distance is smaller than or equal to a preset threshold value, calculating text similarity between the first name and the second name; and determining a discrimination result according to the text similarity and the distance between the two cells. Due to the fact that the distance betweenthe to-be-distinguished cells and the text similarity between the cell names are comprehensively considered, misjudgment caused by alias of the to-be-distinguished cells or the same name of the to-be-distinguished cells can be avoided, and distinguishing accuracy is effectively improved.

Description

technical field [0001] The present invention relates to the technical field of information processing, and more specifically, to a method and system for identifying a cell with the same name based on text similarity. Background technique [0002] With the rapid popularization and development of the Internet, a large number of housing rental and sales platforms have emerged. The real estate agent publishes the listing information on various rental and sales platforms, so that users can find the required listing information on the listing website by setting filter conditions. [0003] However, in some application scenarios, if the alias of community A is community B, different real estate agents may use different community names when publishing the listing information, which causes users to search for listing information. It is impossible to tell whether the two are the same housing source; in addition, in another application scenario, if there are two communities with the sa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/33G06F16/9537G06F40/289G06K9/62
CPCG06F16/9537G06F16/3331G06F18/214
Inventor 朱晨晓李昭陈浩高靖崔岩卢述奇陈呈张宵
Owner 青梧桐有限责任公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products