Method and system for screening duplicated entity data

An entity data and database technology, applied in the Internet field, can solve the problem of ineffective identification of duplicate entity data.

Active Publication Date: 2011-04-20
TAOBAO CHINA SOFTWARE
View PDF0 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, there is no efficient method for

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for screening duplicated entity data
  • Method and system for screening duplicated entity data
  • Method and system for screening duplicated entity data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Embodiments of the present application provide a method and system for identifying duplicate entity data.

[0035] In order to enable those skilled in the art to better understand the technical solutions in the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described The embodiments are only some of the embodiments of the present application, but not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0036] The system structure involved in this application can be as follows figure 1 shown, including the server and database.

[0037] The following introduces an embodiment of the method for identifying duplicate entity ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a system for screening duplicated entity data. The method comprises the following steps that: 1, a server acquires entity data to be screened; 2, the server compares entity names between names of the entity data to be screened and names of a predetermined amount of entity data in a database one by one to acquire a score; 3, the server determines duplication of the entity data to be screened and the entity data in the compared database through a compared score and a preset standard score; and 4, the server adds the unduplicated entity data to be screened into the database. By the method, the duplicated entity data can be efficiently screened out.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a method and system for identifying duplicate entity data. Background technique [0002] Search engine technology can collect information on the Internet according to certain strategies and use specific computer programs, and provide users with retrieval services after organizing and processing the information. Since the birth of search engine technology, websites that provide search services on the Internet have further launched life search in order to better provide search services for information around users. Life search means that there are clear life information in the search engine, and the in-depth processing of life information brings great convenience to users. If you select a category of life, region and other tags, and then use a search engine, it can help search users to easily find classified life information around them. At present, there are many types of i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 莫正华
Owner TAOBAO CHINA SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products