A Method for Discriminating Identity of Real Estate Data from Different Information Sources

A discrimination method and identity technology, applied in network data retrieval, data processing applications, network data indexing and other directions, can solve the problems of deduplication of house data and reducing the accuracy of judgment.

Inactive Publication Date: 2021-02-02
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] However, at present, there is no method to deduplicate housing data from multi-source real estate websites
The housing data published by the real estate transaction website is semi-structured data, which contains rich housing characteristics, such as the residential area, area, floor, etc. If the web page text is used to judge, the accuracy of the judgment will be reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Discriminating Identity of Real Estate Data from Different Information Sources
  • A Method for Discriminating Identity of Real Estate Data from Different Information Sources
  • A Method for Discriminating Identity of Real Estate Data from Different Information Sources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0145] This embodiment describes the implementation of a method for identifying the identity of real estate data from different information sources according to the present invention.

[0146] The implementation diagram is as follows figure 1 As shown in the system architecture, figure 2 It is a system processing flow of a method for discriminating identity of real estate data from different information sources in the present invention. The data collection system and the data analysis system of the present invention belong to the intermediate links of real estate data processing. Among them, the data collection system collects real estate transaction data from various real estate transaction websites, including district data, urban area data, housing data, housing transaction data, etc., and stores them in the real estate basic database.

[0147] Using the method proposed by the present invention, the house data in the real estate basic database is deduplicated, the process...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for discriminating house property data identity of different information sources, and belongs to the technical field of internet data analysis and mining. The identity discrimination method is based on the house data published by Lianjia, 5j5j, Zhongyuan and Maitian websites and the correlation thereof. The characteristics of the house data are analyzed, the repeated house data are removed through the three steps of regional duplicate removal, community duplicate removal and house duplicate removal, the house data are described according to the characteristicsof actual house objects, and although the described angles and modes are different, the data have very high correlation. According to the method, the house data from different websites can be duplicated, the identity of the house data from different information sources can be accurately and efficiently judged, the duplicated areas and cells can be effectively removed, the effective fusion of thehouse data oriented to multi-source heterogeneity can be achieved, and the clean and orderly data are provided for the real estate market analysis.

Description

technical field [0001] The invention relates to a method for discriminating identity of real estate data from different information sources, and belongs to the technical field of Internet data analysis and mining. Background technique [0002] Real estate is an important carrier of the national economy and an extremely important pillar industry in our country. The state of the real estate market and price trends are not only related to the overall development of the national economy, but also affect and affect people's living standards. In recent years, the real estate market has become the focus and hot spot of social attention. [0003] How to strengthen the monitoring of the real estate market and analyze the trend of real estate prices has become an important issue. With the gradual success of my country's real estate market, the core position of the second-hand housing market has become increasingly prominent, and its ability to dominate the entire market has gradually...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/951G06F16/9535G06F16/2458G06F16/215G06Q50/16
CPCG06Q50/16
Inventor 刘春阳张旭王鹏姜越张华平张吴波张宝华
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products