Business address identification method and device, electronic equipment and storage medium

By integrating basic address, map address, and communication base station data, and combining base station positioning and address confidence models, the problem of discrepancies between a company's registered address and its actual business address has been solved, achieving highly reliable and accurate address identification.

CN116233754BActive Publication Date: 2026-06-23CHINA TELECOM CORP LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
CHINA TELECOM CORP LTD
Filing Date
2022-12-16
Publication Date
2026-06-23

Smart Images

  • Figure CN116233754B_ABST
    Figure CN116233754B_ABST
Patent Text Reader

Abstract

Embodiments of the present application provide a kind of operating address identification method, device, electronic equipment and storage medium, first, the basic address of target place, map address and communication base station data are acquired, wherein, communication base station data includes the correspondence communication data corresponding to all communication numbers associated with target place, then according to communication base station data, target place is positioned to base station, and the base station address corresponding to target place is obtained, then according to basic address, map address and base station address, the address of target place is identified, and the actual operating address corresponding to target place is obtained, so that by integrating the multi-source address information including communication big data, and by base station address, the secondary verification for address is realized, so that the actual operating address of target place can be effectively identified in time, and the address output finally has higher credibility and precision.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This invention relates to the field of big data analytics, and in particular to a method for identifying business addresses, a device for identifying business addresses, an electronic device, and a computer-readable storage medium. Background Technology

[0002] As we all know, when a company operates, it must conduct its business activities at its registered address. However, in reality, many companies, for various reasons, often have a discrepancy between their registered address and their actual office address. In other words, a large number of companies have a discrepancy between their registered address and their actual business address. For example, a company may initially register in a development zone or industrial park, but its actual business is not suitable for operation in the park, so it chooses to conduct its business activities in other places, resulting in a discrepancy between the registered address and the actual office address. Another example is that many small and micro enterprises have a discrepancy between their registered address and their actual business address due to factors such as business expansion, relocation, or seeking a more convenient location for office work.

[0003] Currently, business address location and identification are often based on the registered address. If the registered address is incorrect, it can easily lead to problems such as inaccurate business address location and inability to contact the corresponding business in a timely manner. In some cases, the registered address and the actual business address are even in different regions. At the same time, the business itself will also feel troubled by the address discrepancy, which can easily lead to problems such as the inability to effectively increase the business's exposure, customers being unable to accurately locate the business, and reduced communication between the business and customers. Summary of the Invention

[0004] The present invention provides a method, apparatus, electronic device, and computer-readable storage medium for identifying business addresses, in order to solve or partially solve the problem of low accuracy in address identification caused by the inability to identify the actual business address corresponding to the target location in a timely and effective manner.

[0005] This invention discloses a method for identifying a business address, the method comprising:

[0006] Obtain the basic address, map address, and communication base station data of the target location, wherein the communication base station data includes all communication data corresponding to all communication numbers associated with the target location;

[0007] Based on the communication base station data, the target location is located using a base station to obtain the base station address corresponding to the target location;

[0008] The target location is identified by the base address, the map address, and the base station address to obtain the actual operating address corresponding to the target location.

[0009] Optionally, obtaining the basic address, map address, and communication base station data of the target location includes:

[0010] Obtain an address information database, and determine the basic address corresponding to the target location from the address information database, wherein the address information database is used to store the basic addresses of each location;

[0011] Obtain the corresponding geographic information based on the name of the target location, and perform address conversion processing on the geographic information to obtain the corresponding map address;

[0012] Obtain communication base station data corresponding to the target location.

[0013] Optionally, the step of obtaining the corresponding geographical information based on the name of the target location and performing address conversion processing on the geographical information to obtain the corresponding map address includes:

[0014] The geographic information of the target location is obtained from a preset map call interface based on the name of the target location;

[0015] The geographic information is subjected to address inverse encoding to obtain the map address of the target location.

[0016] Optionally, the step of performing address inverse encoding on the geographic information to obtain the map address of the target location includes:

[0017] The geographic information is subjected to inverse address encoding to obtain a map address set of the target location;

[0018] The map address set is filtered according to the operating attributes of the target location to obtain filtered map addresses, and the consistency of the filtered map addresses is judged to determine the map address corresponding to the target location.

[0019] Optionally, before obtaining the base address, map address, and communication base station data of the target location, the method further includes:

[0020] Obtain all communication numbers associated with the target location, and collect the incoming and outgoing communication data and communication address location of each communication number;

[0021] A communication base station is established for the target location, and big data real-time analysis is performed based on the collected communication data and communication address location to obtain the communication base station data corresponding to the communication base station.

[0022] Optionally, the communication base station data includes the geographical information of the communication base station, and the step of locating the target location based on the communication base station data to obtain the base station address corresponding to the target location includes:

[0023] The geographical information of the base station is processed by address conversion to obtain the base station address corresponding to the target location.

[0024] Optionally, the base address includes at least the installation address, relocation address, and order contact address. The step of identifying the target location based on the base address, the map address, and the base station address to obtain the actual operating address corresponding to the target location includes:

[0025] The installation address, relocation address, order contact address, map address, and base station address are standardized and input into an address confidence model for address identification, and the actual operating address of the target location is output.

[0026] Optionally, the step of standardizing the installation address, relocation address, order contact address, map address, and base station address, and inputting them into an address confidence model for address identification, and outputting the actual operating address of the target location, includes:

[0027] The installation address, relocation address, order contact address, map address, and base station address are first corrected, and then input into the entity recognition model for address standardization to obtain standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address, respectively.

[0028] The standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address are input into the address confidence model for address identification, and the actual operating address of the target location is output.

[0029] Optionally, the step of inputting the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address identification, and outputting the actual operating address of the target location, includes:

[0030] The standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address are input into the address confidence model for address recognition, and the installation address confidence, relocation address confidence, order contact address confidence, map address confidence, and base station address confidence are obtained respectively.

[0031] The confidence scores of the installation address, relocation address, order contact address, map address, and base station address are weighted to obtain at least one optimized operating address.

[0032] Select the one with the highest confidence-weighted score from at least one of the optimized operating addresses as the actual operating address of the target location, and output the actual operating address.

[0033] This invention also discloses a business address identification device, the device comprising:

[0034] The target location address acquisition module is used to acquire the basic address, map address and communication base station data of the target location. The communication base station data includes all communication data of all communication numbers associated with the target location.

[0035] A base station address positioning module is used to locate the target location based on the communication base station data and obtain the base station address corresponding to the target location.

[0036] The actual business address determination module is used to identify the target location based on the base address, the map address, and the base station address to obtain the actual business address corresponding to the target location.

[0037] Optionally, the target location address acquisition module includes:

[0038] The basic address acquisition module is used to acquire an address information database and determine the basic address corresponding to the target location from the address information database. The address information database is used to store the basic addresses of each location.

[0039] The map address generation module is used to obtain the corresponding geographic information based on the name of the target location, and to perform address conversion processing on the geographic information to obtain the corresponding map address;

[0040] The communication base station data acquisition module is used to acquire communication base station data corresponding to the target location.

[0041] Optionally, the map address generation module includes:

[0042] The geographic information acquisition module is used to acquire the geographic information of the target location from a preset map call interface based on the name of the target location;

[0043] The address inverse encoding processing module is used to perform address inverse encoding processing on the geographic information to obtain the map address of the target location.

[0044] Optionally, the address inverse encoding processing module includes:

[0045] The map address set acquisition module is used to perform address reverse encoding processing on the geographic information to obtain the map address set of the target location;

[0046] The consistency discrimination module is used to filter the map address set according to the operating attributes of the target location, obtain filtered map addresses, and perform consistency discrimination on the filtered map addresses to determine the map address corresponding to the target location.

[0047] Optionally, the device further includes:

[0048] The communication data acquisition module is used to acquire all communication numbers associated with the target location, and collect the communication data and communication address location of each communication number.

[0049] The data analysis module is used to establish a communication base station for the target location, and to perform real-time big data analysis based on the collected communication data and communication address location to obtain the communication base station data corresponding to the communication base station.

[0050] Optionally, the communication base station data includes the geographical information of the communication base station, and the base station address positioning module is specifically used for:

[0051] The geographical information of the base station is processed by address conversion to obtain the base station address corresponding to the target location.

[0052] Optionally, the basic address includes at least the installation address, relocation address, and order contact address, and the actual business address determination module is specifically used for:

[0053] The installation address, relocation address, order contact address, map address, and base station address are standardized and input into an address confidence model for address identification, and the actual operating address of the target location is output.

[0054] Optionally, the actual business address determination module includes:

[0055] The address standardization processing module is used to first perform address correction processing on the installation address, the relocation address, the order contact address, the map address, and the base station address, and then input them into the entity recognition model for address standardization processing to obtain standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address, respectively.

[0056] The address recognition module is used to input the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address recognition, and output the actual operating address of the target location.

[0057] Optionally, the address recognition module includes:

[0058] The address confidence generation module is used to input the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address recognition, and obtain the installation address confidence, relocation address confidence, order contact address confidence, map address confidence, and base station address confidence respectively;

[0059] The optimized business address generation module is used to perform address confidence weighting processing using the installation address confidence, the relocation address confidence, the order contact address confidence, the map address confidence, and the base station address confidence to obtain at least one optimized business address;

[0060] The optimized business address selection module is used to select the one with the highest confidence weighted score from at least one of the optimized business addresses as the actual business address of the target location, and output the actual business address.

[0061] This invention also discloses an electronic device, including a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory communicate with each other through the communication bus;

[0062] The memory is used to store computer programs;

[0063] When the processor executes a program stored in the memory, it implements the method described in the embodiments of the present invention.

[0064] This invention also discloses a computer-readable storage medium storing instructions that, when executed by one or more processors, cause the processors to perform the methods described in this invention.

[0065] The embodiments of the present invention have the following advantages:

[0066] In this embodiment of the invention, a method for identifying a business address is provided. First, the basic address, map address, and communication base station data of the target location are obtained. The communication base station data includes all communication data of all communication numbers associated with the target location. Then, the target location is located using the communication base station data to obtain the base station address corresponding to the target location. Next, the target location is identified using the basic address, map address, and base station address to obtain the actual business address corresponding to the target location. By integrating multi-source address information, including big data from communications, and performing secondary verification of the address using the base station address, the actual business address of the target location can be identified in a timely and effective manner, resulting in a more reliable and accurate final output address. Attached Figure Description

[0067] Figure 1 This is a flowchart illustrating the steps of a business address identification method provided in an embodiment of the present invention;

[0068] Figure 2 This is an example diagram of a method for identifying an actual business address provided in an embodiment of the present invention;

[0069] Figure 3 This is a schematic diagram of a data modeling and construction process provided in an embodiment of the present invention;

[0070] Figure 4 This is a structural block diagram of a business address identification device provided in an embodiment of the present invention;

[0071] Figure 5 This is a block diagram of an electronic device provided in an embodiment of the present invention. Detailed Implementation

[0072] To make the above-mentioned objects, features and advantages of the present invention more apparent and understandable, the present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments.

[0073] As an example, businesses are required to conduct their operations at their registered address. However, in reality, many businesses, for various reasons, often have a discrepancy between their registered address and actual business address. For instance, a company might initially register in a development zone or industrial park, but its actual business may not be suitable for operation there, leading it to choose another location. Similarly, many small and micro-enterprises may have a discrepancy between their registered address and actual business address due to business expansion, relocation, or needing to work closer to their registered office. These examples exemplify how a mismatch between registered address and actual business address can occur. The following situations may occur: (1) When registering initially, the company obtains address materials through irregular channels and registers; (2) After the company is registered, it has to work in a nearby location due to business expansion; (3) The company has to change its office location due to demolition of the address where it is located; (4) The company has to change to a more affordable office location due to tight funds; (5) The company has to change to a larger office location because the original business address cannot meet its office needs due to business expansion; (6) The company changes its business address to make it more convenient for its employees to commute to and from work, etc. There are many factors that cause the registered address to be inconsistent with the actual business address. The above examples only list a few of the more common factors.

[0074] Currently, business address location and identification are often based on the registered address. If the registered address is incorrect, it can easily lead to problems such as inaccurate business address location and inability to contact the corresponding business in a timely manner. In some cases, the registered address and the actual business address are even in different regions. At the same time, the business itself will also feel troubled by the address discrepancy, which can easily lead to problems such as the inability to effectively increase the business's exposure, customers being unable to accurately locate the business, and reduced communication between the business and customers.

[0075] One of the core inventive points of this invention is to provide a method for identifying a business address. First, the basic address, map address, and communication base station data of the target location are obtained. The communication base station data includes all communication data associated with the target location. Then, base station positioning is performed on the target location based on the communication base station data to obtain the base station address corresponding to the target location. Next, address identification is performed on the target location based on the basic address, map address, and base station address to obtain the actual business address corresponding to the target location. By integrating multi-source address information, including big data from communications, and by using the base station address for secondary verification of the address, the actual business address of the target location can be identified promptly and effectively, resulting in a more reliable and accurate output address.

[0076] Reference Figure 1 The diagram illustrates a flowchart of a method for identifying a business address provided in an embodiment of the present invention, which may specifically include the following steps:

[0077] Step 101: Obtain the basic address, map address, and communication base station data of the target location. The communication base station data includes all communication data of all communication numbers associated with the target location.

[0078] In the specific implementation, the basic address, map address and communication base station data corresponding to the target location can be obtained first. The communication base station data can include the communication data of all communication numbers associated with the target location. By obtaining the address information of the target location, it is convenient to output the actual operating address of the target location through subsequent analysis.

[0079] Firstly, regarding the basic address, when identifying the address of a target location, taking the identification of a business address as an example, operator data is the core data for address identification technology. This is fundamentally because operators possess massive amounts of accurate relevant data. For a business, broadband internet, fiber optic cables, and landline phones are necessities for its business operations and office work. The operators, as the providers of these services, possess this core data. In addition to these office necessities, the actual business address can be further verified through the mobile phone base stations of company employees. Therefore, before address identification, it is necessary to integrate the massive address information from operators. For example, operator address information can mainly include the following parts:

[0080] Firstly, when businesses are conducting office operations, they will immediately apply for essential office supplies such as broadband internet, fiber optic cables, and landline phones. Since the address information requires on-site installation and testing by the operator's engineers, this address information has already been manually verified, which can ensure the accuracy of the address to a certain extent. This address can be the corresponding installation address for the business.

[0081] Secondly, for essential office equipment such as broadband internet, in addition to the initial installation, as the company's actual business address changes, a relocation application needs to be submitted to the operator. Furthermore, for the operator, a relocation application is required for each related service to ensure the normal operation of all network and telephone services. Similarly, after submitting a relocation application, the operator will conduct a secondary verification of each application. Specifically, an engineer will visit the company's designated relocation address to perform the relocation according to the service product information.

[0082] Furthermore, telecom operators are also comprehensive information service providers. Therefore, when businesses use the operator's basic network and fixed-line telephone services, they also apply for additional value-added services, including but not limited to enterprise networking, enterprise ringback tones, enterprise monitoring, and enterprise storage. Whether it's basic network, fixed-line telephone, mobile phone services, or various value-added services, corresponding orders will be generated in the operator's CRM (Customer Relationship Management) system. These orders include the company's contact person and address, enabling the operator to communicate with the company's designated personnel promptly and accurately. During the company's actual operations, this address information can continuously link with the operator. Therefore, as long as the company operates normally, this address information will continuously enter the operator's database. This address can serve as the company's corresponding order contact address, also known as the order address.

[0083] The addresses provided by the above-mentioned operators will accompany the entire lifecycle of the enterprise, including basic network and fixed-line telephone services during the enterprise's network access period, value-added services during the growth period, and relocation services. When the enterprise ceases operation, it will also apply to the operator for disconnection as soon as possible. Therefore, it is crucial to integrate the above-mentioned address information. Thus, an enterprise can have multiple address information. By analyzing the timeline and business order information, installation address, relocation address, and order contact address can be output respectively. These types of addresses can serve as the basic addresses of the enterprise.

[0084] As an optional embodiment, obtaining the base address, map address, and communication base station data of the target location can be achieved by: acquiring an address information database; determining the base address corresponding to the target location from the address information database (which stores the base addresses of various locations); then acquiring the corresponding geographic information based on the name of the target location; performing address conversion processing on the geographic information to obtain the corresponding map address; and finally acquiring the communication base station data corresponding to the target location. Specifically, for the integration of address information from various locations, an address information database can be established. This database can retrieve the base addresses of various locations, such as enterprises, merchants, and institutions, from the massive amounts of relevant data from operators, and store these base addresses in the address information database for easy retrieval later.

[0085] Furthermore, obtaining the corresponding geographic information based on the name of the target location and performing address conversion processing on the geographic information to obtain the corresponding map address can be achieved by: obtaining the geographic information of the target location from a preset map call interface based on the name of the target location, and then performing address reverse encoding processing on the geographic information to obtain the map address of the target location.

[0086] In one optional embodiment, the geographic information is subjected to address inverse encoding to obtain the map address of the target location. Specifically, the geographic information is subjected to address inverse encoding to obtain a set of map addresses of the target location. Then, the set of map addresses is filtered according to the business attributes of the target location to obtain filtered map addresses. The filtered map addresses are then subjected to consistency judgment to determine the map address corresponding to the target location. Thus, by filtering the addresses and performing consistency judgment, the obtained map addresses are made more accurate.

[0087] For example, the latitude and longitude of a company can be obtained through the API (Application Program Interface) of a mainstream map application based on its name. Then, using this latitude and longitude information, reverse geocoding can be performed to obtain relatively standard actual address information, i.e., the map address corresponding to the company. For instance, based on specified coordinates, the returned geographic information mainly consists of administrative code information, such as the province / city / county corresponding to the specified point, a description of the point's location, the nearest POI (Point of Interest) information, the nearest or conditionally suitable road information, and the latitude and longitude of the nearest point on the road to the specified point. It should be noted that "relatively standard actual address information" here refers to the latitude and longitude information obtained from mainstream map applications, which is relatively accurate and reliable, thus ensuring that the map address obtained through reverse geocoding is also relatively accurate.

[0088] The main purpose of obtaining map addresses using the above method is to directly access the interface of remote services via HTTP / HTTPS (Hypertext Transfer Protocol / Hypertext Transfer Protocol Secure) and obtain the ability to convert between structured addresses and latitude and longitude. Geocoding refers to the process of converting the detailed location description of an address or place name into latitude and longitude coordinates, while address inverse coding is the opposite, referring to the process of converting geographic coordinates into address information.

[0089] Specifically, analysis shows that current mainstream maps basically achieve full coverage of geographic information content. Therefore, geographic information data can be obtained by fusing several current mainstream maps. For example, this acquisition process can mainly include the following steps:

[0090] First, obtain the Key value from the map application development platform and determine the calling interface. After obtaining the Key value, the corresponding calling interface can be determined on the development platform according to the requirements, thereby determining the data acquisition content and scope.

[0091] The key value is located at the end of the registry structure chain. Similar to a file in the file system, it contains the actual configuration information and data used by the computer and applications during execution. Key values ​​can contain several data types to adapt to the usage needs of different environments.

[0092] Secondly, the URL (Universal Resource Locator) parameters need to be defined and encoded. As is well known, searching by name may yield at least one address with the same name, even if they correspond to different business locations. For example, searching by name might yield two address entries, one for a company and the other for a merchant. In this case, further restrictions can be set for the search. These restrictions can be entered when defining the URL parameters. For instance, after obtaining the address data of the target location, it needs to be categorized according to the industry it corresponds to, such as whether the address belongs to a company, merchant, organization, or other category. Furthermore, after further categorizing the address data according to the search restrictions, it is necessary to verify the authenticity of the address data. Therefore, data authenticity verification conditions can be added. For example, PANDAS (Python Data Analysis Library), numPy (Numerical Python, an open-source scientific computing library for Python), and MKL (Math Kernel) can be used. Data analysis modules such as the Library (mathematical kernel library) perform consistency checks using basic data analysis methods and regular expressions. Since the consistency check method used in this invention is relatively conventional, it will not be elaborated here.

[0093] For example, some implementation code of this embodiment of the invention is as follows:

[0094]

[0095]

[0096]

[0097] Furthermore, after obtaining the address data through the above process, the address data can be summarized and cleaned, and can be written to the data storage document through the IO (Input Output) port via the specified file path to realize data download. At the same time, it can be imported into the address information database to realize localized management of data assets.

[0098] Step 102: Based on the communication base station data, perform base station positioning on the target location to obtain the base station address corresponding to the target location;

[0099] Based on communication base station data, the target location can be located to obtain the corresponding base station address. Before obtaining the communication base station data of the target location, all communication numbers associated with the target location can be obtained, and the communication data and communication address location of each communication number can be collected. Then, a communication base station for the target location can be established, and big data real-time analysis can be performed based on the collected communication data and communication address location to obtain the communication base station data corresponding to the communication base station.

[0100] In practical implementation, taking enterprise address identification as an example, we can first establish the connection between the enterprise and its employees. Currently, operators have completed real-name registration for all customers, so the communication numbers registered by enterprises all have corresponding users, who can include the enterprise's responsible persons, senior executives, employees, etc. We can then link employees to the enterprise. Next, we can collect data from each communication number, mainly collecting incoming and outgoing communication data, to construct the enterprise's employee work network using this data.

[0101] Specifically, while basic employee information can be obtained from telecom operators, in reality, not all employees are registered under the corresponding company name. Therefore, to accurately locate the company's business address through work circles, it is necessary to enrich employee information and ensure the accuracy of the business address based on work circle location. Here, "work circle" refers to further linking the company with its employees to find more employees corresponding to that company. This can be achieved by using an employee's contact number as the central point, and through big data analysis of the number's communication frequency and call records based on call details and SMS messages. Then, based on the communication during working hours, the employee's work circle can be constructed. For example, working hours can be set to 9:00 AM to 6:00 PM on natural workdays within a calendar month. This allows for the acquisition and analysis of employee communication during this period, and through data modeling and analysis over nearly a month, a set of employees representing multiple employees within the company can be obtained, thus completing the construction of the employee's work circle.

[0102] Next, the company's work circle address can be located based on base station information. Specifically, base station location is the most accurate and timely method for employees to communicate using their phone numbers. This allows for the establishment of communication base stations targeting specific base stations. Real-time big data analysis can be performed based on collected communication data and address location to obtain the corresponding base station data. This can be achieved by analyzing the work circles of employees within the company (not analyzing individual employees), and obtaining base station information according to these work circles. Base station information is real-time data, and base station interactions can include multiple dimensions such as voice, SMS, and data traffic. Data can be collected in segments by day and hour, including basic information such as base station name, address, and latitude / longitude. For example, the data collection period can be consistent with the construction of the work circle, i.e., base station information from 9:00 AM to 6:00 PM on weekdays can be collected. The collected base station information is then analyzed according to the communication frequency within the employee's work circle, ultimately outputting one or more base stations that are most frequently associated with the company's employees' work circles during working hours.

[0103] Furthermore, the communication base station data can include the geographical information of the communication base stations. Therefore, to locate the target location using the communication base station data and obtain the corresponding base station address, the process can involve address conversion processing of the geographical information to obtain the base station address corresponding to the target location. Specifically, address conversion processing can be performed based on the latitude and longitude information of the communication base stations to output the base station address based on the employee's work area.

[0104] It should be noted that, unlike the previously mentioned installation address, relocation address, map address, registration address, and annual report address, the base station address corresponding to the employee's work circle changes over time, with the employee's communication activities, or with changes in the company's business address. As long as the company can communicate, its actual business address can be accurately located. Furthermore, as the company's actual business address changes, the base station with the most frequent interactions in the corresponding work circle will also change. Therefore, the base station address collected through communication base stations is accurate, effective, and timely. By combining the base station address, a secondary address verification is actually performed on the aforementioned operator address, map POI address, and other addresses, making the output of the actual business address more accurate.

[0105] Step 103: Based on the base address, the map address, and the base station address, the target location is identified to obtain the actual operating address corresponding to the target location.

[0106] In practical implementation, the target location can be identified based on the base address, map address, and base station address to obtain the actual operating address corresponding to the target location.

[0107] For example, refer to Figure 2 This illustration shows an example diagram of a method for identifying actual business addresses provided in an embodiment of the present invention. Taking enterprise address identification as an example, the addresses corresponding to enterprises can be mainly divided into three categories: operator addresses, map POI addresses, and other addresses. Among them, operator addresses, also known as basic addresses, refer to address information obtained from operators, which may include installation addresses, relocation addresses, and order addresses (i.e., order contact addresses). Map POI addresses refer to address information obtained by calling the map API interface, which may include enterprise merchant addresses (i.e., map addresses). Other addresses may include the enterprise's registered address, annual report address, etc. The embodiments provided by the present invention can obtain the actual office location of enterprise customers based on the aforementioned types of addresses. The following will explain how to obtain the actual office location of enterprise customers based on operator addresses, map POI addresses, and other addresses.

[0108] It should be noted that the address examples listed above are merely illustrative. In practical applications, the address corresponding to a company may include other addresses that can be obtained through other channels. Furthermore, in addition to using the addresses listed above for company address identification, this invention also uses the base station addresses corresponding to the communication base stations of company employees for processing. For example, the address identification processing flow is described below. Figure 3 This illustration shows a data modeling and construction process provided in an embodiment of the present invention. After obtaining the operator address (operator installation address, relocation address, order address, etc.), map POI address (enterprise merchant address), and other addresses (registration address, annual report address), the employee base station (i.e., the communication base station built for enterprise employees) can be parsed to obtain the corresponding employee work location (i.e., the base station address corresponding to the communication base station). Then, each address can be input into a module for address standardization processing to standardize each address. Next, the corresponding standardized addresses can be processed for features. Then, the processed addresses can be input into the LGBM (Light Gradient Boosting Machine, a distributed gradient boosting framework based on decision tree algorithm) model to score each address and obtain the confidence level corresponding to each address. Then, a weighted score of address confidence level is calculated based on each confidence level to achieve iterative weighted processing of the address optimization. Finally, the address with the highest weighted score of address confidence level can be used as the actual office address. Thus, through a series of processing of the source address information, the final actual office address corresponding to the enterprise can be output.

[0109] The above content is only a brief description of some steps in the address recognition process provided in the embodiments of the present invention. The address recognition process in the above content will be further explained below.

[0110] To ensure the high availability of the final output of actual business addresses, all of the above addresses need to be standardized. In practice, during the address entry stage, there may be issues such as non-standard address entry systems, typos or irrelevant information, inaccurate addresses, missing data, changes to the community / street / house number of the address's location, and unverified addresses. Therefore, most of the installation addresses and relocation addresses listed above are not standardized addresses, or are even incorrect addresses. Before standardizing these addresses, it is necessary to perform address correction and address completion procedures.

[0111] For example, the problems of excessive irrelevant information and non-standard structure can be solved by information extraction, i.e., entity recognition, thereby avoiding the problem of incomplete coverage caused by traditional keywords. For the problem of different descriptions of the same address information, as well as homophones and typos, intelligent matching methods such as entity disambiguation can be used to solve the problem. Thus, address standardization technologies such as entity recognition can be used to standardize all addresses from different sources and with non-standard descriptions, thereby enabling data interoperability between various business systems.

[0112] By standardizing addresses, the final output address data can be a detailed address that breaks down a long address into province, city, district, road, street, and even down to the point of being structured by POI (and even down to the building number). Compared to addresses that have not been standardized, this output data is much more precise. By separating and classifying each element of the address during the standardization process, the address becomes clearer and more structured, making it easier to use and manage.

[0113] As an optional embodiment, address identification of the target location is performed based on the base address, map address, and base station address to obtain the actual operating address corresponding to the target location. This can be achieved by: standardizing the installation address, relocation address, order contact address, map address, and base station address, inputting them into an address confidence model for address identification, and outputting the actual operating address of the target location.

[0114] Furthermore, the installation address, relocation address, order contact address, map address, and base station address are standardized and then input into an address confidence model for address recognition, outputting the actual operating address of the target location. This can be achieved by first performing address correction processing on the installation address, relocation address, order contact address, map address, and base station address, then inputting them into an entity recognition model for address standardization processing to obtain standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address, respectively. Finally, these standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address are input into an address confidence model for address recognition, outputting the actual operating address of the target location.

[0115] As an optional embodiment, the standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address are input into an address confidence model for address identification, outputting the actual operating address of the target location. This can be achieved by: inputting the standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address into an address confidence model for address identification, obtaining the confidence scores for the installation address, relocation address, order contact address, map address, and base station address, respectively; and then performing address confidence calculation using these confidence scores. The system processes the data to obtain at least one optimized operating address. Finally, it selects the one with the highest confidence weighted score from the at least one optimized operating address as the actual operating address of the target location and outputs the actual operating address. It should be noted that, for the sake of simplicity, this embodiment of the invention only selects the installation address, relocation address, order contact address, map address, and base station address of the target location for address identification processing. In practical applications, those skilled in the art can add other addresses, such as registration address, annual report address, etc., together with the addresses listed above as the source address for address identification processing. The methods used are the same as or similar to those involved in this embodiment of the invention. It is understood that this invention does not impose any limitations on this.

[0116] Specifically, after address standardization, the actual business address can be output by establishing a model, such as the actual office address of the enterprise in this embodiment of the invention. In order to make the final output actual office address more accurate, the address most likely to be the actual office address needs to be selected from the multiple structured addresses obtained. First, the source confidence C of multiple source address information (such as the types of addresses listed in the example above) can be determined. Then, the address confidence weighted score can be calculated according to the source confidence of each address. Finally, the address with the highest confidence weighted score can be output after model training and optimization iteration, which is the actual office address of the enterprise, i.e., the actual business address.

[0117] In practical implementation, the source confidence of each source address information can be calculated by establishing an address feedback mechanism through front-line staff of the operator, such as account managers, installation and maintenance engineers, and sales representatives. For example, front-line staff can feed back the correct actual operating address of the enterprise to the address information database through the system, thereby obtaining positive samples for training the model. Specifically, the addresses of each information source after address standardization can be used as training sets to obtain their confidence. That is, the accuracy of each address can be verified separately when relying only on a single information source address, and the address confidence of the address corresponding to each information source (also known as the source confidence of the address) can be determined by using the Bayesian parameter tuning principle. For example, assuming the registration address is a, the annual report address is b, the carrier installation address is c, the relocation address is d, the order address is e, the map merchant address is f, and the base station address corresponding to the enterprise employee is g, then the confidence scores of the above seven types of addresses can be obtained through model training, such as the confidence score of the registration address Ca, the confidence score of the annual report address Cb, the confidence score of the installation address Cc, the confidence score of the relocation address Cd, the confidence score of the order address Ce, the confidence score of the map address Cf, and the confidence score of the base station address Cg.

[0118] The Bayesian parameter tuning principle refers to a given objective function (a generalized function that only requires specifying the input and output, without needing to know its internal structure and mathematical properties), which is updated by continuously adding sample points. For example, in this embodiment of the invention, an optimized objective function can be preset for address recognition processing, and correct actual business addresses can be continuously added as sample points to continuously update the objective function in order to achieve the purpose of model training. The trained model is then used to calculate the address confidence of addresses that need to be identified in the future.

[0119] Once the address confidence level for each address is obtained, a weighted score based on the address confidence level can be calculated. In practice, the seven types of standardized addresses mentioned above typically only output one to four different address information. Therefore, based on the source confidence levels corresponding to each type of address, a weighted score based on the address confidence level can be calculated for each of these different address types.

[0120] For example, suppose that after address standardization and address confidence calculation, three different addresses appear. Address 1 originates from registration address a and annual report address b; address 2 originates from installation address c and order address e; and address 3 originates from relocation address d, map address f, and base station address g. Then: the weighted score of address 1 is "S1 = Wa × Ca + Wb × Cb", the weighted score of address 2 is "S2 = Wc × Cc + We × Ce", and the weighted score of address 3 is "S3 = Wd × Cd + Wf × Cf + Wg × Cg", where W represents the weight corresponding to each source address in the address, such as Wa representing the weight corresponding to registration address a, which can be 0.6, etc. Then, by calculating the weighted score of address confidence, a more accurate actual operating address can be obtained.

[0121] In the above processing, extreme cases may occur, such as when all source addresses are the same, in which case the weighted score can be the sum of the products of the confidence level and weight of each address, such as "S4=Wa×Ca+Wb×Cb+Wc×Cc+Wd×Cd+We×Ce+Wf×Cf+Wg×Cg". Alternatively, all source addresses may be different, in which case the scores of the seven addresses can be the products of their confidence levels and weights, such as the weighted score for registration address a being "S5=Wa×Ca", the weighted score for annual report address b being "S6=Wb×Cb", and the weighted score for installation address c being "S6=Wb×Cg". The weighted score is "S7 = Wc × Cc", the weighted score corresponding to the relocation address d is "S8 = Wd × Cd", the weighted score corresponding to the order address e is "S9 = We × Ce", the weighted score corresponding to the map address f is "S10 = Wf × Cf", the weighted score corresponding to the base station address g is "S11 = Wg × Cg", and so on. It should be noted that the examples listed above are only examples. In actual applications, other different situations may occur. Those skilled in the art can perform the corresponding calculations according to the address confidence weighted calculation method provided in the embodiments of this invention. This invention does not limit the scope of the invention.

[0122] For example, in one scenario, a company address containing all seven types of address information is standardized to obtain one or more corresponding structured addresses. The most likely actual operating address is then output based on a confidence model, typically originating from the operator's address and the map POI address. In another scenario, some companies may not have established a connection with an operator but can obtain map POI addresses and other addresses. In this case, the obtainable addresses are substituted into the model, and the most likely actual operating address is output, typically originating from the map POI address and the company's most recent annual report address. In yet another scenario, for individual companies, only other corresponding addresses (such as the registered address and annual report address) can be obtained. In this case, the model can output the most likely actual operating address, typically originating from the company's most recent annual report address. This invention does not impose any limitations on this scenario.

[0123] The obtained results can then be used to train the model. After optimization iterations, the address with the highest confidence-weighted score is output. Specifically, regardless of the number of addresses, the weighted score for each address can be obtained after address confidence calculation. The set of weight values ​​mentioned in the example above can be continuously optimized and adjusted by machine algorithms. To avoid excessive human intervention affecting the weight values, the LGBM ensemble algorithm in machine learning can be used for modeling. The LGBM ensemble algorithm is a gradient boosting framework that uses a decision tree-based learning algorithm, thus having advantages such as good training effect, low overfitting, low memory usage, and higher accuracy.

[0124] For example, the relevant code for the model training part can be:

[0125] clf=lgb.LGBMClassifier(boosting_type='gbdt',num_class=3,num_leaves=17,max_depth=6,learning_rate=0.001,n_estimators=100,subsample_for_bin=200000,objective=None,class_weight=None,min_split_gain= 0.0,min_child_weight=0.001,min_child_samples=20,subsample=1.0,subsample_freq=0,colsample_bytree=1.0,reg_alpha=0.0,reg_lambda=0.0,random_state=None,n_jobs=-1,silent=True,importance_type='split')

[0126] During model training, Bayesian parameter tuning principles can be used to adjust the parameters to obtain the optimal model. Finally, the multi-category recommendation probabilities can be converted into weighted scores, and the address information with the highest weighted score is the most likely actual business address of the enterprise. Thus, the model can accurately identify the actual business address of the enterprise.

[0127] It should be noted that the embodiments of the present invention include, but are not limited to, the examples described above. It is understood that those skilled in the art can make further settings according to actual needs under the guidance of the ideas in the embodiments of the present invention, and the present invention does not limit such settings.

[0128] In this embodiment of the invention, a method for identifying a business address is provided. First, the basic address, map address, and communication base station data of the target location are obtained. The communication base station data includes all communication data of all communication numbers associated with the target location. Then, the target location is located using the communication base station data to obtain the base station address corresponding to the target location. Next, the target location is identified using the basic address, map address, and base station address to obtain the actual business address corresponding to the target location. By integrating multi-source address information, including big data from communications, and performing secondary verification of the address using the base station address, the actual business address of the target location can be identified in a timely and effective manner, resulting in a more reliable and accurate final output address.

[0129] It should be noted that, for the sake of simplicity, the method embodiments are all described as a series of actions. However, those skilled in the art should understand that the embodiments of the present invention are not limited to the described order of actions, because according to the embodiments of the present invention, some steps can be performed in other orders or simultaneously. Furthermore, those skilled in the art should also understand that the embodiments described in the specification are preferred embodiments, and the actions involved are not necessarily essential to the embodiments of the present invention.

[0130] Reference Figure 4 The diagram shows a structural block diagram of a business address identification device provided in an embodiment of the present invention, which may specifically include the following modules:

[0131] The target location address acquisition module 401 is used to acquire the basic address, map address and communication base station data of the target location. The communication base station data includes all communication data of all communication numbers associated with the target location.

[0132] The base station address positioning module 402 is used to perform base station positioning on the target location based on the communication base station data, and obtain the base station address corresponding to the target location.

[0133] The actual business address determination module 403 is used to identify the target location based on the base address, the map address and the base station address, and obtain the actual business address corresponding to the target location.

[0134] In some optional embodiments, the target location address acquisition module 401 includes:

[0135] The basic address acquisition module is used to acquire an address information database and determine the basic address corresponding to the target location from the address information database. The address information database is used to store the basic addresses of each location.

[0136] The map address generation module is used to obtain the corresponding geographic information based on the name of the target location, and to perform address conversion processing on the geographic information to obtain the corresponding map address;

[0137] The communication base station data acquisition module is used to acquire communication base station data corresponding to the target location.

[0138] In some alternative embodiments, the map address generation module includes:

[0139] The geographic information acquisition module is used to acquire the geographic information of the target location from a preset map call interface based on the name of the target location;

[0140] The address inverse encoding processing module is used to perform address inverse encoding processing on the geographic information to obtain the map address of the target location.

[0141] In some optional embodiments, the address inverse encoding processing module includes:

[0142] The map address set acquisition module is used to perform address reverse encoding processing on the geographic information to obtain the map address set of the target location;

[0143] The consistency discrimination module is used to filter the map address set according to the operating attributes of the target location, obtain filtered map addresses, and perform consistency discrimination on the filtered map addresses to determine the map address corresponding to the target location.

[0144] In some alternative embodiments, the apparatus further includes:

[0145] The communication data acquisition module is used to acquire all communication numbers associated with the target location, and collect the communication data and communication address location of each communication number.

[0146] The data analysis module is used to establish a communication base station for the target location, and to perform real-time big data analysis based on the collected communication data and communication address location to obtain the communication base station data corresponding to the communication base station.

[0147] In some optional embodiments, the communication base station data includes the geographical information of the communication base station, and the base station address positioning module 402 is specifically used for:

[0148] The geographical information of the base station is processed by address conversion to obtain the base station address corresponding to the target location.

[0149] In some optional embodiments, the base address includes at least the installation address, relocation address, and order contact address, and the actual business address determination module 403 is specifically used for:

[0150] The installation address, relocation address, order contact address, map address, and base station address are standardized and input into an address confidence model for address identification, and the actual operating address of the target location is output.

[0151] In some optional embodiments, the actual business address determination module 403 includes:

[0152] The address standardization processing module is used to first perform address correction processing on the installation address, the relocation address, the order contact address, the map address, and the base station address, and then input them into the entity recognition model for address standardization processing to obtain standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address, respectively.

[0153] The address recognition module is used to input the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address recognition, and output the actual operating address of the target location.

[0154] In some alternative embodiments, the address recognition module includes:

[0155] The address confidence generation module is used to input the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address recognition, and obtain the installation address confidence, relocation address confidence, order contact address confidence, map address confidence, and base station address confidence respectively;

[0156] The optimized business address generation module is used to perform address confidence weighting processing using the installation address confidence, the relocation address confidence, the order contact address confidence, the map address confidence, and the base station address confidence to obtain at least one optimized business address;

[0157] The optimized business address selection module is used to select the one with the highest confidence weighted score from at least one of the optimized business addresses as the actual business address of the target location, and output the actual business address.

[0158] In this embodiment of the invention, a business address identification device is provided. This device can be applied to a business address identification method. First, it acquires the basic address, map address, and communication base station data of the target location. The communication base station data includes all communication data of all communication numbers associated with the target location. Then, it performs base station positioning on the target location based on the communication base station data to obtain the base station address corresponding to the target location. Next, it performs address identification on the target location based on the basic address, map address, and base station address to obtain the actual business address corresponding to the target location. By integrating multi-source address information, including big data from communications, and performing secondary verification of the address through the base station address, the actual business address of the target location can be identified in a timely and effective manner, and the final output address has higher credibility and accuracy.

[0159] As the device embodiment is basically similar to the method embodiment, the description is relatively simple, and relevant parts can be found in the description of the method embodiment.

[0160] In addition, this invention also provides an electronic device, including: a processor, a memory, and a computer program stored in the memory and executable on the processor. When the computer program is executed by the processor, it implements the various processes of the above-described business address identification method embodiments and achieves the same technical effect. To avoid repetition, it will not be described again here.

[0161] This invention also provides a computer-readable storage medium storing a computer program. When executed by a processor, the computer program implements the various processes of the above-described business address identification method embodiments and achieves the same technical effects. To avoid repetition, it will not be described again here. The computer-readable storage medium may be a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, etc.

[0162] Figure 5 A schematic diagram of the hardware structure of an electronic device for implementing various embodiments of the present invention.

[0163] The electronic device 500 includes, but is not limited to, components such as: a radio frequency unit 501, a network module 502, an audio output unit 503, an input unit 504, a sensor 505, a display unit 506, a user input unit 507, an interface unit 508, a memory 509, a processor 510, and a power supply 511. Those skilled in the art will understand that the electronic device structure involved in the embodiments of the present invention does not constitute a limitation on the electronic device. An electronic device may include more or fewer components than illustrated, or combine certain components, or have different component arrangements. In the embodiments of the present invention, the electronic device includes, but is not limited to, mobile phones, tablet computers, laptops, PDAs, in-vehicle terminals, wearable devices, and pedometers.

[0164] It should be understood that, in this embodiment of the invention, the radio frequency unit 501 can be used for receiving and transmitting signals during information transmission or calls. Specifically, it receives downlink data from the base station and processes it with the processor 510; additionally, it transmits uplink data to the base station. Typically, the radio frequency unit 501 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low-noise amplifier, a duplexer, etc. Furthermore, the radio frequency unit 501 can also communicate with networks and other devices through a wireless communication system.

[0165] The electronic device provides users with wireless broadband internet access through the network module 502, such as helping users send and receive emails, browse web pages, and access streaming media.

[0166] The audio output unit 503 can convert audio data received by the radio frequency unit 501 or the network module 502 or stored in the memory 509 into audio signals and output them as sound. Furthermore, the audio output unit 503 can also provide audio output related to specific functions performed by the electronic device 500 (e.g., call signal reception sound, message reception sound, etc.). The audio output unit 503 includes a speaker, a buzzer, and a receiver, etc.

[0167] Input unit 504 is used to receive audio or video signals. Input unit 504 may include a graphics processing unit (GPU) 5041 and a microphone 5042. The GPU 5041 processes image data of still images or videos acquired by an image capture device (such as a camera) in video capture mode or image capture mode. The processed image frames can be displayed on display unit 506. The image frames processed by GPU 5041 can be stored in memory 509 (or other storage medium) or transmitted via radio frequency unit 501 or network module 502. Microphone 5042 can receive sound and process such sound into audio data. The processed audio data can be converted into a format that can be transmitted to a mobile communication base station via radio frequency unit 501 in telephone call mode.

[0168] The electronic device 500 also includes at least one sensor 505, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor includes an ambient light sensor and a proximity sensor. The ambient light sensor can adjust the brightness of the display panel 5061 according to the ambient light level, and the proximity sensor can turn off the display panel 5061 and / or backlight when the electronic device 500 is moved to the ear. As a type of motion sensor, an accelerometer sensor can detect the magnitude of acceleration in various directions (generally three axes). When stationary, it can detect the magnitude and direction of gravity and can be used to identify the posture of the electronic device (such as landscape / portrait switching, related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer, tapping), etc. The sensor 505 may also include a fingerprint sensor, pressure sensor, iris sensor, molecular sensor, gyroscope, barometer, hygrometer, thermometer, infrared sensor, etc., which will not be described in detail here.

[0169] The display unit 506 is used to display information input by the user or information provided to the user. The display unit 506 may include a display panel 5061, which may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like.

[0170] User input unit 507 can be used to receive input numerical or character information, and to generate key signal inputs related to user settings and function control of electronic devices. Specifically, user input unit 507 includes a touch panel 5071 and other input devices 5072. Touch panel 5071, also known as a touch screen, can collect touch operations performed by the user on or near it (such as operations performed by the user using a finger, stylus, or any suitable object or accessory on or near touch panel 5071). Touch panel 5071 may include two parts: a touch detection device and a touch controller. The touch detection device detects the user's touch position and the signal generated by the touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts it into touch point coordinates, and sends it to the processor 510, which receives and executes commands from the processor 510. In addition, touch panel 5071 can be implemented using various types such as resistive, capacitive, infrared, and surface acoustic wave. Besides touch panel 5071, user input unit 507 may also include other input devices 5072. Specifically, other input devices 5072 may include, but are not limited to, physical keyboards, function keys (such as volume control buttons, power buttons, etc.), trackballs, mice, joysticks, etc., which will not be described in detail here.

[0171] Furthermore, the touch panel 5071 can cover the display panel 5061. When the touch panel 5071 detects a touch operation on or near it, it transmits the information to the processor 510 to determine the type of touch event. Subsequently, the processor 510 provides corresponding visual output on the display panel 5061 according to the type of touch event. It is understood that in one embodiment, the touch panel 5071 and the display panel 5061 are implemented as two independent components to realize the input and output functions of the electronic device. However, in some embodiments, the touch panel 5071 and the display panel 5061 can be integrated to realize the input and output functions of the electronic device. The specific implementation is not limited here.

[0172] Interface unit 508 serves as an interface for connecting external devices to electronic device 500. For example, external devices may include a wired or wireless headphone port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device with an identification module, an audio input / output (I / O) port, a video I / O port, a headphone port, and so on. Interface unit 508 can be used to receive input from external devices (e.g., data, power, etc.) and transmit the received input to one or more components within electronic device 500, or it can be used to transmit data between electronic device 500 and external devices.

[0173] The memory 509 can be used to store software programs and various data. The memory 509 may primarily include a program storage area and a data storage area. The program storage area may store the operating system, applications required for at least one function (such as sound playback, image playback, etc.), etc.; the data storage area may store data created based on the use of the mobile phone (such as audio data, phonebook, etc.). Furthermore, the memory 509 may include high-speed random access memory, and may also include non-volatile memory, such as at least one disk storage device, flash memory device, or other volatile solid-state storage device.

[0174] The processor 510 is the control center of the electronic device. It connects various parts of the electronic device via various interfaces and lines. By running or executing software programs and / or modules stored in the memory 509, and by calling data stored in the memory 509, it performs various functions and processes data, thereby providing overall monitoring of the electronic device. The processor 510 may include one or more processing units; preferably, the processor 510 may integrate an application processor and a modem processor. The application processor mainly handles the operating system, user interface, and applications, while the modem processor mainly handles wireless communication. It is understood that the modem processor may not be integrated into the processor 510.

[0175] The electronic device 500 may also include a power supply 511 (such as a battery) for supplying power to various components. Preferably, the power supply 511 can be logically connected to the processor 510 through a power management system, thereby enabling functions such as managing charging, discharging, and power consumption through the power management system.

[0176] In addition, the electronic device 500 includes some functional modules not shown, which will not be described in detail here.

[0177] It should be noted that, in this document, the terms "comprising," "including," or any other variations thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Unless otherwise specified, an element defined by the phrase "comprising one..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that includes that element.

[0178] Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus necessary general-purpose hardware platforms. Of course, they can also be implemented by hardware, but in many cases the former is a better implementation method. Based on this understanding, the technical solution of the present invention, in essence, or the part that contributes to the prior art, can be embodied in the form of a software product. This computer software product is stored in a storage medium (such as ROM / RAM, magnetic disk, optical disk), and includes several instructions to cause a terminal (which may be a mobile phone, computer, server, air conditioner, or network device, etc.) to execute the methods described in the various embodiments of the present invention.

[0179] The embodiments of the present invention have been described above with reference to the accompanying drawings. However, the present invention is not limited to the specific embodiments described above. The specific embodiments described above are merely illustrative and not restrictive. Those skilled in the art can make many other forms under the guidance of the present invention without departing from the spirit and scope of the claims, and all of these forms are within the protection scope of the present invention.

[0180] Those skilled in the art will recognize that the units and algorithm steps of the various examples described in conjunction with the embodiments disclosed in this invention can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are implemented in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of this invention.

[0181] Those skilled in the art will understand that, for the sake of convenience and brevity, the specific working processes of the systems, devices, and units described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be repeated here.

[0182] In the embodiments provided in this application, it should be understood that the disclosed apparatus and methods can be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. For instance, the division of units is only a logical functional division, and in actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. Furthermore, the coupling or direct coupling or communication connection shown or discussed may be through some interfaces; the indirect coupling or communication connection between apparatuses or units may be electrical, mechanical, or other forms.

[0183] The units described as separate components may or may not be physically separate. The components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected to achieve the purpose of this embodiment according to actual needs.

[0184] In addition, the functional units in the various embodiments of the present invention can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit.

[0185] If the aforementioned functions are implemented as software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this invention, essentially, or the part that contributes to the prior art, or a portion of the technical solution, can be embodied in the form of a software product. This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of this invention. The aforementioned storage medium includes various media capable of storing program code, such as USB flash drives, portable hard drives, ROM, RAM, magnetic disks, or optical disks.

[0186] Furthermore, the embodiments of the present invention may involve the use of user data. In practical applications, user-specific personal data may be used in the scheme described herein within the scope permitted by applicable laws and regulations, provided that it complies with the applicable laws and regulations of the country in which it is located (e.g., with the user's explicit consent, with the user being properly notified, etc.).

[0187] The above description is merely a specific embodiment of the present invention, but the scope of protection of the present invention is not limited thereto. Any variations or substitutions that can be easily conceived by those skilled in the art within the technical scope disclosed in the present invention should be included within the scope of protection of the present invention. Therefore, the scope of protection of the present invention should be determined by the scope of the claims.

Claims

1. A method for identifying a business address, characterized in that, include: Obtain the basic address, map address, and communication base station data of the target location, wherein the communication base station data includes all communication data corresponding to all communication numbers associated with the target location; Based on the communication base station data, the target location is located using a base station to obtain the base station address corresponding to the target location; The target location is identified by the base address, the map address, and the base station address to obtain the actual operating address corresponding to the target location. The base address includes at least the installation address, relocation address, and order contact address. The step of identifying the target location based on the base address, the map address, and the base station address to obtain the actual operating address corresponding to the target location includes: The installation address, relocation address, order contact address, map address, and base station address are standardized and input into an address confidence model for address identification, and the actual operating address of the target location is output. The process of standardizing the installation address, relocation address, order contact address, map address, and base station address, and inputting them into an address confidence model for address identification, and outputting the actual operating address of the target location, includes: The installation address, relocation address, order contact address, map address, and base station address are first corrected, and then input into the entity recognition model for address standardization to obtain standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address, respectively. The standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address are input into the address confidence model for address identification, and the actual operating address of the target location is output. The step of inputting the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address identification, and outputting the actual operating address of the target location, includes: The standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address are input into the address confidence model for address recognition, and the installation address confidence, relocation address confidence, order contact address confidence, map address confidence, and base station address confidence are obtained respectively. The confidence scores of the installation address, relocation address, order contact address, map address, and base station address are weighted to obtain at least one optimized operating address. Select the one with the highest confidence-weighted score from at least one of the optimized operating addresses as the actual operating address of the target location, and output the actual operating address.

2. The method according to claim 1, characterized in that, The acquisition of the target location's basic address, map address, and communication base station data includes: Obtain an address information database, and determine the basic address corresponding to the target location from the address information database, wherein the address information database is used to store the basic addresses of each location; Obtain the corresponding geographic information based on the name of the target location, and perform address conversion processing on the geographic information to obtain the corresponding map address; Obtain communication base station data corresponding to the target location.

3. The method according to claim 2, characterized in that, The step of obtaining the corresponding geographical information based on the name of the target location and performing address conversion processing on the geographical information to obtain the corresponding map address includes: The geographic information of the target location is obtained from a preset map call interface based on the name of the target location; The geographic information is subjected to address inverse encoding to obtain the map address of the target location.

4. The method according to claim 3, characterized in that, The step of performing address inverse encoding on the geographic information to obtain the map address of the target location includes: The geographic information is subjected to inverse address encoding to obtain a map address set of the target location; The map address set is filtered according to the operating attributes of the target location to obtain filtered map addresses, and the consistency of the filtered map addresses is judged to determine the map address corresponding to the target location.

5. The method according to claim 1, characterized in that, Before obtaining the base address, map address, and communication base station data of the target location, the method further includes: Obtain all communication numbers associated with the target location, and collect the incoming and outgoing communication data and communication address location of each communication number; A communication base station is established for the target location, and big data real-time analysis is performed based on the collected communication data and communication address location to obtain the communication base station data corresponding to the communication base station.

6. The method according to claim 5, characterized in that, The communication base station data includes the geographical information of the communication base stations. The step of locating the target location based on the communication base station data to obtain the base station address corresponding to the target location includes: The geographical information of the base station is processed by address conversion to obtain the base station address corresponding to the target location.

7. A device for identifying a business address, characterized in that, include: The target location address acquisition module is used to acquire the basic address, map address and communication base station data of the target location. The communication base station data includes all communication data of all communication numbers associated with the target location. A base station address positioning module is used to locate the target location based on the communication base station data and obtain the base station address corresponding to the target location. The actual business address determination module is used to identify the target location based on the base address, the map address, and the base station address to obtain the actual business address corresponding to the target location. The basic address includes at least the installation address, relocation address, and order contact address. The actual business address determination module is specifically used for: The installation address, relocation address, order contact address, map address, and base station address are standardized and input into an address confidence model for address identification, and the actual operating address of the target location is output. The actual business address determination module includes: The address standardization processing module is used to first perform address correction processing on the installation address, the relocation address, the order contact address, the map address, and the base station address, and then input them into the entity recognition model for address standardization processing to obtain standard installation address, standard relocation address, standard order contact address, standard map address, and standard base station address, respectively. The address recognition module is used to input the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address recognition, and output the actual operating address of the target location; The address recognition module includes: The address confidence generation module is used to input the standard installation address, the standard relocation address, the standard order contact address, the standard map address, and the standard base station address into the address confidence model for address recognition, and obtain the installation address confidence, relocation address confidence, order contact address confidence, map address confidence, and base station address confidence respectively; The optimized business address generation module is used to perform address confidence weighting processing using the installation address confidence, the relocation address confidence, the order contact address confidence, the map address confidence, and the base station address confidence to obtain at least one optimized business address; The optimized business address selection module is used to select the one with the highest confidence weighted score from at least one of the optimized business addresses as the actual business address of the target location, and output the actual business address.

8. An electronic device, characterized in that, It includes a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory communicate with each other through the communication bus; The memory is used to store computer programs; When the processor executes a program stored in the memory, it implements the method as described in any one of claims 1-6.

9. A computer-readable storage medium having instructions stored thereon that, when executed by one or more processors, cause the processors to perform the method as described in any one of claims 1-6.