A monitoring system, method, apparatus, device, and medium
By using the central control equipment to identify normally operating proxy devices based on the status information collected from the proxy devices, and assigning access address information, the problem of excessive load and instability of proxy devices in the monitoring system is solved, and reliable process monitoring and load reduction are achieved.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- BIGO TECH PTE LTD
- Filing Date
- 2021-11-26
- Publication Date
- 2026-06-26
AI Technical Summary
Existing technologies for monitoring business processes suffer from issues such as excessive load on data collection agent devices or instability in monitoring, leading to unreliable process monitoring and potential losses to property and users.
By using the central control device to collect the status information of the agent devices, the normally operating agent devices are identified and access address information is assigned. This avoids monitoring the problematic agent devices, enables parallel monitoring of multiple agent devices, and reduces the load on individual agent devices.
Ensure the reliability and stability of the monitoring system, reduce the number of processes monitored by a single agent device, reduce the load, and avoid significant property and user losses.
Smart Images

Figure CN113986687B_ABST
Abstract
Description
Technical Field
[0001] This invention relates to the field of big data processing technology, and in particular to a monitoring system, method, apparatus, equipment and medium. Background Technology
[0002] With the rapid development of the internet, especially the evolution of globalization, the internet's coverage is becoming increasingly wider. The number of applications developed by the company (such as Likee short video, Bigo Live live streaming, and Helloyo voice live streaming), the number of services each application can provide, and the number of users using these applications are also increasing, leading to more and more technical challenges for the company.
[0003] To provide stable and reliable services to users, a reliable monitoring system is needed to monitor the operational status of each process under each service, and to promptly address any failures in any process to prevent prolonged downtime and significant financial and user losses. Therefore, a reliable monitoring system is urgently needed for service monitoring. Summary of the Invention
[0004] This invention provides a monitoring system, method, apparatus, device, and medium for accurately and reliably monitoring the operational status of various processes under a business.
[0005] This invention provides a business monitoring system, which includes: a service registration device, a central control device, and multiple data acquisition agent devices;
[0006] The service registration device is used to obtain the access address information of the process corresponding to each business server and send it to the central control device;
[0007] The central control device is configured to receive each access address information sent by the service registration device; and acquire status information sent by the plurality of data collection agent devices respectively; the status information includes the identification information of the data collection agent device that sent the status information; based on each status information, determine the normally operating data collection agent device from the plurality of data collection agent devices; determine the change information of the data collection agent device according to each normally operating data collection agent device and the saved normal device information; based on the change information, determine at least one access address information corresponding to each normally operating data collection agent device and send it to the corresponding data collection agent device; wherein, different access address information is sent to different data collection agent devices;
[0008] The plurality of data acquisition agent devices are used to send their own status information to the central control device; if they receive at least one access address information sent by the central control device, they collect the running status information of the corresponding process based on the at least one access address information.
[0009] This invention provides a service monitoring method, which is applied to a central control device, and the method includes:
[0010] Receive each access address information sent by the service registration device;
[0011] It also acquires status information sent by multiple acquisition agent devices; the status information includes the identification information of the acquisition agent device that sent the status information;
[0012] Based on each of the aforementioned status information, determine the normally operating data acquisition agent device from among the plurality of data acquisition agent devices;
[0013] Based on each normally operating data acquisition agent device and the saved normal device information, determine the change information of the data acquisition agent device;
[0014] Based on the change information, at least one access address information corresponding to each normally operating data acquisition agent device is determined and sent to the corresponding data acquisition agent device; wherein, different access address information is sent to different data acquisition agent devices.
[0015] This invention provides a service monitoring method, which is applied to a data acquisition agent device. The method includes:
[0016] The system sends its own status information to the central control device, so that the central control device can obtain the status information sent by the multiple acquisition agent devices respectively. Based on each status information, the system determines the normally operating acquisition agent device from the multiple acquisition agent devices. Based on each normally operating acquisition agent device and the saved normal device information, the system determines the change information of the acquisition agent device. Based on the change information, the system determines at least one access address information corresponding to each normally operating acquisition agent device and sends it to the corresponding acquisition agent device. Different access address information is sent to different acquisition agent devices. The status information includes the identification information of the acquisition agent device.
[0017] If at least one access address information is received from the central control device, the running status information of the corresponding process is collected based on the at least one access address information.
[0018] This invention provides a service monitoring device, which is applied to a main control device and includes:
[0019] The receiving unit is configured to receive each access address information sent by the service registration device; and to acquire status information sent by multiple data collection agent devices respectively; the status information includes the identification information of the data collection agent device that sent the status information.
[0020] The first determining unit is used to determine, based on each of the status information, a normally operating acquisition agent device from the plurality of acquisition agent devices;
[0021] The second determining unit is used to determine the change information of the acquisition agent device based on each of the normally operating acquisition agent devices and the saved normal device information;
[0022] The third determining unit is used to determine at least one access address information corresponding to each of the normally operating acquisition agent devices based on the change information; wherein, different access address information is sent to different acquisition agent devices;
[0023] The sending unit is used to send at least one access address information corresponding to each of the normally operating acquisition agent devices to the corresponding acquisition agent device.
[0024] This invention provides a business monitoring device, which is applied to a data acquisition agent device. The device includes:
[0025] The sending module is used to send its own status information to the central control device, so that after the central control device receives the status information sent by the multiple acquisition agent devices, it determines the normally operating acquisition agent device from the multiple acquisition agent devices based on each status information; it determines the change information of the acquisition agent device according to each normally operating acquisition agent device and the saved normal device information; based on the change information, it determines at least one access address information corresponding to each normally operating acquisition agent device and sends it to the corresponding acquisition agent device; wherein, different access address information is sent to different acquisition agent devices; the status information includes the identification information of the acquisition agent device;
[0026] The processing module is used to collect the running status information of the corresponding process based on the at least one access address information received from the central control device.
[0027] This invention provides an electronic device, which includes a processor. The processor executes a computer program stored in a memory to implement the steps of the business monitoring method applied to a main control device as described above, or to implement the steps of the business monitoring method applied to a data acquisition agent device as described above.
[0028] This invention provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements the steps of the business monitoring method applied to a main control device as described above, or implements the steps of the business monitoring method applied to a data acquisition agent device as described above.
[0029] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on healthy devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load. Attached Figure Description
[0030] To more clearly illustrate the technical solutions in the embodiments of the present invention, the accompanying drawings used in the description of the embodiments will be briefly introduced below. Obviously, the accompanying drawings described below are only some embodiments of the present invention. For those skilled in the art, other drawings can be obtained based on these drawings without creative effort.
[0031] Figure 1 This is a schematic diagram of the structure of a business monitoring system provided in an embodiment of the present invention;
[0032] Figure 2 A schematic diagram of the workflow of a main control device provided in an embodiment of the present invention;
[0033] Figure 3 This is a schematic diagram of the structure of another business monitoring system provided in an embodiment of the present invention;
[0034] Figure 4 This is a schematic diagram of the structure of another business monitoring system provided in an embodiment of the present invention;
[0035] Figure 5 This is a schematic diagram of the workflow of a data acquisition agent device provided in an embodiment of the present invention;
[0036] Figure 6 A schematic diagram of a business monitoring process provided in an embodiment of the present invention;
[0037] Figure 7 This is a schematic diagram of another business monitoring process provided in an embodiment of the present invention;
[0038] Figure 8 This is a schematic diagram of the structure of a business monitoring device provided in an embodiment of the present invention;
[0039] Figure 9 This is a schematic diagram of the structure of a business monitoring device provided in an embodiment of the present invention;
[0040] Figure 10 A schematic diagram of an electronic device structure provided in an embodiment of the present invention;
[0041] Figure 11 This is a schematic diagram of another electronic device structure provided in an embodiment of the present invention. Detailed Implementation
[0042] To make the objectives, technical solutions, and advantages of this invention clearer, the invention will be further described in detail below with reference to the accompanying drawings. Obviously, the described embodiments are only a part of the embodiments of this invention, and not all of them. Based on the embodiments of this invention, all other embodiments obtained by those skilled in the art without creative effort are within the scope of protection of this invention.
[0043] In existing technologies, to monitor the running status of processes under each service, it is generally necessary to deploy a data collection agent device. This data collection agent device typically uses the following two modes to collect the running status information of processes under each service:
[0044] I. Push Mode.
[0045] In this Push mode, the monitored device proactively sends the running status information of each process under its own business to the data acquisition agent. This method of collecting the running status information of each process under each business relies heavily on the push function of the monitored device. The monitored device needs to have push capabilities, requiring certain modifications to the processes of each business on the monitored device to enable it to proactively send the running status information of each process under its own business to the data acquisition agent. Furthermore, when there are a large number of monitored devices, if multiple monitored devices simultaneously send data (collecting the running status information of processes) to the data acquisition agent, it can lead to excessive load on the data acquisition agent, potentially causing overload, crashes, and other problems, making the data acquisition agent highly stable.
[0046] II. Pull mode.
[0047] In this Pull mode, the data acquisition agent actively initiates a request to the monitored device to obtain the running status information of processes under each service running on the monitored device. However, this method of collecting the running status information of processes under each service has limitations. Because the number of monitored devices is relatively large, the data acquisition agent cannot monitor the running status information of all processes to the fullest extent. If the data acquisition agent malfunctions or experiences overload, data loss may occur, compromising the stability of the monitoring process.
[0048] To address the aforementioned issues, embodiments of the present invention provide a monitoring system, method, apparatus, device, and medium. When monitoring processes under each service, multiple data acquisition agent devices also send their own status information to the central control device. Based on this status information, the central control device can identify the normally operating data acquisition agent devices from among the multiple data acquisition agent devices, thereby avoiding the continued use of faulty data acquisition agent devices to monitor processes and ensuring the reliability of monitoring each process. Subsequently, the central control device can determine the change information of the data acquisition agent devices based on each normally operating data acquisition agent device and the saved normal device information. Based on this change information, it allocates access address information obtained from the service registration device, determines at least one access address information corresponding to each normally operating data acquisition agent device, and sends it to the corresponding data acquisition agent device. This enables sending different access address information to different data acquisition agent devices and allows multiple data acquisition agent devices to monitor each process in parallel, reducing the number of processes that a single data acquisition agent device needs to monitor and lowering the load on the data acquisition agent devices.
[0049] It should be noted that the application scenarios described in the above embodiments are merely exemplary scenarios for illustrative purposes and are not intended to limit the application scenarios of the monitoring system, method, apparatus, device, and medium provided in the embodiments of the present invention. Those skilled in the art should understand that the monitoring system, method, apparatus, device, and medium provided in the embodiments of the present invention can be applied to all application scenarios requiring data monitoring, such as blockchain data storage.
[0050] Example 1:
[0051] Figure 1 This is a schematic diagram of a business monitoring system provided in an embodiment of the present invention. The system includes: a service registration device 11, a central control device 12, and multiple data acquisition agent devices 13.
[0052] The service registration device 11 is used to obtain the access address information of the process corresponding to each business server and send it to the central control device 12;
[0053] The central control device 12 is used to receive each access address information sent by the service registration device 11; and to obtain status information sent by the plurality of collection agent devices 13 respectively; the status information includes the identification information of the collection agent device 13 that sent the status information; based on each status information, to determine the collection agent device 13 that is operating normally among the plurality of collection agent devices 13; based on each of the operating collection agent devices 13 and the saved normal device information, to determine the change information of the collection agent device 13; based on the change information, to determine at least one access address information corresponding to each of the operating collection agent devices 13 and to send it to the corresponding collection agent device 13; wherein, different access address information is sent to different collection agent devices 13.
[0054] The plurality of acquisition agent devices 13 are used to send their own status information to the central control device 12; if they receive at least one access address information sent by the central control device 12, they collect the running status information of the corresponding process based on the at least one access address information.
[0055] In this embodiment of the invention, the business monitoring system includes a service registration device 11, a central control device 12, and multiple data acquisition agent devices 13. The service registration device 11 is primarily used to monitor the access address information of processes running under the respective business services on each business server. The central control device 12 is primarily used to monitor the status information of the multiple data acquisition agent devices 13 and assign corresponding monitoring tasks to them. The multiple data acquisition agent devices 13 are primarily used to monitor the running status information of processes running under the respective business services on each business server.
[0056] Among them, the service registration device 11, the central control device 12, and multiple data acquisition agent devices 13 are all electronic devices, such as smart devices like mobile terminals and computers, or servers.
[0057] In this embodiment of the invention, the service registration device 11 can establish a connection with each business server. For each business server, if a process on the business server changes, such as adding, deleting, or restarting a process, the business server can generate a notification message and send it to the service registration device 11 based on the access address information of the changed process and the process identifier corresponding to the changed process, to inform the service registration device 11 that a process change has occurred. After receiving the notification message, the service registration device 11 can parse the notification message to obtain the process identifier and access address information carried in the notification message. It can then determine whether the process identifier is stored. If it is determined that the process identifier is stored, the access address information corresponding to the stored process identifier is updated according to the access address information carried in the notification message; if it is determined that the process identifier is not stored, the process identifier and the access address information corresponding to the process identifier are stored. Through the above process, the service registration device 11 can achieve real-time monitoring of the access address information of the processes corresponding to each business server, ensuring the accuracy of the subsequent collection of the running status information of the processes corresponding to each business server by the data acquisition agent device 13.
[0058] The access address information for this process can be the IP address and the port number.
[0059] If the service registration device 11 receives the notification message, it indicates that the access address information of a certain process may have changed. After receiving the notification message, the service registration device 11 can also send the notification message to the central control device 12 so that the central control device 12 can promptly and accurately monitor the latest access address information of all monitorable processes, and promptly assign monitoring tasks to multiple collection agent devices 13 according to the latest access address information of each process, that is, determine which collection agent device 13 is monitoring the latest running status information of each process.
[0060] After receiving the notification message from the service registration device 11, the central control device 12 parses the notification message to obtain the process identifier and access address information carried in the notification message. It determines whether the process identifier is stored. If it is, the stored access address information corresponding to the process identifier is updated according to the access address information carried in the notification message; if it is not, the process identifier and its corresponding access address information are stored. Then, all currently stored access address information is divided to determine at least one access address to be sent to the multiple data collection agent devices 13.
[0061] In one possible implementation, for all currently stored access address information, the central control device 12 can determine the group corresponding to the access address information based on a preset group value. Subsequently, all groups are assigned to multiple acquisition agent devices 13, meaning each acquisition agent device 13 is responsible for at least one group.
[0062] For example, when determining the group corresponding to the access address information based on preset group values, a value can be randomly selected from the preset group values and used as the identifier of the group corresponding to the access address information.
[0063] In one example, the central control device 12 can further divide each group based on the number of multiple acquisition agent devices 13, thereby determining at least one group corresponding to each of the multiple acquisition agent devices 13. For example, based on the number of multiple acquisition agent devices 13, all groups can be evenly divided to ensure that the number of processes monitored by the multiple acquisition agent devices 13 is as even as possible, avoiding excessive load on some acquisition agent devices 13 and insufficient load on others. The central control device 12 can also determine at least one group corresponding to each of the multiple acquisition agent devices 13 based on the number of multiple acquisition agent devices 13 and their computing power. For example, more processes can be allocated to acquisition agent devices 13 with high computing power, and fewer processes can be allocated to acquisition agent devices 13 with low computing power.
[0064] Of course, in this embodiment of the invention, staff can also manually configure at least one access address information corresponding to each of the multiple data collection agent devices 13 as needed.
[0065] In another possible implementation, for all currently stored access address information, the central control device 12 can determine the collection agent device 13 corresponding to the access address information based on a hash algorithm, such as a simple hash algorithm or a consistent hash algorithm.
[0066] Based on the above embodiments, after the central control device 12 determines at least one access address information corresponding to each of the multiple acquisition agent devices 13, it sends the access address information contained in at least one group corresponding to each acquisition agent device 13 to the acquisition agent device 13, so that the acquisition agent device 13 can collect the running status information of the corresponding process based on the access address information contained in the at least one group.
[0067] In order to avoid different acquisition agent devices 13 collecting the running status information of the same process and causing resource waste, in this embodiment of the invention, the access address information sent to different acquisition agent devices 13 is different.
[0068] For any acquisition agent device 13, if the acquisition agent device 13 receives at least one access address information sent by the central control device 12, it can collect the running status information of the process corresponding to the access address information based on the access address information.
[0069] In practical applications, during the process of collecting the running status information of the process, the acquisition agent device 13 may experience problems such as crashes or congestion, causing it to be unable to continue collecting the running status information of the process normally. Therefore, in this embodiment of the invention, multiple acquisition agent devices 13 can send their own status information to the central control device 12, so that the central control device 12 can adjust the monitoring tasks corresponding to the multiple acquisition agent devices 13 according to their respective status information.
[0070] The status information may include one or more of the following: identification information of the acquisition agent device 13, load information, the number of monitored processes, and monitoring success rate. Optionally, the status information may also include the time when the status information was reported, to help the central control device 12 distinguish the status information sent by the same acquisition agent device 13 at different times. The monitoring success rate is the ratio of the number of processes that successfully obtained running status information within a preset time period (denoted as the second preset time period) before the acquisition agent device 13 generates the status information to the number of processes that the acquisition agent device 13 needs to monitor during that time period. The identification information of the acquisition agent device 13 may be represented by numbers, strings, or other forms, as long as the form can uniquely identify the acquisition agent device 13, it can be applied to the embodiments of the present invention. When specifically setting the identification information of the acquisition agent device 13, it can be flexibly set according to actual needs.
[0071] It should be noted that multiple data acquisition agent devices 13 can send their own status information to the central control device 12 at preset intervals (e.g., 3 seconds, 5 seconds, etc.), or at preset time points. Of course, multiple data acquisition agent devices 13 can also send their own status information to the central control device 12 upon receiving an instruction to send status information. In specific implementation, these settings can be flexibly configured according to actual needs, and no specific limitations are made here.
[0072] When setting the preset period, different values can be set according to different scenarios. If you want to manage the status of each acquisition agent device 13 as promptly as possible, you can set the period to be smaller. If you want to reduce the computing power consumed by frequently acquiring the status information of each acquisition agent device 13, you can set the period to be larger.
[0073] After the central control device 12 obtains the status information sent by multiple data acquisition agent devices 13, it can determine the data acquisition agent device 13 that is currently operating normally based on each status information obtained.
[0074] In one possible implementation, when the central control device 12 determines the normally operating acquisition agent device 13 from among the multiple acquisition agent devices 13 based on the status information corresponding to each of the multiple acquisition agent devices 13, it can do so in the following way:
[0075] Method 1: Preset normal operating conditions. For example, these normal operating conditions could be that the monitoring success rate is greater than a preset success rate threshold, or that the load information is less than a preset load threshold. For each data acquisition agent device 13, determine whether the status information corresponding to that data acquisition agent device 13 meets the preset normal operating conditions. If it is determined that the status information corresponding to that data acquisition agent device 13 meets the preset normal operating conditions, then that data acquisition agent device 13 is determined to be a normally operating data acquisition agent device 13.
[0076] For example, if the status information includes a monitoring success rate, then from each status information currently obtained, determine the collection agent device 13 whose monitoring success rate is not less than the preset success rate threshold, and determine each collection agent device 13 as a normally operating collection agent device 13.
[0077] Method 2: Pre-set abnormal operation conditions. For example, these abnormal operation conditions could be that the monitoring success rate is not greater than a preset success rate threshold, or the load information is not less than a preset load threshold. For each data acquisition agent device 13, determine whether the status information corresponding to that data acquisition agent device 13 meets the preset abnormal operation conditions. If it is determined that the status information corresponding to that data acquisition agent device 13 meets the preset abnormal operation conditions, then that data acquisition agent device 13 is determined to be an abnormal data acquisition agent device 13. Based on all data acquisition agent devices 13 and each of the determined abnormal data acquisition agent devices 13, determine each data acquisition agent device 13 that is operating normally from among all the determined data acquisition agent devices 13.
[0078] For example, if the status information includes a monitoring success rate, then from each currently acquired status information, identify the abnormal data acquisition agent device 13 whose monitoring success rate is less than a preset success rate threshold. Based on all data acquisition agent devices 13 and each of the currently identified abnormal data acquisition agent devices 13, determine the normally functioning data acquisition agent devices 13.
[0079] In one possible implementation, based on the above embodiments, if the central control device 12 determines, based on each currently acquired status information, that there is an abnormally operating acquisition agent device 13 among the multiple acquisition agent devices 13, it can send a stop monitoring instruction to the abnormally operating acquisition agent device 13 and mark the abnormally operating acquisition agent device 13 as having its monitoring process stopped, so as to avoid continuing to assign monitoring tasks to the abnormally operating acquisition agent device 13, affecting the stability of monitoring, and thus causing significant property and user losses.
[0080] Since there may be situations where the data acquisition agent device 13 is unable to send status information to the central control device 12, the central control device 12 can determine that each data acquisition agent device 13 is not operating normally if it has not received status information from the data acquisition agent device 13 within a preset time period (referred to as the third preset time period) since the last time the status information of the data acquisition agent device 13 was obtained. This indicates that the data acquisition agent device 13 may have crashed or is unable to operate normally.
[0081] In this embodiment of the invention, when multiple acquisition agent devices 13 are operating normally, each process is monitored by its corresponding acquisition agent device 13 as much as possible to improve the stability of monitoring each process. However, if a new acquisition agent device 13 is added, or at least one acquisition agent device 13 crashes, the central control device 12 needs to reallocate at least one access address information corresponding to each of the currently operating acquisition agent devices 13 to ensure that each process can be monitored through all currently operating acquisition agent devices 13. Therefore, the central control device 12 stores the device information of each acquisition agent device 13 that was previously determined to be operating normally (denoted as normal device information). Based on the device information of each currently determined to be operating normally and the stored normal device information, the central control device 12 can determine the changes in the currently operating acquisition agent devices 13, that is, determine the change information of the acquisition agent devices 13. Based on the currently obtained change information, the central control device 12 can determine at least one access address information corresponding to each currently operating acquisition agent device 13. And for each currently operating acquisition agent device 13, the central control device 12 sends at least one access address information corresponding to that acquisition agent device 13.
[0082] The initial normal device information stored in the central control device 12 can be empty. In the subsequent process of monitoring the process, the stored normal device information is updated according to the device information of each normally operating acquisition agent device 13.
[0083] In one possible implementation, the central control device 12 determines at least one access address information corresponding to each normally operating data acquisition agent device 13 based on the currently acquired change information. This mainly includes the following situations:
[0084] Scenario 1: At least one new data acquisition agent device 13 has been added among multiple data acquisition agent devices 13.
[0085] If, after each normally functioning data acquisition agent device 13 was previously identified, at least one new data acquisition agent device 13 is added to these data acquisition agent devices 13—for example, due to an increase in the number of processes requiring monitoring, or because a data acquisition agent device 13 previously discontinued for process monitoring is reused—resulting in each currently identified normally functioning data acquisition agent device 13 having a data acquisition agent device different from all previously identified normally functioning data acquisition agent devices, i.e., a newly added data acquisition agent device 13 exists among the currently identified normally functioning data acquisition agent devices 13, then, based on each normally functioning data acquisition agent device 13 and the saved normal device information, the change information of the identified data acquisition agent device 13 represents the newly added device. This newly added device refers to a data acquisition agent device among each currently identified normally functioning data acquisition agent device that is different from all data acquisition agent devices corresponding to the normal device information. Based on this change information, when determining at least one access address information corresponding to each currently normally functioning data acquisition agent device 13, it is necessary to reconsider the correspondence between each currently normally functioning data acquisition agent device 13 and each access address information. The central control device 12 can determine at least one access address information corresponding to each currently operating acquisition agent device 13 based on each currently operating acquisition agent device 13 and each access address information.
[0086] It should be noted that the process of determining at least one access address information corresponding to each currently operating data acquisition agent device 13 based on each currently operating data acquisition agent device 13 and each access address information is similar to the process of determining at least one access address information corresponding to multiple data acquisition agent devices 13 as described above, and will not be repeated here.
[0087] Scenario 2: No new acquisition agent device 13 is added among multiple acquisition agent devices 13, but at least one acquisition agent device 13 is reduced.
[0088] If, after the previous determination of each normally functioning acquisition agent device 13, no new acquisition agent devices 13 are added among these acquisition agent devices 13, and at least one acquisition agent device 13 is reduced—for example, if an acquisition agent device 13, during the monitoring process, cannot continue monitoring normally due to problems such as crashes or overloads and is stopped from being used for monitoring—that acquisition agent device 13 is removed from the multiple acquisition agent devices 13. In this case, each currently determined normally functioning acquisition agent device 13 is only a portion of the previously determined normally functioning acquisition agent devices 13. Based on the currently determined normally functioning acquisition agent devices 13 and the saved normal device information, the change information of the determined acquisition agent devices 13 represents the reduction of devices. This reduction refers to each currently determined normally functioning acquisition agent device 13 being only a portion of all acquisition agent devices 13 corresponding to the normal device information. Based on this change information, when determining at least one access address information corresponding to each normally operating acquisition agent device 13, the access address information corresponding to each normally operating acquisition agent device 13 can be determined based on the correspondence between all acquisition agent devices 13 and access address information corresponding to the saved normal device information, that is, at least one access address information corresponding to all normally operating acquisition agent devices 13 determined in the last time.
[0089] In one example, based on all the acquisition agent devices 13 corresponding to the normal device information and each currently operating acquisition agent device 13, the target acquisition agent device 13 to be removed from the currently operating acquisition agent devices can be determined. Based on the saved correspondence between all the acquisition agent devices 13 corresponding to the normal device information and the access address information, the access address information corresponding to the target acquisition agent device 13 is determined. The access address information corresponding to the target acquisition agent device 13 is then re-divided so that each currently operating acquisition agent device 13 can temporarily replace the target acquisition agent device 13 in monitoring the process. That is, for each currently operating acquisition agent device 13, if at least one monitoring task corresponding to the target acquisition agent device 13 is assigned to that acquisition agent device 13, then that acquisition agent device 13 will not only continue to execute the previously assigned monitoring task, but also execute at least one monitoring task corresponding to the target acquisition agent device 13.
[0090] The central control device 12 can evenly distribute the access address information corresponding to the target acquisition agent device 13 to each acquisition agent device 13 that is currently operating normally, or it can divide the access address information corresponding to the target acquisition agent device 13 according to the computing power of each acquisition agent device 13 that is currently operating normally.
[0091] Based on the above embodiments, after the central control device 12 determines at least one access address information corresponding to each currently operating acquisition agent device 13, it can send at least one access address information corresponding to the acquisition agent device 13 to the acquisition agent device 13.
[0092] In one possible implementation, the central control device 12 can update the stored normal device information based on the device information of each normally operating acquisition agent device 13 currently determined, so as to ensure the accuracy of the change information determined next time.
[0093] Figure 2 This invention provides a schematic diagram of the workflow of a master control device, which includes:
[0094] First, the main control device can traverse all the status information sent by the acquisition agent device 13 according to a preset cycle.
[0095] Secondly, based on each acquired status information, the main control device determines the normally operating acquisition agent device 13 from among the multiple acquisition agent devices 13.
[0096] From this point on, the main control device determines the change information of the acquisition agent device 13 based on each normally operating acquisition agent device 13 and the saved normal device information.
[0097] Next, based on the change information, it is determined whether the currently operating data acquisition agent device 13 has changed. If it has changed, an abnormal data acquisition agent device 13 is identified, and a stop monitoring instruction is sent to the abnormal data acquisition agent device 13. Based on the change information, at least one access address information corresponding to each operating data acquisition agent device 13 is determined and sent to the corresponding data acquisition agent device 13. If there is no change, the process ends.
[0098] After receiving the access address information sent by the central control device 12, any normally functioning data acquisition agent device 13 can monitor the process corresponding to that access address information and obtain the process's running status information. Simultaneously, the data acquisition agent device 13 will also send status information to the central control device 12 to facilitate the central control device 12 in monitoring its own normal operation.
[0099] Considering that the continuous collection of running status information of a certain process by the acquisition agent device 13 may increase the load on the business server and the acquisition agent device 13, in this embodiment of the invention, the business monitoring system may further include a configuration device. This configuration device is used to acquire and save pre-configured acquisition cycles for each process. Before sending the access address information corresponding to the acquisition agent device 13 to the central control device 12, the central control device 12 can obtain the acquisition cycle of the process corresponding to the access address information from the configuration device, and send the access address information and the corresponding acquisition cycle to the acquisition agent device 13 together. This enables personalized monitoring of different processes and improves the flexibility of process monitoring. After receiving at least one access address information and the acquisition cycle corresponding to the at least one access address information sent by the central control device 12, the acquisition agent device 13, based on the acquisition cycle corresponding to the at least one access address information, collects the running status information of the process corresponding to the access address information. Figure 3 This is a schematic diagram of another business monitoring system provided in an embodiment of the present invention. Figure 3 As shown, the configuration device and service registration device 11 can interact with the central control device 12, and multiple acquisition agent devices 13 can also interact with the central control device 12, thereby facilitating the central control device 12 to obtain the acquisition cycle corresponding to the access address information from the configuration device.
[0100] In one possible implementation, the business monitoring system provided in this embodiment of the invention may include multiple electronic devices capable of performing the functions of a master control device. Distributed locking technology can be used to select a master node device and multiple slave node devices from these multiple electronic devices. Figure 4 This is a schematic diagram of another business monitoring system provided in an embodiment of the present invention. Figure 4 As shown, this business monitoring system includes three electronic devices capable of performing the functions of a master control device. One master node device and two slave node devices are selected from these. The three electronic devices can communicate with each other, and multiple data acquisition agents 13 send status information to the master node device. When monitoring the process under each business, the master node device primarily performs the functions of the master control device. If the master node device fails, any slave node device can replace it to perform the functions of the master control device, ensuring the stability of the business monitoring system and achieving good disaster recovery.
[0101] When monitoring the processes under each service, multiple data acquisition agent devices 13 also send their own status information to the central control device 12. Based on the status information of the multiple data acquisition agent devices 13, the central control device 12 can determine the normally operating data acquisition agent device 13 from among the multiple data acquisition agent devices 13, thereby avoiding the continued use of the problematic data acquisition agent device 13 to monitor the process and ensuring the reliability of monitoring each process. Subsequently, the central control device 12 can determine the change information of the data acquisition agent device 13 based on each normally operating data acquisition agent device 13 and the saved normal device information. Based on the change information, it allocates the access address information obtained from the service registration device 11, determines at least one access address information corresponding to each normally operating data acquisition agent device 13, and sends it to the corresponding data acquisition agent device 13. This realizes sending different access address information to different data acquisition agent devices 13, and multiple data acquisition agent devices 13 monitoring each process in parallel, reducing the number of processes that a single data acquisition agent device 13 needs to monitor, and reducing the load on the data acquisition agent device 13.
[0102] Example 2:
[0103] To avoid the load of the acquisition agent device 13 itself affecting its monitoring process, based on the above embodiments, in this embodiment of the invention, the plurality of acquisition agent devices 13 are specifically used to obtain their own load information; based on the load information, determine target access address information from the at least one access address information; and based on the target access address information, collect the running status information of the process corresponding to the target access address information.
[0104] During the monitoring process of the data acquisition agent device 13, if the availability of the data acquisition agent device 13 is insufficient, it may fail to accurately monitor the process at certain critical moments. For example, monitoring of a certain process may be missing, or the obtained process running status information may be inaccurate, resulting in significant financial and user losses. Therefore, in this embodiment of the invention, the data acquisition agent device 13 can adjust the monitored processes according to its own load information to ensure that important processes can be monitored, avoiding the impact of the data acquisition agent device 13's own load on the monitoring of important processes.
[0105] Specifically, the acquisition agent device 13 can obtain its own load information, such as the CPU utilization rate and load value. Based on the obtained load information, the target access address information is determined from at least one access address information currently obtained by the acquisition agent device 13. Then, based only on the target access address information, the running status information of the process corresponding to the target access address information is collected.
[0106] In one possible implementation, the process by which the acquisition proxy device 13 determines the target access address information from at least one access address information currently acquired by the acquisition proxy device 13 based on the acquired load information mainly includes the following two cases:
[0107] Scenario 1: If, based on the load information of the acquisition agent device 13, it is determined that the acquisition agent device 13 is not in an overloaded state, that is, the state of the acquisition agent device 13 is relatively good, it means that the acquisition agent device 13 can effectively collect the running status information of the corresponding process based on each access address information obtained. Therefore, each access address information obtained by the acquisition agent device 13 can be determined as the target access address information.
[0108] Scenario 2: If, based on the load information of the acquisition agent device 13, it is determined that the acquisition agent device 13 is in an overloaded state, for example, the CPU utilization rate is greater than 90% and the load value is greater than the number of CPU cores, it indicates that the acquisition agent device 13 cannot effectively collect the running status information of the corresponding process based on each access address information obtained. In this case, monitoring of a portion of the processes corresponding to each access address information obtained by the acquisition agent device 13 can be abandoned, and the remaining access address information can be determined as the target access address information.
[0109] In one possible implementation, the data acquisition agent device 13 can, in the following manner, abandon monitoring of a portion of the processes corresponding to each access address information obtained by the data acquisition agent device 13:
[0110] Method A: Generally, the longer the overloaded state of the data collection agent device 13 lasts, the greater the congestion level of the data collection agent device 13, and the more access address information needs to be abandoned; conversely, the shorter the overloaded state of the data collection agent device 13 lasts, the less congestion level of the data collection agent device 13, and the fewer access address information needs to be abandoned. Therefore, in this embodiment of the invention, a correspondence between duration ranges and ratios is pre-configured. Different duration ranges can represent the congestion level of the data collection agent device 13. This ratio can be a degradation ratio, i.e., the ratio between the number of abandoned access address information and the number of access address information pieces acquired by the data collection agent device 13, or a preservation ratio, i.e., the ratio between the number of retained access address information and the number of access address information pieces acquired by the data collection agent device 13. In specific implementation, it can be flexibly set according to actual needs. For example, if the duration range is [0, 1] minutes, the downgrade rate for this duration range is 10%; if the duration range is [1, 2] minutes, the downgrade rate is 20%; if the duration range is [2, 3] minutes, the downgrade rate is 30%; if the duration range is [3, 4] minutes, the downgrade rate is 40%; and if the duration range is [4, 5] minutes, the downgrade rate is 50%.
[0111] In the specific implementation process, if the data acquisition agent device 13 determines that it is in an overload state based on its own load information, it can obtain the duration of the overload state. Then, based on the duration and each preset duration range, it determines the target duration range in which the duration falls. Then, based on the pre-configured correspondence between duration ranges and ratios, it determines the target ratio corresponding to the target duration range. Based on the target ratio, it determines the target access address information from at least one access address information.
[0112] Specifically, the target proportion of target access address information can be randomly determined from at least one access address information, or the target proportion of target access address information can be determined from at least one access address information based on the importance of the processes corresponding to at least one preset access address information.
[0113] Method B: By extending the collection cycle of some processes, high-frequency monitoring of these processes can be abandoned.
[0114] In one possible implementation, the acquisition period corresponding to each process can be extended in the following way:
[0115] Method 1: Staff can set the service level corresponding to each process based on its importance. This service level reflects the importance of the process, allowing the data acquisition agent 13 to focus on monitoring processes with higher importance. Therefore, in this embodiment, the configuration device in the business monitoring system can also acquire and save the pre-configured service levels of each process. Before sending the access address information corresponding to the data acquisition agent 13 to the central control device 13, the central control device 12 can obtain the service level and collection cycle of the process corresponding to the access address information from the configuration device. This access address information, the corresponding collection cycle, and the service level are then sent to the data acquisition agent 13, enabling personalized monitoring of processes with different levels of importance and improving the flexibility of process monitoring. If the data acquisition agent device 13 receives at least one access address information, the data acquisition period corresponding to the at least one access address information, and the service level, respectively, it can determine the processes whose data acquisition period needs to be extended from the processes corresponding to the at least one access address information obtained by the data acquisition agent device 13 based on the service level corresponding to the at least one access address information, thereby abandoning high-frequency monitoring of these processes whose data acquisition period needs to be extended.
[0116] In specific implementation, if the data acquisition agent device 13 obtains service levels corresponding to at least one access address from the central control device 12, and it is determined that the data acquisition agent device 13 is in an overloaded state based on its load information, it can filter out the access address information of processes with lower importance from the at least one access address information according to the service levels corresponding to those access address information, thus determining the degraded access address information. For these degraded access address information, the data acquisition agent device 13 can appropriately reduce the number of times it monitors the processes with these degraded access address information. For example, the data acquisition period corresponding to the degraded access address information can be extended, and then the running status information of the processes corresponding to the degraded access address information can be collected according to the extended data acquisition period.
[0117] In one possible implementation, when the data acquisition agent device 13 determines the downgraded access address information from the at least one access address information according to the service level corresponding to each of the at least one access address information, it can determine all of the at least one access address information as downgraded access address information.
[0118] In another possible implementation, the acquisition agent device 13 can also determine which service level access address information needs to have its acquisition cycle extended based on the duration of its own overload state. That is, based on the duration of its own overload state, it can determine which service level access address information is currently degraded access address information. This can ensure the stability of the acquisition agent device 13 in monitoring high-importance processes as much as possible, thereby obtaining the running status information of high-importance processes in a timely manner.
[0119] The longer the overload duration of the data collection agent device 13, the greater its congestion level. Therefore, the service level corresponding to different degradation time ranges can be set for this duration. When determining degradation access address information based on the duration of its overload, the data collection agent device 13 can determine the target degradation time range based on each pre-configured degradation time range. Then, based on the service levels corresponding to different degradation time ranges, it determines the service level corresponding to the target degradation time range and uses the access address information of the service level corresponding to the target degradation time range as the degradation access address information.
[0120] When extending the collection period corresponding to the downgraded access address information, the extension multiple of the service level corresponding to the downgraded access address information can be determined by setting the correspondence between service level and extension multiple. The extended collection period corresponding to the downgraded access address information can then be determined by multiplying the determined extension multiple by the collection period. Alternatively, the extension duration of the service level corresponding to the downgraded access address information can be determined by setting the correspondence between service level and extension duration. The extended collection period corresponding to the downgraded access address information can then be determined by summing the determined extension duration with the collection period.
[0121] Method 2: During the process of collecting the running status information of a process, the acquisition agent device 13 may encounter situations where multiple collection attempts or prolonged collection failures occur for a particular process. For these processes, the acquisition agent device 13 can reduce the frequency of collection, i.e., extend the collection cycle for these processes. Based on this, in this embodiment of the invention, the acquisition agent device 13 determines whether to extend the collection cycle corresponding to a certain access address information based on the historical running status information of the process corresponding to that access address information. Specifically, before collecting the running status information of the process corresponding to that access address information according to the collection cycle corresponding to that access address information, the acquisition agent device 13 obtains the historical running status information of the process corresponding to that access address information; based on the historical running status information, it extends the collection cycle corresponding to that access address information; and based on the extended collection cycle, it updates the collection cycle corresponding to that access address information.
[0122] In one possible implementation, when extending the collection period corresponding to the access address information based on the historical operation status information, if each historical operation status information obtained within a first preset time period before the current time indicates an operation failure, then the collection period corresponding to the access address information is extended.
[0123] In another possible implementation, when extending the collection period corresponding to the access address information based on the historical operation status information, if the historical operation status information obtained before the current time all indicate operation failure, then the collection period corresponding to the access address information is extended.
[0124] The data acquisition agent device 13 can, in the manner described above, abandon monitoring of the processes corresponding to each access address information obtained by the data acquisition agent device 13, thereby effectively reducing the load on the data acquisition agent device 13, achieving load balancing, which is beneficial to the normal operation of the data acquisition agent device 13, improving the availability of the data acquisition agent device 13 and the stability of the business monitoring system.
[0125] To further reduce the load on the data collection agent device 13, a blacklist is maintained within it. This blacklist contains access address information (denoted as blocked access address information) that has experienced prolonged or repeated monitoring failures. Since monitoring processes with blocked access address information is likely to fail within a short period, the data collection agent device 13 can choose not to monitor these processes, thus further reducing its load. The initial blacklist can be empty. During subsequent process monitoring, the access address information of processes that require data collection to be stopped is added to the blacklist, preventing the data collection agent device 13 from monitoring any process with blocked access address information in the blacklist.
[0126] In the specific implementation process, before collecting the running status information of the corresponding process based on at least one access address information, the acquisition proxy device 13 can first determine whether the access address information is the same as any of the blocked access address information in the blacklist. If it is determined that the access address information is the same as any of the blocked access address information, then the running status information of the corresponding process is not collected based on the access address information. If it is determined that the access address information is different from each of the blocked access address information, then the running status information of the corresponding process is collected based on the access address information.
[0127] By using the above method, processes that may not be able to collect running status information can be "shielded", which greatly reduces the invalid collection of the collection agent device 13. That is, the process that is very likely to fail to be monitored is collected, which improves the utilization rate of the collection agent device 13 and avoids the high load and performance problems of the collection agent device 13 due to invalid collection.
[0128] To monitor all processes as comprehensively as possible, this embodiment of the invention pre-configures deletion conditions to remove blocked access address information from the blacklist that meets these conditions. These deletion conditions may include the duration the blocked access address information has been added to the blacklist reaching a preset threshold, or receiving an instruction to delete the blocked access address information. The acquisition proxy device 13 can determine whether each blocked access address information in the blacklist meets the preset deletion conditions. If it is determined that the blocked access address information meets the preset deletion conditions, it is deleted from the blacklist; otherwise, the next blocked access address information is obtained.
[0129] Since the business monitoring system includes multiple data acquisition agent devices 13, the steps described in the above embodiments can be executed for each data acquisition agent device 13.
[0130] Figure 5 This is a schematic diagram of the workflow of a data acquisition agent device 13 provided in an embodiment of the present invention. The workflow includes:
[0131] S501: Collect agent device 13 to obtain its own load information.
[0132] S502: The data acquisition agent device 13 determines whether it is in an overload state based on the load information. If so, execute S511; otherwise, execute S503.
[0133] S503: Obtain any access address information from the access address information of the process that needs to be monitored.
[0134] S504: Determine whether the access address information is the same as any of the blocked access address information in the blacklist. If yes, execute S505; otherwise, execute S507.
[0135] S505: Determine whether the same blocked access address information meets the preset deletion conditions. If yes, execute S506; otherwise, terminate the monitoring of the process for the access address information.
[0136] S506: Delete the blocked access address information.
[0137] S507: Based on the access address information, collect the running status information of the corresponding process.
[0138] S508: Determine whether the data collection was successful. If yes, end the monitoring of the process that accessed the address information; otherwise, execute S509.
[0139] S509: Determine whether the number of failed collections of the access address information has reached the preset threshold. If yes, execute S510; otherwise, terminate the monitoring of the process for the access address information.
[0140] S510: Add the access address information as a blocked access address information to the blacklist.
[0141] S511: Based on this load information, determine each target access address from all the obtained access address information, and determine each obtained target access address as the access address information of the process that needs to be monitored, and execute S503.
[0142] Example 3:
[0143] This invention provides a service monitoring method applied to central control equipment. Figure 6 A schematic diagram of a business monitoring process provided in an embodiment of the present invention includes:
[0144] S601: Receive each access address information sent by the service registration device.
[0145] S602: And acquire the status information sent by multiple acquisition agent devices respectively; the status information includes the identification information of the acquisition agent device that sent the status information.
[0146] S603: Based on each of the aforementioned status information, determine the normally operating acquisition agent device from among the plurality of acquisition agent devices.
[0147] S604: Based on each normally operating acquisition agent device and the saved normal device information, determine the change information of the acquisition agent device.
[0148] S605: Based on the change information, determine at least one access address information corresponding to each of the normally operating acquisition agent devices and send it to the corresponding acquisition agent device; wherein, different access address information is sent to different acquisition agent devices.
[0149] It should be noted that the principle of the business monitoring method provided in this embodiment of the invention to solve the technical problem can be found in the description in the above embodiments 1-2, and the repeated parts will not be repeated.
[0150] Further, the step of determining at least one access address information corresponding to each normally operating data acquisition agent device based on the change information and sending it to the corresponding data acquisition agent device includes:
[0151] If the change information indicates a new device, then based on each access address information, at least one access address information corresponding to each normally operating acquisition agent device is determined; wherein, the new device refers to an acquisition agent device in each normally operating acquisition agent device that is different from all acquisition agent devices corresponding to the normal device information;
[0152] If the change information indicates a reduction in devices, then based on the correspondence between all the acquisition agent devices and access address information corresponding to the saved normal device information, the access address information corresponding to each of the normally operating acquisition agent devices is determined; wherein, the reduction in devices refers to each of the normally operating acquisition agent devices, which is a portion of the devices among all the acquisition agent devices corresponding to the normal device information.
[0153] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on normal devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load.
[0154] Example 4:
[0155] This invention provides a service monitoring method applied to a data acquisition agent device. Figure 7 This is a schematic diagram of another business monitoring process provided in an embodiment of the present invention, the process including:
[0156] S701: Send its own status information to the central control device, so that after the central control device obtains the status information sent by the multiple acquisition agent devices respectively, it determines the normally operating acquisition agent device from the multiple acquisition agent devices based on each status information; determines the change information of the acquisition agent device according to each normally operating acquisition agent device and the saved normal device information; determines at least one access address information corresponding to each normally operating acquisition agent device based on the change information and sends it to the corresponding acquisition agent device; wherein, different access address information is sent to different acquisition agent devices; the status information includes the identification information of the acquisition agent device.
[0157] S702: If at least one access address information is received from the central control device, the running status information of the corresponding process is collected based on the at least one access address information.
[0158] It should be noted that the principle of the business monitoring method provided in this embodiment of the invention to solve the technical problem can be found in the description in the above embodiments 1-2, and the repeated parts will not be repeated.
[0159] Furthermore, the step of collecting the running status information of the corresponding process based on the at least one access address information includes:
[0160] Based on the load information, the target access address information is determined from the at least one access address information;
[0161] Based on the target access address information, the running status information of the process corresponding to the target access address information is collected.
[0162] Further, the step of collecting the running status information of the process corresponding to the target access address information based on the target access address information includes:
[0163] If, based on the load information, it is determined that it is in an overload state, then, according to the duration of the overload state and each preset duration range, the target duration range in which the duration is located is determined.
[0164] Based on the pre-configured correspondence between duration range and proportion, determine the target proportion corresponding to the target duration range;
[0165] Based on the target ratio, the target access address information is determined from the at least one access address information.
[0166] Furthermore, the step of collecting the running status information of the corresponding process based on the at least one access address information includes:
[0167] If the collection period corresponding to the at least one access address information is obtained from the central control device, then for the at least one access address information, according to the collection period corresponding to the access address information, the running status information of the process corresponding to the access address information is collected based on the access address information.
[0168] Furthermore, the step of collecting the running status information of the corresponding process based on the at least one access address information includes:
[0169] If the service level corresponding to the at least one access address information is obtained from the central control device, and it is determined that it is in an overload state based on the load information, then the downgraded access address information is determined from the at least one access address information according to the service level corresponding to the at least one access address information.
[0170] And the collection period corresponding to the downgraded access address information is extended;
[0171] Based on the extended collection period corresponding to the downgraded access address information, the running status information of the process corresponding to the downgraded access address information is collected.
[0172] Further, determining the downgraded access address information from the at least one access address information based on the service level corresponding to each of the at least one access address information includes:
[0173] Based on the duration of the overload state and the service level corresponding to the at least one access address information, the downgraded access address information is determined from the at least one access address information.
[0174] Furthermore, before collecting the running status information of the process corresponding to the access address information according to the collection period corresponding to the access address information, the method further includes:
[0175] Obtain the historical running status information of the process corresponding to the access address information;
[0176] Based on the historical operating status information, the collection period corresponding to the access address information is extended;
[0177] The collection period corresponding to the access address information is updated based on the extended collection period.
[0178] Furthermore, extending the collection period corresponding to the access address information based on the historical operating status information includes:
[0179] If each piece of historical running status information acquired within a first preset time period prior to the current time indicates a running failure, then the collection period corresponding to that access address information is extended; or
[0180] If the historical running status information obtained before the current time for a preset number of times all indicate running failure, then the collection period corresponding to the access address information will be extended.
[0181] Furthermore, before collecting the running status information of the corresponding process based on the at least one access address information, the method further includes:
[0182] For the at least one access address information, determine whether the access address information is the same as any of the blocked access address information in the blacklist;
[0183] If it is determined that the access address information is the same as any of the aforementioned blocked access address information, then the running status information of the corresponding process will not be collected based on the access address information.
[0184] Furthermore, the method also includes:
[0185] For each blocked access address in the blacklist, if it is determined that the blocked access address meets the preset deletion conditions, then the blocked access address is deleted from the blacklist.
[0186] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on normal devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load.
[0187] Example 5:
[0188] This invention also provides a service monitoring device, which is applied to a main control device. Figure 8 This is a schematic diagram of a business monitoring device provided in an embodiment of the present invention. The structure includes:
[0189] The receiving unit 81 is configured to receive each access address information sent by the service registration device; and to acquire status information sent by multiple collection agent devices respectively; the status information includes the identification information of the collection agent device that sent the status information;
[0190] The first determining unit 82 is used to determine, based on each of the status information, a normally operating acquisition agent device from the plurality of acquisition agent devices;
[0191] The second determining unit 83 is used to determine the change information of the acquisition agent device based on each of the normally operating acquisition agent devices and the saved normal device information;
[0192] The third determining unit 84 is used to determine at least one access address information corresponding to each of the normally operating acquisition agent devices based on the change information; wherein, different access address information is sent to different acquisition agent devices;
[0193] The sending unit 85 is used to send at least one access address information corresponding to each of the normally operating acquisition agent devices to the corresponding acquisition agent device.
[0194] It should be noted that the principle of the business monitoring device provided in this embodiment of the invention to solve the technical problem can be found in the description in the above embodiments 1-2, and the repeated parts will not be repeated.
[0195] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on normal devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load.
[0196] Example 6:
[0197] This invention also provides a business monitoring device, which is applied to a data acquisition agent device. Figure 9 This is a schematic diagram of another business monitoring device provided in an embodiment of the present invention. The structure includes:
[0198] The sending module 91 is used to send its own status information to the central control device, so that after the central control device receives the status information sent by the multiple acquisition agent devices, it determines the normally operating acquisition agent device from the multiple acquisition agent devices based on each status information; it determines the change information of the acquisition agent device according to each normally operating acquisition agent device and the saved normal device information; based on the change information, it determines at least one access address information corresponding to each normally operating acquisition agent device and sends it to the corresponding acquisition agent device; wherein, different access address information is sent to different acquisition agent devices; the status information includes the identification information of the acquisition agent device;
[0199] The processing module 92 is used to collect the running status information of the corresponding process based on the at least one access address information sent by the central control device if it receives at least one access address information.
[0200] It should be noted that the principle of the business monitoring device provided in this embodiment of the invention to solve the technical problem can be found in the description in the above embodiments 1-2, and the repeated parts will not be repeated.
[0201] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on normal devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load.
[0202] Example 7:
[0203] Figure 10 This is a schematic diagram of an electronic device structure provided in an embodiment of the present invention. The electronic device includes: a processor 1001, a communication interface 1002, a memory 1003, and a communication bus 1004, wherein the processor 1001, the communication interface 1002, and the memory 1003 communicate with each other through the communication bus 1004.
[0204] The memory 1003 stores a computer program, which, when executed by the processor 1001, causes the processor 1001 to perform the following steps:
[0205] Receive each access address information sent by the service registration device;
[0206] It also acquires status information sent by multiple acquisition agent devices; the status information includes the identification information of the acquisition agent device that sent the status information;
[0207] Based on each of the aforementioned status information, determine the normally operating data acquisition agent device from among the plurality of data acquisition agent devices;
[0208] Based on each normally operating data acquisition agent device and the saved normal device information, determine the change information of the data acquisition agent device;
[0209] Based on the change information, at least one access address information corresponding to each normally operating data acquisition agent device is determined and sent to the corresponding data acquisition agent device; wherein, different access address information is sent to different data acquisition agent devices.
[0210] Since the principles behind the problems solved by the aforementioned electronic devices are similar to those of business monitoring methods, the implementation of the aforementioned electronic devices can be found in the implementation of the methods, and the repetitive parts will not be repeated.
[0211] Example 8:
[0212] Figure 11 The present invention provides another schematic diagram of an electronic device structure. The electronic device includes: a processor 1101, a communication interface 1102, a memory 1103, and a communication bus 1104, wherein the processor 1101, the communication interface 1102, and the memory 1103 communicate with each other through the communication bus 1104.
[0213] The memory 1103 stores a computer program, which, when executed by the processor 1101, causes the processor 1101 to perform the following steps:
[0214] The system sends its own status information to the central control device, so that the central control device can obtain the status information sent by the multiple acquisition agent devices respectively. Based on each status information, the system determines the normally operating acquisition agent device from the multiple acquisition agent devices. Based on each normally operating acquisition agent device and the saved normal device information, the system determines the change information of the acquisition agent device. Based on the change information, the system determines at least one access address information corresponding to each normally operating acquisition agent device and sends it to the corresponding acquisition agent device. Different access address information is sent to different acquisition agent devices. The status information includes the identification information of the acquisition agent device.
[0215] If at least one access address information is received from the central control device, the running status information of the corresponding process is collected based on the at least one access address information.
[0216] Since the principles behind the problems solved by the aforementioned electronic devices are similar to those of business monitoring methods, the implementation of the aforementioned electronic devices can be found in the implementation of the methods, and the repetitive parts will not be repeated.
[0217] The communication bus mentioned in the above electronic devices can be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus, etc. This communication bus can be divided into address bus, data bus, control bus, etc. For ease of illustration, only one thick line is used to represent it in the diagram, but this does not mean that there is only one bus or one type of bus.
[0218] Communication interface 1102 is used for communication between the above-mentioned electronic device and other devices.
[0219] The memory may include random access memory (RAM) or non-volatile memory (NVM), such as at least one disk storage device. Optionally, the memory may also be at least one storage device located remotely from the aforementioned processor.
[0220] The processors mentioned above can be general-purpose processors, including central processing units, network processors (NPs), etc.; they can also be digital signal processors (DSPs), application-specific integrated circuits, field-programmable gate arrays or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
[0221] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on normal devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load.
[0222] Example 9:
[0223] Based on the above embodiments, this invention also provides a computer-readable storage medium storing a computer program executable by a processor. When the program is run on the processor, the processor executes the following steps:
[0224] Receive each access address information sent by the service registration device;
[0225] It also acquires status information sent by multiple acquisition agent devices; the status information includes the identification information of the acquisition agent device that sent the status information;
[0226] Based on each of the aforementioned status information, determine the normally operating data acquisition agent device from among the plurality of data acquisition agent devices;
[0227] Based on each normally operating data acquisition agent device and the saved normal device information, determine the change information of the data acquisition agent device;
[0228] Based on the change information, at least one access address information corresponding to each normally operating data acquisition agent device is determined and sent to the corresponding data acquisition agent device; wherein, different access address information is sent to different data acquisition agent devices.
[0229] Since the principle of computer-readable storage media in solving the problem is similar to the business monitoring method in the above embodiments, the specific implementation can be found in the implementation of the server selection method.
[0230] Example 10:
[0231] Based on the above embodiments, this invention also provides a computer-readable storage medium storing a computer program executable by a processor. When the program is run on the processor, the processor executes the following steps:
[0232] The system sends its own status information to the central control device, so that the central control device can obtain the status information sent by the multiple acquisition agent devices respectively. Based on each status information, the system determines the normally operating acquisition agent device from the multiple acquisition agent devices. Based on each normally operating acquisition agent device and the saved normal device information, the system determines the change information of the acquisition agent device. Based on the change information, the system determines at least one access address information corresponding to each normally operating acquisition agent device and sends it to the corresponding acquisition agent device. Different access address information is sent to different acquisition agent devices. The status information includes the identification information of the acquisition agent device.
[0233] If at least one access address information is received from the central control device, the running status information of the corresponding process is collected based on the at least one access address information.
[0234] Since the principle of computer-readable storage media in solving the problem is similar to the business monitoring method in the above embodiments, the specific implementation can be found in the implementation of the server selection method.
[0235] When monitoring processes under each service, multiple data acquisition agents send their own status information to the central control device. Based on this status information, the central control device can identify the normally functioning data acquisition agents, thus avoiding the continued use of faulty agents to monitor processes and ensuring the reliability of each process monitoring. Subsequently, the central control device can determine changes in the data acquisition agents based on each normally functioning agent and the saved information on normal devices. Based on this change information, it allocates access address information obtained from the service registration device, determining at least one access address for each normally functioning data acquisition agent and sending it to the corresponding agent. This enables sending different access address information to different data acquisition agents and allows multiple agents to monitor processes in parallel, reducing the number of processes a single agent needs to monitor and lowering its load.
[0236] Those skilled in the art will understand that embodiments of this application can be provided as methods, systems, or computer program products. Therefore, this application can take the form of a completely hardware embodiment, a completely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, this application can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
[0237] This application is described with reference to flowchart illustrations and / or block diagrams of methods, apparatus (systems), and computer program products according to this application. It should be understood that each block of the flowchart illustrations and / or block diagrams, and combinations of blocks in the flowchart illustrations and / or block diagrams, can be implemented by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, special-purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, generate instructions for implementing the flowchart illustrations. Figure 1 One or more processes and / or boxes Figure 1 A device that provides the functions specified in one or more boxes.
[0238] These computer program instructions may also be stored in a computer-readable storage medium that can direct a computer or other programmable data processing device to function in a particular manner, such that the instructions stored in the computer-readable storage medium produce an article of manufacture including instruction means, which are implemented in a process Figure 1 One or more processes and / or boxes Figure 1 The function specified in one or more boxes.
[0239] These computer program instructions may also be loaded onto a computer or other programmable data processing equipment to cause a series of operational steps to be performed on the computer or other programmable equipment to produce a computer-implemented process, thereby providing instructions that execute on the computer or other programmable equipment for implementing the process. Figure 1 One or more processes and / or boxes Figure 1 The steps of the function specified in one or more boxes.
[0240] Obviously, those skilled in the art can make various modifications and variations to this application without departing from the spirit and scope of this application. Therefore, if such modifications and variations fall within the scope of the claims of this application and their equivalents, this application also intends to include such modifications and variations.
Claims
1. A business monitoring system, characterized in that, The system includes: a service registration device, a central control device, and multiple data acquisition agent devices; The service registration device is used to obtain the access address information of the process corresponding to each business server and send it to the central control device; The central control device is configured to receive each access address information sent by the service registration device; and acquire status information sent by the plurality of data collection agent devices respectively; the status information includes the identification information of the data collection agent device that sent the status information; based on each status information, determine the normally operating data collection agent device from the plurality of data collection agent devices; determine the change information of the data collection agent device according to each normally operating data collection agent device and the saved normal device information; based on the change information, determine at least one access address information corresponding to each normally operating data collection agent device and send it to the corresponding data collection agent device; wherein, different access address information is sent to different data collection agent devices; The plurality of data acquisition agent devices are used to send their own status information to the central control device; if they receive at least one access address information sent by the central control device, they collect the running status information of the corresponding process based on the at least one access address information. Specifically, the central control device is used to determine at least one access address for each normally operating acquisition agent device based on each access address information if the change information indicates a new device; wherein, a new device refers to an acquisition agent device among the normally operating acquisition agent devices that is different from all acquisition agent devices corresponding to the normal device information; and to determine the access address for each normally operating acquisition agent device based on the stored correspondence between all acquisition agent devices and access address information corresponding to the normal device information if the change information indicates a reduction in devices; wherein, a reduction in devices refers to a portion of the normally operating acquisition agent devices among all acquisition agent devices corresponding to the normal device information.
2. The system as described in claim 1, characterized in that, The plurality of acquisition agent devices are specifically used to determine target access address information from the at least one access address information based on load information; and to collect the running status information of the process corresponding to the target access address information based on the target access address information.
3. The system as described in claim 2, characterized in that, The plurality of data acquisition agent devices are specifically configured to, if determined to be in an overload state based on the load information, determine the target duration range in which the overload state is located according to the duration of the overload state and each preset duration range; determine the target ratio corresponding to the target duration range according to the pre-configured correspondence between duration range and ratio; and determine the target access address information from the at least one access address information according to the target ratio.
4. The system as described in claim 1, characterized in that, The plurality of acquisition agent devices are specifically used to acquire, based on the acquisition period corresponding to the at least one access address information obtained from the central control device, the running status information of the process corresponding to the at least one access address information, according to the acquisition period corresponding to the access address information.
5. The system as described in claim 1, characterized in that, Specifically, the plurality of data acquisition agent devices are used to, if they obtain the service levels corresponding to the at least one access address information from the central control device, and determine that they are in an overload state based on load information, determine degraded access address information from the at least one access address information according to the service levels corresponding to the at least one access address information; extend the collection period corresponding to the degraded access address information; and collect the running status information of the process corresponding to the degraded access address information based on the extended collection period corresponding to the degraded access address information.
6. The system as described in claim 5, characterized in that, The plurality of data collection agent devices are specifically used to determine downgraded access address information from the at least one access address information based on the duration of the overload state and the service level corresponding to the at least one access address information.
7. The system as described in any one of claims 1-6, characterized in that, The plurality of acquisition agent devices are specifically used to obtain historical running status information of the process corresponding to the access address information before acquiring the running status information of the process corresponding to the access address information according to the acquisition period corresponding to the access address information; extend the acquisition period corresponding to the access address information based on the historical running status information; and update the acquisition period corresponding to the access address information according to the extended acquisition period.
8. The system as described in claim 7, characterized in that, The plurality of acquisition agent devices are specifically used to extend the acquisition period corresponding to the access address information if each of the historical running status information obtained within a first preset time period before the current time indicates a running failure. Alternatively, if the historical running status information obtained before the current time for a preset number of times all indicate running failure, the collection period corresponding to the access address information is extended.
9. The system as described in claim 1, characterized in that, The plurality of acquisition agent devices are further configured to, before acquiring the running status information of the corresponding process based on the at least one access address information, determine whether the access address information is the same as any of the blocked access address information in the blacklist; if it is determined that the access address information is the same as any of the blocked access address information, then the running status information of the corresponding process is not acquired based on the access address information.
10. The system as described in claim 9, characterized in that, The plurality of data collection proxy devices are also used to delete each of the blocked access address information in the blacklist from the blacklist if it is determined that the blocked access address information meets the preset deletion conditions.
11. A business monitoring method, characterized in that, The method is applied to central control equipment, and the method includes: Receive each access address information sent by the service registration device; It also acquires status information sent by multiple acquisition agent devices; the status information includes the identification information of the acquisition agent device that sent the status information; Based on each of the aforementioned status information, determine the normally operating data acquisition agent device from among the plurality of data acquisition agent devices; Based on each normally operating data acquisition agent device and the saved normal device information, determine the change information of the data acquisition agent device; Based on the change information, at least one access address information corresponding to each normally operating data acquisition agent device is determined and sent to the corresponding data acquisition agent device; wherein, different access address information is sent to different data acquisition agent devices. The step of determining at least one access address information corresponding to each normally operating data acquisition agent device based on the change information and sending it to the corresponding data acquisition agent device includes: If the change information indicates a new device, then based on each access address information, at least one access address information corresponding to each normally operating acquisition agent device is determined; wherein, the new device refers to an acquisition agent device in each normally operating acquisition agent device that is different from all acquisition agent devices corresponding to the normal device information; If the change information indicates a reduction in devices, then based on the correspondence between all the acquisition agent devices and access address information corresponding to the saved normal device information, the access address information corresponding to each of the normally operating acquisition agent devices is determined; wherein, the reduction in devices refers to each of the normally operating acquisition agent devices, which is a portion of the devices among all the acquisition agent devices corresponding to the normal device information.
12. A business monitoring method, characterized in that, The method is applied to a data acquisition agent device, and the method includes: The system sends its own status information to the central control device, enabling the central control device to obtain the status information sent by multiple acquisition agent devices. Based on each status information, it determines the normally operating acquisition agent device from among the multiple acquisition agent devices. According to each normally operating acquisition agent device and the saved normal device information, it determines the change information of the acquisition agent devices. If the change information indicates a newly added device, it determines at least one access address corresponding to each normally operating acquisition agent device based on each access address information received from the service registration device. Here, a newly added device refers to an acquisition agent device among each normally operating acquisition agent device that is different from all acquisition agent devices corresponding to the normal device information. If the change information indicates a reduced device, it determines the access address information corresponding to each normally operating acquisition agent device based on the saved correspondence between all acquisition agent devices corresponding to the normal device information and the access address information. Here, a reduced device refers to each normally operating acquisition agent device being a subset of all acquisition agent devices corresponding to the normal device information. Different access address information is sent to different acquisition agent devices. The status information includes the identification information of the acquisition agent device. If at least one access address information is received from the central control device, the running status information of the corresponding process is collected based on the at least one access address information.
13. A business monitoring device, characterized in that, The device is used in central control equipment, and the device includes: The receiving unit is configured to receive each access address information sent by the service registration device; and to acquire status information sent by multiple data collection agent devices respectively; the status information includes the identification information of the data collection agent device that sent the status information. The first determining unit is used to determine, based on each of the status information, a normally operating acquisition agent device from the plurality of acquisition agent devices; The second determining unit is used to determine the change information of the acquisition agent device based on each of the normally operating acquisition agent devices and the saved normal device information; The third determining unit is used to determine at least one access address information corresponding to each of the normally operating acquisition agent devices based on the change information; wherein, different access address information is sent to different acquisition agent devices; The sending unit is used to send at least one access address information corresponding to each of the normally operating acquisition agent devices to the corresponding acquisition agent device; Specifically, the third determining unit is configured to: if the change information indicates a newly added device, determine at least one access address information corresponding to each normally operating acquisition agent device based on each access address information; wherein, the newly added device refers to an acquisition agent device among each normally operating acquisition agent device that is different from all acquisition agent devices corresponding to the normal device information; if the change information indicates a reduced device, determine the access address information corresponding to each normally operating acquisition agent device based on the stored correspondence between all acquisition agent devices corresponding to the normal device information and access address information; wherein, the reduced device refers to each normally operating acquisition agent device being a subset of all acquisition agent devices corresponding to the normal device information.
14. A business monitoring device, characterized in that, The device is used in a data acquisition agent device, and the device includes: A sending module is used to send its own status information to the central control device, so that after the central control device receives the status information sent by multiple acquisition agent devices, it determines the normally operating acquisition agent device from the multiple acquisition agent devices based on each status information; according to each normally operating acquisition agent device and the saved normal device information, it determines the change information of the acquisition agent device; if the change information indicates a new device, it determines at least one access address information corresponding to each normally operating acquisition agent device based on each access address information received from the service registration device; wherein, a new device refers to an acquisition agent device in each normally operating acquisition agent device that is different from all acquisition agent devices corresponding to the normal device information; if the change information indicates a reduced device, it determines the access address information corresponding to each normally operating acquisition agent device based on the correspondence between all acquisition agent devices corresponding to the saved normal device information and access address information; wherein, a reduced device refers to each normally operating acquisition agent device being a part of all acquisition agent devices corresponding to the normal device information; wherein, different access address information is sent to different acquisition agent devices; the status information includes the identification information of the acquisition agent device; The processing module is used to collect the running status information of the corresponding process based on the at least one access address information received from the central control device.
15. An electronic device, characterized in that, The electronic device includes a processor, which executes a computer program stored in a memory to implement the steps of the service monitoring method as described in claim 11, or to implement the steps of the service monitoring method as described in claim 12.
16. A computer-readable storage medium, characterized in that, It stores a computer program that, when executed by a processor, implements the steps of the business monitoring method as described in claim 11, or implements the steps of the business monitoring method as described in claim 12.