Method for monitoring running state of device, electronic device and storage medium

By monitoring resource utilization on the device and automatically invoking exception handling rules, the problem of maintenance personnel being unable to handle device anomalies in a timely manner is solved, and automated optimization of device operating status and stability improvement are achieved.

CN116010203BActive Publication Date: 2026-06-19BEIJING TOPSEC NETWORK SECURITY TECH +2

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
BEIJING TOPSEC NETWORK SECURITY TECH
Filing Date
2022-12-28
Publication Date
2026-06-19

AI Technical Summary

Technical Problem

Maintenance personnel are unable to address equipment malfunctions in a timely manner, leading to prolonged abnormal equipment operation.

Method used

By monitoring multiple resource items on the target device, the resource utilization rate is periodically obtained according to the monitoring rules, and the exception handling rules are automatically invoked when the threshold is reached. Combining manual and automated processing methods, the utilization rate threshold and priority processing rules are dynamically generated.

Benefits of technology

It achieves automated optimization of equipment resource utilization, timely handling of abnormal states, avoids long-term abnormal operation of equipment, and improves system stability.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN116010203B_ABST
    Figure CN116010203B_ABST
Patent Text Reader

Abstract

This application provides a method for monitoring the operating status of equipment, as well as an electronic device and a computer-readable storage medium. The method includes: periodically acquiring the resource utilization rate corresponding to each resource monitoring item for multiple resource monitoring items on a target device, based on the monitoring frequency indicated by the monitoring rule corresponding to each resource monitoring item; determining whether the resource utilization rate corresponding to the resource monitoring item reaches the utilization rate threshold indicated by its corresponding monitoring rule; if so, calling the exception handling rule corresponding to the resource monitoring item from a preset exception handling rule library to perform exception handling. This application's solution solves the problem of long-term abnormal equipment operation caused by maintenance personnel's inability to handle exceptions in a timely manner.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This application relates to the field of computer technology, and in particular to a method for monitoring the operating status of a device, an electronic device, and a computer-readable storage medium. Background Technology

[0002] In a system's operating environment, CPU (Central Processing Unit) utilization, memory utilization, and disk utilization are important indicators for measuring current system performance. When a system runs various business functions, it requires the support of resources such as CPU, memory, and disk. If a process occupies too many resources for an extended period, it may freeze or become unresponsive, thus affecting the overall system operation. If resource usage remains excessively high for a prolonged period, it can have uncontrollable impacts on certain business processes.

[0003] Therefore, resource monitoring is used to monitor the status of various system resources in real time. If resource usage is too high, the abnormal resource monitoring results are fed back to the operations and maintenance personnel, enabling them to resolve the problem and prevent uncontrollable errors from causing losses. However, in practical applications, operations and maintenance personnel may be unable to handle anomalies in a timely manner for various reasons, leading to prolonged abnormal operation of the equipment. Summary of the Invention

[0004] The purpose of this application is to provide a method for monitoring the operating status of equipment, as well as an electronic device and a computer-readable storage medium, to solve the problem of long-term abnormal operation of equipment due to the inability of maintenance personnel to handle abnormalities in a timely manner.

[0005] On the one hand, this application provides a method for monitoring the operating status of equipment, including:

[0006] For multiple resource monitoring items on the target device, the resource utilization rate corresponding to each resource monitoring item is periodically obtained according to the monitoring frequency indicated by the monitoring rule corresponding to each resource monitoring item.

[0007] Determine whether the resource utilization rate corresponding to the resource monitoring item has reached the utilization rate threshold indicated by its corresponding monitoring rule;

[0008] If the condition is met, the exception handling rule corresponding to the resource monitoring item will be called from the preset exception handling rule library to handle the exception.

[0009] Through the above measures, when the resource utilization rate corresponding to the resource monitoring item reaches the utilization rate threshold, the abnormal handling rules corresponding to the resource monitoring item can be automatically invoked for processing, thereby automatically optimizing resource usage and promptly handling abnormal equipment operation status.

[0010] In one embodiment, before periodically acquiring the resource utilization rate corresponding to each resource monitoring item based on the monitoring frequency indicated by the monitoring rule corresponding to each resource monitoring item on the target device, the method further includes:

[0011] Obtain the monitoring rules corresponding to each of the multiple resource monitoring items from the management terminal.

[0012] Through the above measures, monitoring rules can be flexibly configured for each target device via the management terminal. 5. In one embodiment, the usage rate threshold indicated by the monitoring rule is a dynamic threshold;

[0013] Before determining whether the resource utilization rate corresponding to the resource monitoring item has reached the utilization rate threshold indicated by its corresponding monitoring rule, the method further includes:

[0014] Periodically determine the resource utilization rate corresponding to the resource monitoring item, and select two feasible resource utilization rates from the continuously acquired resource utilization rates;

[0015] 0. The weight is determined by the smaller value of the utilization rate of the two feasible resources and the absolute value of the difference;

[0016] The utilization threshold is determined based on the larger of the weights and the utilization rates of the two feasible resources.

[0017] The above measures can be used to dynamically generate usage thresholds for resource monitoring items.

[0018] In one embodiment, selecting two feasible resource utilization rates from continuously acquired resource utilization rates includes:

[0019] Determine whether the absolute value of the difference between the resource utilization rate at the first moment and the resource utilization rate at the second moment is less than the target difference;

[0020] If so, determine the resource utilization rate at the first moment and the resource utilization rate at the second moment as the feasible resource utilization rate;

[0021] If not, determine whether the absolute value of the difference between the resource utilization rate at the first moment and the resource utilization rate at the third moment is less than the target difference;

[0022] If the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is less than the target difference, and the resource utilization rate at the second time step is greater than the resource utilization rates at the first and third time steps, then the resource utilization rates at the first and third time steps are determined as feasible resource utilization rates.

[0023] If the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is not less than the target difference, and the absolute value of the difference between the resource utilization rate at the second time step and the resource utilization rate at the third time step is less than the target difference, then the resource utilization rates at the second time step and the third time step are determined as feasible resource utilization rates.

[0024] The above measures can be used to determine the feasible resource utilization rate, which can then be used to calculate the dynamic threshold.

[0025] In one embodiment, the method further includes:

[0026] If the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is less than the target difference, and the resource utilization rate at the second time step is less than the resource utilization rates at the first and third time steps, the previous utilization rate threshold will be used; or,

[0027] If a feasible resource utilization rate cannot be selected from the resource utilization rates at the first, second, and third time points, the utilization rate threshold from the previous time point will be used.

[0028] The above measures allow for the use of previously calculated utilization thresholds when it is not possible to select a feasible resource utilization rate.

[0029] In one embodiment, the method further includes:

[0030] If the resource utilization rate corresponding to the resource monitoring item reaches the utilization rate threshold, check whether the processing method in the monitoring rule corresponding to the resource monitoring item is manual processing.

[0031] If it is handled manually, an alarm message is sent based on the alarm method indicated by the monitoring rules;

[0032] Determine whether the abnormal handling instruction corresponding to the alarm information is received within the preset waiting time. If not received, continue to execute the step of calling the abnormal handling rule corresponding to the resource monitoring item from the preset abnormal handling rule library to perform abnormal handling.

[0033] Through the above measures, when the monitoring rules indicate that the processing method is manual, we can wait for manual processing first. Then, when an exception handling instruction is received, we can respond to the exception handling instruction and process it. Even when an exception handling instruction cannot be received, we can still achieve automated processing with the help of exception handling rules, thus avoiding the problem of unresolved abnormal resource occupation.

[0034] In one embodiment, the method further includes:

[0035] If an exception handling instruction is received within the specified waiting time, exception handling shall be performed according to the exception handling instruction.

[0036] In one embodiment, the exception handling rule base includes multiple exception handling rules corresponding to the resource monitoring item, and different exception handling rules are configured with different priorities;

[0037] The step of calling the exception handling rule corresponding to the resource monitoring item from the preset exception handling rule library to perform exception handling includes:

[0038] Based on priority, the exception handling rules corresponding to the resource monitoring items are selected one by one for exception handling until the resource utilization rate corresponding to the resource monitoring item is less than the utilization rate threshold.

[0039] By implementing the above measures, exception handling rules with higher priority can be selected and processed step by step according to their priority order.

[0040] On the other hand, this application provides an electronic device, the electronic device comprising:

[0041] processor;

[0042] Memory used to store processor-executable instructions;

[0043] The processor is configured to execute the above-mentioned method for monitoring the operating status of the device.

[0044] In addition, this application provides a computer-readable storage medium storing a computer program that can be executed by a processor to perform the above-mentioned method for monitoring the operating status of the device. Attached Figure Description

[0045] To more clearly illustrate the technical solutions of the embodiments of this application, the accompanying drawings used in the embodiments of this application will be briefly described below.

[0046] Figure 1 A schematic diagram illustrating an application scenario of the device operation status monitoring method provided in an embodiment of this application;

[0047] Figure 2 This is a schematic diagram of the structure of an electronic device provided in an embodiment of this application;

[0048] Figure 3 A flowchart illustrating a method for monitoring the operating status of a device according to an embodiment of this application;

[0049] Figure 4 A schematic diagram of an exception handling rule base provided in an embodiment of this application;

[0050] Figure 5 A schematic diagram illustrating the configuration of monitoring rules provided in one embodiment of this application;

[0051] Figure 6 A flowchart illustrating a method for generating a usage threshold according to an embodiment of this application;

[0052] Figure 7 A schematic diagram of a management page provided in an embodiment of this application;

[0053] Figure 8 This is an overall architecture diagram of a device operation status monitoring method provided in an embodiment of this application;

[0054] Figure 9 A block diagram of a device for monitoring the operating status of an equipment provided in an embodiment of this application. Detailed Implementation

[0055] The technical solutions in the embodiments of this application will now be described with reference to the accompanying drawings.

[0056] Similar reference numerals and letters in the following figures indicate similar items; therefore, once an item is defined in one figure, it does not need to be further defined and explained in subsequent figures. Furthermore, in the description of this application, terms such as "first," "second," etc., are used only to distinguish descriptions and should not be construed as indicating or implying relative importance.

[0057] Figure 1 This is a schematic diagram illustrating an application scenario for the device operation status monitoring method provided in this application embodiment. For example... Figure 1 As shown, the application scenario includes a target device 20 and a management terminal 30. The management terminal 30 can be a user terminal such as a mobile phone, tablet computer, or host, which can respond to user configuration and generate several monitoring rules for the target device 20. The target device 20 can be a server, security gateway, or other device, which can obtain the monitoring rules issued by the management terminal 30 and thus monitor its own operating status according to the monitoring rules.

[0058] like Figure 2 As shown, this embodiment provides an electronic device 1, including: at least one processor 11 and a memory 12. Figure 2 Taking a processor 11 as an example, the processor 11 and the memory 12 are connected via a bus 10. The memory 12 stores instructions that can be executed by the processor 11. The instructions are executed by the processor 11 to enable the electronic device 1 to perform all or part of the processes of the methods in the embodiments described below. In one embodiment, the electronic device 1 may be the aforementioned target device 20, used to execute a method for monitoring the operating status of a device.

[0059] The memory 12 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable red-only memory (PROM), read-only memory (ROM), magnetic storage, flash memory, magnetic disk or optical disk.

[0060] This application also provides a computer-readable storage medium storing a computer program that can be executed by a processor 11 to perform the device operation status monitoring method provided in this application.

[0061] See Figure 3 This is a flowchart illustrating a method for monitoring the operating status of a device according to an embodiment of this application. Figure 3 As shown, the method may include the following steps 310-330.

[0062] Step 310: For multiple resource monitoring items on the target device, periodically obtain the resource utilization rate corresponding to each resource monitoring item according to the monitoring frequency indicated by the monitoring rule corresponding to each resource monitoring item.

[0063] The resource monitoring items are the device resources that need to be monitored. For example, resource monitoring items may include CPU utilization, disk utilization, memory utilization, etc.

[0064] For each resource monitoring item on the target device that needs to be monitored, the target device can pre-configure corresponding monitoring rules. The monitoring rules can include a monitoring frequency, which indicates the period for obtaining the resource utilization rate corresponding to the resource monitoring item. For example, if the monitoring frequency is 60 seconds, then the resource utilization rate needs to be obtained every 60 seconds.

[0065] On the target device, for each resource monitoring item, a corresponding process or thread can be allocated, and the monitoring method for the device's operating status can be executed through that process or thread.

[0066] Step 320: Determine whether the resource utilization rate corresponding to the resource monitoring item has reached the utilization rate threshold indicated by its corresponding monitoring rule.

[0067] Step 330: If the condition is met, call the exception handling rule corresponding to the resource monitoring item from the preset exception handling rule library to perform exception handling.

[0068] After obtaining the resource utilization rate corresponding to the resource monitoring item, the target device can determine whether the resource utilization rate has reached the utilization rate threshold indicated by the corresponding monitoring rule. Here, the utilization rate threshold can be a manually configured fixed value or an intelligently configured dynamic threshold, which can be pre-configured as needed.

[0069] In one scenario, if the resource utilization rate does not reach the utilization threshold, it indicates that the resource usage of that monitored item is normal and requires no action; monitoring of that resource item can continue. Alternatively...

[0070] On the one hand, if the resource utilization rate reaches the utilization rate threshold, then the zero resource usage for that resource monitoring item is considered abnormal. The target device can retrieve the corresponding exception handling rule from the exception handling rule base.

[0071] Regularly configured rules are used to handle abnormal resource usage. Here, the exception handling rule base can be pre-configured with multiple exception handling rules, and each resource monitoring item has multiple exception handling rules in the exception handling rule base. See also Figure 4 This is an exception handling rule library provided in one embodiment of this application.

[0072] A schematic diagram, such as Figure 4 As shown, the exception handling rule base contains exception handling rules corresponding to various resource monitoring items. Multiple exception handling rules corresponding to the same resource monitoring item can be arranged in order of priority.

[0073] Through the above measures, when the resource utilization rate corresponding to the resource monitoring item reaches the utilization rate threshold, the abnormal handling rules corresponding to the resource monitoring item can be automatically invoked for processing, thereby automatically optimizing resource usage and promptly handling abnormal equipment operation status.

[0074] In one embodiment, before performing step 310, the target device can obtain the monitoring rules corresponding to each of the multiple resource monitoring items from the management terminal.

[0075] Operations and maintenance personnel can configure corresponding monitoring rules for each resource monitoring item on the management interface. The management interface can provide a web page, allowing operations and maintenance personnel to configure monitoring rules for target devices on the web page.

[0076] See Figure 5 This is a schematic diagram illustrating the configuration of monitoring rules provided in an embodiment of this application, as shown below. Figure 5As shown, monitoring rules can include resource monitoring items, monitoring switches, monitoring frequencies, processing methods, alarm methods, threshold management methods, and utilization thresholds. Resource monitoring items can be configured as needed; for example, they can include CPU utilization, memory utilization, disk utilization, sending traffic, and receiving traffic. Monitoring switches indicate whether monitoring of resource monitoring items is enabled or disabled. Monitoring frequencies indicate the period for obtaining and comparing the resource utilization corresponding to a resource monitoring item. Processing methods can include manual processing and intelligent processing. Manual processing involves handling anomalies through manual intervention; intelligent processing involves automatically handling anomalies through anomaly handling rules. Alarm methods limit how alarm information is sent to operations and maintenance personnel and can include SMS, email, and mini-programs. Threshold management methods can include manually configured fixed values ​​or intelligently configured dynamic thresholds. When intelligent configuration is selected, an initial threshold can be manually set or not set, and the initial threshold can be calculated.

[0077] After the management terminal configures the monitoring rules for each resource monitoring item on the target device, it can send the monitoring rules to the target device so that the target device can monitor its own operating status according to the monitoring rules.

[0078] In one embodiment, if the monitoring rule indicates that the utilization threshold is a dynamic threshold, a utilization threshold can be dynamically generated for the resource monitoring item. See also Figure 6 This is a flowchart illustrating a method for generating a usage rate threshold according to an embodiment of this application. Figure 6 As shown, the method may include steps 610 to 630.

[0079] Step 610: Periodically determine the resource utilization rate corresponding to the resource monitoring item, and select two feasible resource utilization rates from the continuously acquired resource utilization rates.

[0080] The period for periodically determining resource utilization can be pre-configured as needed. This period can be the same as or different from the period for obtaining resource utilization when performing steps 310 to 330. For example, the period is 10 seconds.

[0081] For resource monitoring items that dynamically generate usage thresholds, the target device can allocate a dedicated process or thread to execute steps 510 to 530.

[0082] For example, taking CPU utilization as an example, the device resource log file (usually / proc / stat in Linux systems) can be read, all data can be summarized to obtain the current total CPU usage and idle time. By subtracting the idle time from the total usage and dividing the difference by the total usage, an approximate value of CPU utilization can be obtained. The target device can periodically perform this calculation process to periodically obtain CPU utilization.

[0083] The target device can select two feasible resource utilization rates from continuously acquired resource utilization rates, where the absolute value of the difference between these two feasible resource utilization rates is less than a preset target difference. Here, the target difference can be configured according to requirements; for example, the target difference can be 3%.

[0084] Step 620: Determine the weights based on the smaller of the two feasible resource utilization rates and the absolute value of the difference.

[0085] The target device can obtain the weight by dividing the absolute value of the difference between the utilization rates of two feasible resources by the smaller value of the two feasible resource utilization rates.

[0086] Step 630: Determine the utilization threshold based on the weights and the larger of the two feasible resource utilization rates.

[0087] After obtaining the weights, the target device can multiply the larger of the two feasible resource utilization rates by the weights to obtain the utilization threshold.

[0088] In one embodiment, when the target device selects two feasible resource utilization rates from continuously acquired resource utilization rates, it can determine whether the absolute value of the difference between the resource utilization rate at the first time moment and the resource utilization rate at the second time moment is less than a target difference. Here, the first time moment and the second time moment are both general terms, and there may be a period of time difference between the first time moment and the second time moment, with the first time moment preceding the second time moment.

[0089] After acquiring the resource utilization rates at two consecutive moments, the target device can calculate the absolute value of the difference between the resource utilization rates at these two moments, and then determine whether the absolute value of the difference is less than the target difference.

[0090] On the one hand, if so, the resource utilization rates at the first and second time points are determined as feasible resource utilization rates. Furthermore, a utilization threshold can be calculated based on the resource utilization rates at the first and second time points, as described in the previous section, and will not be repeated here.

[0091] On the other hand, if not, the target device can determine whether the absolute value of the difference between the resource utilization rate at the first moment and the resource utilization rate at the third moment is less than the target difference. The third moment is one cycle longer than the second moment, with the second moment preceding the third.

[0092] In one scenario, if the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is less than the target difference, and the resource utilization rate at the second time step is greater than the resource utilization rates at both the first and third time steps, the target device can determine the resource utilization rates at the first and third time steps as feasible resource utilization rates. Furthermore, a utilization rate threshold can be calculated based on the resource utilization rates at the first and third time steps, as described in the preceding text, and will not be repeated here.

[0093] In another scenario, if the absolute value of the difference between the resource utilization rate at the first moment and the resource utilization rate at the third moment is less than the target difference, and the resource utilization rate at the second moment is less than the resource utilization rates at the first and third moments, it can be determined that the resource utilization rates at the first and third moments are too large relative to the resource utilization rate at the second moment, indicating that there is a sharp rise and fall in the resource utilization rate in the short term. In this case, the utilization rate threshold can be calculated without using the resource utilization rates at the first, second, or third moments, and the previous utilization rate threshold can be used instead.

[0094] In another scenario, if the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is not less than the target difference, and the absolute value of the difference between the resource utilization rate at the second time step and the resource utilization rate at the third time step is less than the target difference, the target device can determine the resource utilization rates at the second and third time steps as feasible resource utilization rates. Furthermore, a utilization threshold can be calculated based on the resource utilization rates at the second and third time steps, as described in the preceding text, and will not be repeated here.

[0095] Furthermore, if a feasible resource utilization rate cannot be selected from the resource utilization rates at the first, second, and third time points—in other words, if the absolute value of the difference between the resource utilization rates at the first and second time points is not less than the target difference, and the absolute value of the difference between the resource utilization rates at the first and third time points is not less than the target difference, it can be concluded that the resource utilization rate is not stable in the short term. In this case, the utilization rate threshold can be calculated without using the resource utilization rates at the first, second, or third time points, and the previous utilization rate threshold can be used instead.

[0096] By implementing the above measures, a utilization threshold can be dynamically calculated when resource utilization is relatively stable. This dynamic threshold can then be used to accurately determine whether there is an abnormal surge in resource occupancy.

[0097] In one embodiment, after the target device executes step 320, if the resource utilization rate corresponding to the resource monitoring item reaches the utilization rate threshold, the target device can check whether the processing method in the monitoring rule corresponding to the resource monitoring item is manual processing.

[0098] On the one hand, if the processing method is intelligent processing, the target device can continue to execute step 330, and call the exception processing rules from the exception processing rule base to perform exception processing.

[0099] On the other hand, if the processing method is manual, the target device can send alarm information based on the alarm methods indicated by the monitoring rules. The target device can send alarm information to maintenance personnel via SMS, email, or mini-programs as indicated by the monitoring rules. This alarm information indicates that the resource utilization rate of a monitored resource item has reached a utilization threshold. For example: CPU utilization exceeds the threshold of 90%, please handle it.

[0100] After sending an alarm message, the target device can determine whether to receive the corresponding exception handling instruction within a preset waiting time. The exception handling instruction is a manually generated instruction used to handle abnormal resource usage. The waiting time is the duration for waiting for a response from operations and maintenance personnel, and can be configured as needed; for example, the waiting time can be 5 minutes. See also... Figure 7 This is a schematic diagram of a management page provided in an embodiment of this application, as shown below. Figure 7 As shown, the management page can include processes consuming resources, and selectable handling methods such as "restart," "shut down," and "limit resources." After receiving and viewing alarm information through the management terminal, maintenance personnel can log in to the management page, generate exception handling commands, and send them to the target device through the management terminal.

[0101] On the one hand, if an exception handling instruction is received within the waiting period, the target device can perform exception handling according to the instruction, thereby completing the manual handling process for abnormal resource occupancy. On the other hand, if no exception handling instruction is received within the waiting period, the target device can continue to execute step 330, calling exception handling rules from the exception handling rule base to perform exception handling.

[0102] Through the above measures, when the monitoring rules indicate that the processing method is manual, we can wait for manual processing first. Then, when an exception handling instruction is received, we can respond to the exception handling instruction and process it. Even when an exception handling instruction cannot be received, we can still achieve automated processing with the help of exception handling rules, thus avoiding the problem of unresolved abnormal resource occupation.

[0103] In one embodiment, the exception handling rule base includes multiple exception handling rules corresponding to resource monitoring items, and different exception handling rules are configured with different priorities. When the target device executes step 330, it can select the exception handling rules corresponding to the resource monitoring items one by one according to the priority order for exception handling. After processing, it can check whether the resource utilization rate corresponding to the resource monitoring item is less than the utilization rate threshold. On the one hand, if it is less than the utilization rate threshold, the processing is completed. On the other hand, if it is not less than the utilization rate threshold, the next exception handling rule can be selected to continue exception handling. This process can be repeated multiple times until the resource utilization rate corresponding to the resource monitoring item is less than the utilization rate threshold.

[0104] By implementing the above measures, exception handling rules with higher priority can be selected and processed step by step according to their priority order.

[0105] See Figure 8 This is an overall architecture diagram of a device operation status monitoring method provided in an embodiment of this application, as shown below. Figure 8 As shown, the management console can configure monitoring rules, including threshold management methods (manual or intelligent configuration) and anomaly handling methods (manual or intelligent handling), and then distribute the monitoring rules to the target devices. The target devices can periodically acquire resource information and compare it with the utilization thresholds indicated by the monitoring rules. When the resource utilization reaches the threshold, an alarm is output according to the alarm method indicated by the monitoring rules, and corresponding anomaly handling is then performed.

[0106] The overall processing flow is illustrated below with an example:

[0107] As a security gateway for the target device, upon startup, it reads the CPU usage monitoring rules issued by the management terminal. These rules specify a monitoring frequency of 60 seconds, an alarm method via WeChat mini-program, intelligent handling for anomalies, intelligent generation (dynamic threshold) for threshold generation, and a threshold change cycle of 10 minutes. During operation, the security gateway continuously reads the current system resource status at a 60-second monitoring frequency, periodically updates the CPU resource usage ranking list, saves the resource status data, and compares the acquired resource status data with the thresholds.

[0108] After the security gateway system has been running for 10 minutes, it collects all status data and calculates the utilization threshold as 70% based on the threshold algorithm. 70% is then used as the alarm threshold for the next cycle.

[0109] After running for another 6 minutes, the current CPU utilization of the security gateway exceeded 70%, and the maintainer was notified via WeChat mini-program. Furthermore, the security gateway can read the exception handling rule library and, based on the highest priority exception handling rule (cpulimit), limit the process with the highest CPU usage. After limiting it, the CPU utilization and the utilization threshold are checked again. If the CPU utilization is still higher than the threshold, the second-priority exception handling rule is selected from the exception handling rule library, and the process with the highest CPU usage is shut down. At this point, the CPU utilization has decreased to 60%, and the exception handling is complete. The security gateway awaits monitoring in the next cycle.

[0110] 0 Figure 9 This is a block diagram of a device for monitoring the operating status of an equipment according to an embodiment of the present invention, as shown below. Figure 9 As shown, the device may include:

[0111] The acquisition module 910 is used to periodically acquire the corresponding data for each resource monitoring item on the target device, based on the monitoring frequency indicated by the monitoring rules for each resource monitoring item.

[0112] Resource utilization rate;

[0113] 5. Judgment module 920 is used to determine whether the resource utilization rate corresponding to the resource monitoring item has reached the utilization rate threshold indicated by its corresponding monitoring rule.

[0114] The processing module 930 is used to, if the condition is met, call the exception handling rule corresponding to the resource monitoring item from the preset exception handling rule library to perform exception handling.

[0115] The specific implementation process of the functions and roles of each module in the above-mentioned device can be found in the implementation process of the corresponding steps in the monitoring method for the device in operation state 0, and will not be repeated here.

[0116] The apparatuses and methods disclosed in the several embodiments provided in this application can also be implemented in other ways. The apparatus embodiments described above are merely illustrative. For example, the flowcharts and block diagrams in the accompanying drawings illustrate the architecture, functionality, and operation of possible implementations of apparatuses, methods, and computer program products according to various embodiments of this application. In this regard, each block in a flowchart or block diagram may represent a module, segment, or portion of code, which contains one or more executable instructions for implementing a specified logical function. In some alternative implementations, the functions marked in the blocks may occur in a different order than those marked in the drawings. For example, two consecutive blocks may actually be executed substantially in parallel, and they may sometimes be executed in reverse order, depending on the functions involved. It should also be noted that each block in a block diagram and / or flowchart, and combinations of blocks in block diagrams and / or flowcharts, can be implemented using a dedicated hardware-based system that performs the specified function or action, or using a combination of dedicated hardware and computer instructions.

[0117] In addition, the functional modules in the various embodiments of this application can be integrated together to form an independent part, or each module can exist independently, or two or more modules can be integrated to form an independent part.

[0118] If a function is implemented as a software module and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this application, in essence, or the part that contributes to the prior art, or a portion of the technical solution, can be embodied in the form of a software product. This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods of the various embodiments of this application. The aforementioned storage medium includes various media capable of storing program code, such as a USB flash drive, a portable hard drive, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.

Claims

1. A method for monitoring the operating status of equipment, characterized in that, include: For multiple resource monitoring items on the target device, the resource utilization rate corresponding to each resource monitoring item is periodically obtained according to the monitoring frequency indicated by the monitoring rule corresponding to each resource monitoring item. Determine whether the resource utilization rate corresponding to the resource monitoring item has reached the utilization rate threshold indicated by the corresponding monitoring rule; the utilization rate threshold indicated by the monitoring rule is a dynamic threshold. If the condition is met, the exception handling rule corresponding to the resource monitoring item is called from the preset exception handling rule library to perform exception handling. The method further includes, before determining whether the resource utilization rate corresponding to the resource monitoring item has reached the utilization rate threshold indicated by its corresponding monitoring rule: Periodically determine the resource utilization rate corresponding to the resource monitoring item; Determine whether the absolute value of the difference between the resource utilization rate at the first moment and the resource utilization rate at the second moment is less than the target difference; If so, determine the resource utilization rate at the first moment and the resource utilization rate at the second moment as the feasible resource utilization rate; If not, determine whether the absolute value of the difference between the resource utilization rate at the first moment and the resource utilization rate at the third moment is less than the target difference; If the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is less than the target difference, and the resource utilization rate at the second time step is greater than the resource utilization rates at the first and third time steps, then the resource utilization rates at the first and third time steps are determined as feasible resource utilization rates. If the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is not less than the target difference, and the absolute value of the difference between the resource utilization rate at the second time step and the resource utilization rate at the third time step is less than the target difference, then the resource utilization rates at the second time step and the third time step are determined as feasible resource utilization rates. The weight is obtained by dividing the absolute value of the difference between the utilization rates of the two feasible resources by the smaller value of the utilization rates of the two feasible resources; Multiply the larger of the two feasible resource utilization rates by the weight to obtain the utilization rate threshold; If the absolute value of the difference between the resource utilization rate at the first time step and the resource utilization rate at the third time step is less than the target difference, and the resource utilization rate at the second time step is less than the resource utilization rates at the first and third time steps, the previous utilization rate threshold will be used; or, If a feasible resource utilization rate cannot be selected from the resource utilization rates at the first, second, and third time points, the utilization rate threshold from the previous time point will be used.

2. The method according to claim 1, characterized in that, Before periodically acquiring the resource utilization rate corresponding to each resource monitoring item based on the monitoring frequency indicated by the monitoring rule corresponding to each resource monitoring item for multiple resource monitoring items on the target device, the method further includes: Obtain the monitoring rules corresponding to each of the multiple resource monitoring items from the management terminal.

3. The method according to claim 1, characterized in that, The method further includes: If the resource utilization rate corresponding to the resource monitoring item reaches the utilization rate threshold, check whether the processing method in the monitoring rule corresponding to the resource monitoring item is manual processing. If it is handled manually, an alarm message is sent based on the alarm method indicated by the monitoring rules; Determine whether the abnormal handling instruction corresponding to the alarm information is received within the preset waiting time. If not received, continue to execute the step of calling the abnormal handling rule corresponding to the resource monitoring item from the preset abnormal handling rule library to perform abnormal handling.

4. The method according to claim 3, characterized in that, The method further includes: If an exception handling instruction is received within the specified waiting time, exception handling shall be performed according to the exception handling instruction.

5. The method according to claim 1, characterized in that, The exception handling rule base includes multiple exception handling rules corresponding to the resource monitoring item, and different exception handling rules are configured with different priorities. The step of calling the exception handling rule corresponding to the resource monitoring item from the preset exception handling rule library to perform exception handling includes: Based on priority, the exception handling rules corresponding to the resource monitoring items are selected one by one for exception handling until the resource utilization rate corresponding to the resource monitoring item is less than the utilization rate threshold.

6. An electronic device, characterized in that, The electronic device includes: processor; Memory used to store processor-executable instructions; The processor is configured to execute the device operation status monitoring method according to any one of claims 1-5.

7. A computer-readable storage medium, characterized in that, The storage medium stores a computer program that can be executed by a processor to perform the device operation status monitoring method according to any one of claims 1-5.