The invention provides a method and a system for controlling the instant congestion of a server. In the method, controlling state comprises a first state, a second state and a third state, in each state, a data packet is processed by different methods under different conditions. The method comprises the following steps of: 1) receiving a data packet, then discarding or forwarding the data packet according to the controlling state; and 2) comparing the average request return time delay of the data packet with a detection threshold value, then changing controlling state or keeping an original state according to a comparison result, when the average request return time delay is greater than the detection threshold value, switching from the first state to the second state, or switching from the second state to the third state, and when congestion condition is improved, switching from the third state to the first state. By the method and the system, session integrity can be protected, the retransmission of massive data packets caused by the oscillation of server load can be avoided, and learning or training is not required and computation cost is low.