The invention discloses an audio and video conference method based on a 5G low-delay spanning tree, and the method employs a terminal video source module, a terminal video viewing module, a terminal audio module, a terminal sound mixing module, a terminal distribution module, a terminal detection module, a terminal 5G communication module, a terminal D2D communication module, a 5G base station, anedge detection module, an edge distribution module, an edge sound mixing module, a 5G bearer network, a 5G core network, a central conference module, a central distribution module, a central sound mixing module, a central delay spanning tree module and a central detection module. According to the invention, a 5G network centralized core network is changed into a distributed optimization and 5G terminal ad hoc network and has forwarding capability, the delay of video forwarding and audio mixing is reduced by using a delay spanning tree and the self-forwarding of an edge server and a terminal,and the joining of an audio and video conference is realized through the forwarding of the terminal by using the signal difference between a 5G D2D technology terminal and a base station.