The invention discloses an audio optimization method, a related device, electronic equipment and a storage medium. The audio optimization method comprises the steps: extracting a first audio representation of a collected audio, and extracting a second audio representation of a reference audio; based on the first audio representation and the second audio representation, respectively extracting a first echo representation, a first voice representation and a first noise representation; performing interaction processing on the first voice representation, the first echo representation and the first noise representation to obtain a second voice representation, a second echo representation and a second noise representation, wherein the interactive processing comprises echo suppression, noise suppression and speech enhancement; and acquiring an optimized target audio based on at least one of the second speech representation, the second echo representation and the second noise representation. According to the method, the audio optimization effect can be improved.