The invention discloses a speech synthesis method and belongs to the technical field of speech synthesis. The method comprises the following steps: performing prosodic analysis on a text to be synthesized; segmenting the long text into a set of short texts of suitable length according to the prosodic analysis result, and recording the segmentation order; calling a speech synthesis model on the text objects in the set to generate their acoustic features in parallel; splicing the acoustic features of the text objects according to the recorded segmentation order; and passing the complete spliced acoustic features through a vocoder model to output the audio. Building on traditional speech synthesis methods, the invention parallelizes the spectrum generation stage for the text to be synthesized. This effectively solves two problems caused by overly long text: slow synthesis, and a tendency for the speech synthesis model to fail during spectrum generation. Synthesis speed is thereby increased, making the speech synthesis system more efficient, stable and natural.
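The steps of the method can be illustrated with a minimal Python sketch. It is not the implementation claimed by the invention: splitting at punctuation is only a stand-in for the rhythm (prosody) analysis, `acoustic_model` and `vocoder` are placeholder functions rather than real models, and all names and the `MAX_CHARS` limit are hypothetical. A thread pool is assumed as one possible way to run the segments in parallel.

```python
import re
from concurrent.futures import ThreadPoolExecutor

MAX_CHARS = 40  # hypothetical upper bound on segment length


def split_at_prosodic_boundaries(text, max_chars=MAX_CHARS):
    """Stand-in for prosody analysis: treat punctuation as boundaries and
    pack clauses into segments of at most max_chars where possible.
    Returns (index, segment) pairs so the segmentation order is recorded."""
    clauses = [c.strip() for c in re.findall(r"[^,.;!?]+[,.;!?]?", text)]
    clauses = [c for c in clauses if c]
    segments, current = [], ""
    for clause in clauses:
        if current and len(current) + 1 + len(clause) > max_chars:
            segments.append(current)
            current = clause
        else:
            current = f"{current} {clause}".strip()
    if current:
        segments.append(current)
    return list(enumerate(segments))


def acoustic_model(segment):
    """Placeholder acoustic model: one fake feature frame per character."""
    return [[float(ord(ch))] for ch in segment]


def vocoder(features):
    """Placeholder vocoder: maps each feature frame to one fake sample."""
    return [frame[0] / 255.0 for frame in features]


def synthesize(text, max_workers=4):
    indexed = split_at_prosodic_boundaries(text)
    # Generate acoustic features for all segments in parallel.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {idx: pool.submit(acoustic_model, seg) for idx, seg in indexed}
    # Splice the features in the recorded segmentation order,
    # regardless of the order in which the workers finished.
    features = []
    for idx, _ in indexed:
        features.extend(futures[idx].result())
    # Pass the complete spliced features through the vocoder once.
    return vocoder(features)


if __name__ == "__main__":
    text = ("The weather is nice today, so we went out. "
            "We walked along the river; then we came home!")
    for idx, seg in split_at_prosodic_boundaries(text):
        print(idx, repr(seg))
    print(len(synthesize(text)), "samples")
```

Keying the futures by segment index keeps the splicing step independent of completion order, which is the role of the recorded segmentation order in the method. With a real acoustic model, batching the segments or using separate processes or devices may be more suitable than threads; that choice is outside what the abstract specifies.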