The method includes procedures: monitoring whether command of editing voice expression is received; if yes, then further receiving ID of photo expression (PE), and displaying expression of photo corresponding to the ID of PE; when receiving command of storing voice information (VI), the method receives and stores VI; when voice storage reaches to the prearranged voice storage space, or the finish save command is received in procedure of storing voice, the method closes receiving and storing voice, and synthesizes voice expression from the said PE and stored VI. Corresponding to the method, the invention also discloses an instant communication tool including reception unit, storage unit, display unit, first control unit, and transmission unit. Storing words the user wants to say into voice expression, the invention combines the voice expression with PE so as to express emotion and meaning vividly and visually, enrich modes of intercommunion, and raise degree of satisfaction.