The invention discloses an event-based microblog acquisition system and method and belongs to the technical field of information security. The
system comprises a URL structure module, a JSSH client module, a browser acquisition module and an HTML analysis module. The URL structure module is connected to the JSSH client module and transmits the acquired URL information; the JSSH client module is connected to the browser acquisition module and transmits JSSH instructions; and the browser acquisition module is connected to the HTML analysis module and transmits
HTML text. With this event-based microblog acquisition system and method, abstract data such as the author name, author homepage URL, author avatar URL, body content, short link, posting time, posting client, number of forwards and number of comments of a microblog message can be obtained through analysis. Each piece of unstructured data is converted into structured data, so that the abstract data becomes concrete and can be used in subsequent data mining.
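
The final step, turning unstructured HTML text into one structured record per microblog message, might look roughly like the following minimal Python sketch. It is an illustration only and not the implementation claimed by the invention: the record type `MicroblogRecord`, the parser `PostParser`, the helper `parse_post`, the sample markup and its class names (`author`, `avatar`, `body`, `short-link`, `time`, `client`, `forwards`, `comments`) are all assumptions made for this example, since the source does not specify the page layout or the parsing technique.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Optional


@dataclass
class MicroblogRecord:
    """Hypothetical structured form of one microblog message."""
    author_name: str = ""
    author_homepage_url: str = ""
    author_avatar_url: str = ""
    body: str = ""
    short_link: str = ""
    posted_at: str = ""
    client: str = ""
    forward_count: int = 0
    comment_count: int = 0


class PostParser(HTMLParser):
    """Fills a MicroblogRecord from markup that uses the assumed class names."""

    # Assumed CSS class -> record field whose value is the element's text.
    TEXT_FIELDS = {
        "author": "author_name",
        "body": "body",
        "time": "posted_at",
        "client": "client",
        "forwards": "forward_count",
        "comments": "comment_count",
    }
    COUNT_FIELDS = ("forward_count", "comment_count")

    def __init__(self) -> None:
        super().__init__()
        self.record = MicroblogRecord()
        self._field: Optional[str] = None  # field currently collecting text

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        css_class = attributes.get("class") or ""
        # Some fields live in attributes rather than in element text.
        if css_class == "author":
            self.record.author_homepage_url = attributes.get("href") or ""
        elif css_class == "avatar":
            self.record.author_avatar_url = attributes.get("src") or ""
        elif css_class == "short-link":
            self.record.short_link = attributes.get("href") or ""
        self._field = self.TEXT_FIELDS.get(css_class)

    def handle_endtag(self, tag):
        self._field = None

    def handle_data(self, data):
        text = data.strip()
        if self._field is None or not text:
            return
        if self._field in self.COUNT_FIELDS:
            setattr(self.record, self._field, int(text))
        else:
            setattr(self.record, self._field, text)


def parse_post(html: str) -> MicroblogRecord:
    """Parse one post's HTML text into a structured record."""
    parser = PostParser()
    parser.feed(html)
    parser.close()
    return parser.record


# Made-up sample markup; a real microblog page will differ.
SAMPLE_HTML = """
<div class="post">
  <a class="author" href="https://example.com/u/1">alice</a>
  <img class="avatar" src="https://example.com/a/1.png">
  <p class="body">hello world</p>
  <a class="short-link" href="https://t.example/abc">link</a>
  <span class="time">2013-01-01 12:00</span>
  <span class="client">web</span>
  <span class="forwards">3</span>
  <span class="comments">5</span>
</div>
"""

if __name__ == "__main__":
    print(parse_post(SAMPLE_HTML))
```

Each field of the record corresponds to one of the items listed above, so the output can be stored or passed directly to later data-mining steps. The sketch assumes flat markup with no nested tags inside a field; a production parser would need to handle nesting and missing elements.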