The invention relates to a
threat intelligence oriented entity identification method and
system. The method comprises the following steps: 1) performing coarse word segmentation on a
threat information text serving as a training corpus; 2) constructing a
threat information entity
common word dictionary
library and a rule
library, and performing dictionary matching and
rule matching on a coarse word segmentation result; 3) marking an entity
label for each word based on a matching result to form a
training set; 4) constructing a feature template, establishing an indication word
bank to perfect the screening form of the feature template, generating context features for the
training set by using the feature template, screening, and inputting the screened features into a
machine learning modelto carry out parameter iterative training; and 5) performing coarse word segmentation, dictionary matching and
rule matching on the threat information text to be identified, and performing entity identification by using the trained
machine learning model. According to the threat information entity extraction method, the threat information entity extraction is completed by adopting a means of combining a rule, a dictionary and a model, so that the entity identification precision of the threat information is remarkably improved.