The invention discloses a plain-text-oriented enterprise entity classification method. The method comprises the following steps: S1, labeling the enterprise entities in collected plain text data and using the labeled entities as the training set of an enterprise entity identification module, and further labeling the enterprise entities in the collected plain text data by type according to business nature and using those type-labeled entities as the training sample set of an enterprise entity classification module; S2, training a conditional random field model on the identification training set to obtain an enterprise entity identification model; S3, constructing semantic vectors for the text data of the original training set; S4, training on the type-labeled and semantically vectorized training set data to obtain an enterprise entity classification model; and S5, classifying the enterprise entities in a text to be predicted by using the enterprise entity classification model. Because the obtained semantic vectors serve as the features of the entities, the method reduces dependence on hand-crafted features and external data, thereby ensuring generality and robustness.
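
The data flow of steps S1 and S3 to S5 can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the BIO tag scheme, the toy two-dimensional word vectors, the averaging of context vectors, and the nearest-centroid classifier are all assumptions chosen only to keep the example self-contained, and every function and variable name is hypothetical. Step S2 (conditional random field training) is omitted because it requires an external library.

```python
import math
from collections import defaultdict

# Hypothetical toy word vectors; a real system would learn them from a corpus.
TOY_VECTORS = {
    "bank": [1.0, 0.0],
    "loans": [0.9, 0.1],
    "software": [0.0, 1.0],
    "cloud": [0.1, 0.9],
}


def bio_tags(tokens, entity_spans):
    """S1 (assumed BIO scheme): tag tokens inside labeled entity spans.

    entity_spans holds (start, end) token indices, end exclusive.
    """
    tags = ["O"] * len(tokens)
    for start, end in entity_spans:
        tags[start] = "B-ENT"
        for i in range(start + 1, end):
            tags[i] = "I-ENT"
    return tags


def context_vector(tokens):
    """S3 (assumed): average the vectors of the tokens found in TOY_VECTORS."""
    known = [TOY_VECTORS[t.lower()] for t in tokens if t.lower() in TOY_VECTORS]
    if not known:
        return [0.0, 0.0]
    return [sum(column) / len(known) for column in zip(*known)]


def train_centroids(samples):
    """S4 (assumed nearest-centroid model): one mean vector per business type."""
    grouped = defaultdict(list)
    for tokens, label in samples:
        grouped[label].append(context_vector(tokens))
    return {
        label: [sum(column) / len(vectors) for column in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return sum(x * y for x, y in zip(a, b)) / norm if norm else 0.0


def classify(tokens, centroids):
    """S5: return the business type whose centroid is most similar."""
    vector = context_vector(tokens)
    return max(centroids, key=lambda label: cosine(vector, centroids[label]))


# Hypothetical training sentences, already labeled by business nature.
SAMPLES = [
    (["Acme", "Bank", "offers", "loans"], "finance"),
    (["Initech", "builds", "cloud", "software"], "technology"),
]

centroids = train_centroids(SAMPLES)
print(bio_tags(["Acme", "Bank", "offers", "loans"], [(0, 2)]))
print(classify(["Globex", "sells", "software"], centroids))
```

In this sketch the entity is described only by the vectors of its surrounding words, which mirrors the stated idea of using semantic vectors rather than hand-crafted features or external data. In practice the toy vectors and the centroid model would be replaced by trained embeddings and whichever classifier is actually used.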