The invention discloses a Word2Vec-BiLSTM-CRF-based
named entity recognition method in the legal field, and the method specifically comprises the following steps: obtaining
original data in the legal field, and carrying out the preprocessing of the data, so as to obtain training corpus data; inputting the obtained training corpus data into a Word2Vec
algorithm and combining the Word2Vec
algorithm with a CBOW model so as to obtain word vectors aiming at the legal field; labeling the training corpus data obtained through preprocessing in combination with
template matching and Chinese corpus pause and other
modes, obtaining labeled corpus, serving Bi-LSTM as a coding layer of a model, and combining the obtained labeled corpus and obtained word vectors to serve as input and output of the coding layer to obtain text
semantic information features; and taking the text
semantic information features obtained by the Bi-LSTM layer as input of the CRF, and finally outputting an identification result of the
named entity. The method has the advantages that rich entities in the legal document can be identified, fine-grained description of the entities in the legal field and data structuring in the legal field are realized, and the method is of great significance in further mining the relationship among different entities in the legal field.