The invention discloses a tourism
named entity recognition method based on a BBLC model, and the method comprises the steps: carrying out the BIO marking of statements in a corpus, and obtaining a BIOmarking set; inputting the BIO
annotation set into a BERT pre-training
language model, and outputting vector representation of each word in the statement, namely a
word embedding sequence in each statement; 3, taking the
word embedding sequence as the input of each
time step of the bidirectional LSTM, and carrying out the further semantic coding to obtain a statement
feature matrix; taking the statement
feature matrix as input of a CRF model, labeling and decoding the statement x to obtain a word
label sequence of the statement x, outputting a probability value that a
label of the statement xis equal to y, solving an optimal path by using a dynamically planned
Viterbi algorithm, and outputting a
label sequence with the maximum probability. According to the method, local context information can be obtained by adding the BERT pre-training
language model, the accuracy,
recall rate and F value are higher, the generalization ability and robustness are stronger, and the defects of a traditional model can be overcome.