New word mining method and device, computer device and storage medium

A new word mining technique that filters text before generating candidate words, applied in computing and unstructured text data retrieval. It addresses the difficulty of determining algorithm parameters and the resulting low accuracy of mined new words, and improves that accuracy.
CN109635296A · Active · Publication Date: 2019-04-16 · GUANGZHOU LIZHI NETWORK TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
GUANGZHOU LIZHI NETWORK TECH CO LTD
Publication Date
2019-04-16


Abstract

The invention relates to a new word mining method and device, a computer device, and a storage medium. The method comprises the following steps: obtaining text information on which new word mining is to be performed; filtering the text information according to a preset filtering method to generate a plurality of text statements; inputting the text statements into a preset Nagao algorithm model to generate a plurality of candidate words; inputting each candidate word into a pre-trained classifier to classify it; and selecting the new words that meet the requirements according to the classification results. Because the candidate words are generated with the Nagao algorithm and then checked by the pre-trained classifier, low-accuracy candidates can be removed, which improves the accuracy of the generated new words.
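The four steps in the abstract (filter, generate candidates, classify, select) can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the abstract does not say which statistics the candidate model computes, so the code assumes n-gram frequency plus left/right neighbor entropy, a common formulation of Nagao-style string statistics. The function names, the `max_len` parameter, and `threshold_classifier` (a simple stand-in for the pre-trained classifier) are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def split_statements(text):
    """Filter step (assumed): drop punctuation/whitespace, keep runs of word characters."""
    return [s for s in re.split(r"[^\w]+", text) if s]


def entropy(counter):
    """Shannon entropy (in nats) of a Counter of neighbor characters; 0.0 if empty."""
    total = sum(counter.values())
    if not total:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counter.values())


def generate_candidates(statements, max_len=4):
    """Candidate step (assumed): n-gram frequency and left/right neighbor entropy."""
    freq = Counter()
    left = defaultdict(Counter)
    right = defaultdict(Counter)
    for s in statements:
        for n in range(2, max_len + 1):
            for i in range(len(s) - n + 1):
                gram = s[i:i + n]
                freq[gram] += 1
                if i > 0:
                    left[gram][s[i - 1]] += 1   # character just before the n-gram
                if i + n < len(s):
                    right[gram][s[i + n]] += 1  # character just after the n-gram
    return {
        g: {
            "freq": f,
            "left_entropy": entropy(left[g]),
            "right_entropy": entropy(right[g]),
        }
        for g, f in freq.items()
    }


def threshold_classifier(word, feats):
    """Hypothetical stand-in for the pre-trained classifier: fixed thresholds."""
    return feats["freq"] >= 2 and min(feats["left_entropy"], feats["right_entropy"]) > 0.5


def mine_new_words(text, classifier, known_words=frozenset()):
    """Run filter -> candidates -> classify -> select, skipping already-known words."""
    candidates = generate_candidates(split_statements(text))
    return sorted(
        w for w, feats in candidates.items()
        if w not in known_words and classifier(w, feats)
    )


print(mine_new_words("xabcy, zabcw; qabcr", threshold_classifier))  # prints ['abc']
```

In this toy input, "abc" appears three times with varied characters on both sides, so it passes, while "ab" and "bc" are rejected because one side is always followed or preceded by the same character. A trained model would replace `threshold_classifier` by taking the same features as input, which is the point of the classifier step: the hand-tuned thresholds are exactly the hard-to-determine parameters the abstract mentions.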

Description

Technical field

[0001] The invention relates to the technical field of information mining, and in particular to a new word mining method and device, computer equipment, and a storage medium.

Background

[0002] New word mining extracts words or characters from a known corpus to form new vocabulary, which can then be used to summarize text accurately, for example for tagging, statistics, index construction, and generating features of long texts. New word mining algorithms are a commonly used technique, applied mainly in scenarios such as word segmentation lexicons for search, knowledge graphs, text classification, and tag recommendation engines. However, the parameters of the commonly used new word mining algorithms are currently difficult to determine, which results in low accuracy of the generated new words.

Summary of the invention

[0003] Based on this, it is necessary to provide a new word mining method, device,...

Claims
