
New word mining method and device, computer device and storage medium

A new word mining and filtering technology, applied in computing, unstructured text data retrieval, instruments, etc. It addresses the difficulty of determining algorithm parameters and the resulting low accuracy of mined new words, and improves that accuracy.

Active Publication Date: 2019-04-16
GUANGZHOU LIZHI NETWORK TECH CO LTD
3 Cites, 14 Cited by

AI Technical Summary

Problems solved by technology

[0003] In view of this, it is necessary to provide a new word mining method, device, computer equipment and storage medium that address the problem that the parameters of current new word mining algorithms are difficult to determine, which results in low accuracy of the new words generated.



Examples


Detailed Description of the Embodiments

[0050] The present invention is described in further detail below with reference to preferred embodiments and the accompanying drawings. Obviously, the embodiments described below only explain the present invention and do not limit it. All other embodiments obtained from these embodiments by persons of ordinary skill in the art without creative effort fall within the protection scope of the present invention. It should be noted that, for convenience of description, the drawings show only the parts related to the present invention rather than all of the content.

[0051] 【Related description part】

[0052] It should be noted that the terms "first\second\third" used in the embodiments of the present invention only distinguish similar objects and do not represent a specific ordering of the objects. Understandably, "first\second\third" can be interchanged in a specific order or sequence where...


Abstract

The invention relates to a new word mining method and device, a computer device and a storage medium. The method comprises the following steps: obtaining the text information on which new word mining is to be performed; filtering the text information according to a preset filtering method to generate a plurality of text statements; inputting the text statements into a preset Nagao algorithm model to generate a plurality of candidate words; inputting each candidate word into a pre-trained classifier to classify it; and selecting the new words that meet the requirements according to the classification results. Because the candidate words are generated by the Nagao algorithm and then checked by the pre-trained classifier, low-accuracy words can be removed, which improves the accuracy of the new words generated.
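The steps in the abstract can be sketched as a small pipeline. This is only an illustration under stated assumptions, not the patented implementation: the source does not disclose the filtering rule, the internals of the candidate-generation algorithm, or the classifier. Here the filter simply keeps runs of letters, candidate generation is approximated by character n-gram counts with left and right neighbor entropy, and `threshold_classifier` is a hypothetical rule standing in for the pre-trained classifier. All function names and thresholds are invented for this example.

```python
import math
import re
from collections import Counter, defaultdict


def split_statements(text):
    """Assumed filter: keep runs of letters (incl. CJK); drop digits, punctuation, spaces."""
    return re.findall(r"[^\W\d_]+", text)


def entropy(counter):
    """Shannon entropy (natural log) of a Counter of neighbor characters."""
    total = sum(counter.values())
    if total == 0:
        return 0.0
    return -sum(c / total * math.log(c / total) for c in counter.values())


def candidate_stats(statements, max_len=4):
    """Count every 2..max_len character n-gram and record its neighbors.

    Returns {candidate: (frequency, left_entropy, right_entropy)}.
    """
    freq = Counter()
    left = defaultdict(Counter)
    right = defaultdict(Counter)
    for s in statements:
        for i in range(len(s)):
            for n in range(2, max_len + 1):
                if i + n > len(s):
                    break
                gram = s[i:i + n]
                freq[gram] += 1
                if i > 0:
                    left[gram][s[i - 1]] += 1
                if i + n < len(s):
                    right[gram][s[i + n]] += 1
    return {g: (freq[g], entropy(left[g]), entropy(right[g])) for g in freq}


def threshold_classifier(features):
    """Hypothetical stand-in for the pre-trained classifier (not from the source)."""
    frequency, left_entropy, right_entropy = features
    return frequency >= 2 and min(left_entropy, right_entropy) > 0


def mine_new_words(text, classifier, known_words=frozenset()):
    """Filter -> generate candidates -> classify -> keep accepted, unknown words."""
    stats = candidate_stats(split_statements(text))
    return sorted(g for g, f in stats.items()
                  if g not in known_words and classifier(f))


if __name__ == "__main__":
    print(mine_new_words("xab,yabz.qabw", threshold_classifier))
```

In a real system the stand-in rule would be replaced by a trained model that takes the same per-candidate features, which is the point of the method: the classifier, rather than hand-tuned thresholds, decides which candidates are kept.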

Description

Technical field

[0001] The invention relates to the technical field of information mining, and in particular to a new word mining method, device, computer equipment and storage medium.

Background

[0002] New word mining extracts words or characters from a known corpus to form new vocabulary, so that accurate summary information can be produced from text; for example, tags, statistics, indexes, and long-text features can be generated through new word mining. New word mining algorithms are the commonly used technology for this task, and they are mainly applied in scenarios such as word segmentation lexicons for search, knowledge graphs, text classification, and tag recommendation engines. However, the parameters of the new word mining algorithms in common use today are difficult to determine, which results in low accuracy of the new words generated.

Summary of the invention

[0003] Based on this, it is necessary to provide a new word mining method, device,...


Application Information

IPC(8): G06F17/27, G06F16/35
CPC: G06F40/284
Inventor 谢春发
Owner GUANGZHOU LIZHI NETWORK TECH CO LTD