Concept extraction method and device, electronic equipment and storage medium

A concept and seed technology, applied in the computer field, can solve problems such as low accuracy, achieve the effect of reducing labor and improving the degree of automation

Pending Publication Date: 2021-03-19
TSINGHUA UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a concept extraction method, device, electronic equipment, and storage medium, which are used to solve the defect in the prior art that the accuracy of concept extraction results is low in the case of less labeled data, and realize More accurate concept extraction with little or no labeled data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Concept extraction method and device, electronic equipment and storage medium
  • Concept extraction method and device, electronic equipment and storage medium
  • Concept extraction method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0040] In the description of the embodiments of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer " and other indicated orientations or positional relationships are based on the orientations or positional relationships shown in the drawings, and are only ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a concept extraction method and device, electronic equipment and a storage medium, and the method comprises the steps: carrying out the term extraction of a to-be-extracted text according to a preset word list, obtaining a first candidate concept list, carrying out the entity linking of the to-be-extracted text according to a preset knowledge graph, and obtaining a second candidate concept list; reordering the candidate concepts in the first candidate concept list and the second candidate concept list, and obtaining a concept extraction result of the to-be-extracted text according to a reordering result, wherein the text to be extracted is an unstructured text. According to the concept extraction method and device, the electronic equipment and the storage medium provided by the embodiment of the invention, the candidate concepts obtained by performing term extraction and entity link acquisition on the to-be-extracted text are reordered, and theconcept extraction result is obtained according to the reordering result, so that under the condition of less annotation data or even no annotation data, concepts are extracted from the unstructured text more efficiently and accurately.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a concept extraction method, device, electronic equipment and storage medium. Background technique [0002] Concept, also known as scientific concept, is a term phrase used in scientific corpus to represent specific technologies and important knowledge points. For example, "binary tree" is an important concept in the computer field. [0003] Traditional concept extraction methods mainly include three categories: key phrase and term extraction, entity linking, and concept / set expansion. For key phrase and term extraction, candidate phrases are generally obtained by word segmentation and other methods, and then the candidate phrases are sorted by confidence, and the candidate phrases with higher scores are selected as the extraction results. Entity linking is the way of finding out from text the different mentions of entities existing in its background knowledge base. T...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/33G06F16/36
CPCG06F16/3329G06F16/3346G06F16/367
Inventor 李涓子王禹权于济凡陈凯源孙凯侯磊张鹏唐杰许斌孙茂松
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products