Term extraction method and device based on deep learning network and storage medium

A deep learning network and extraction method technology, applied in the field of term extraction, can solve the problems of low term extraction rate, difficulty in completing massive text extraction, etc., and achieve the effect of improving the extraction rate

Active Publication Date: 2019-02-12
GCI SCI & TECH +1
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] There are three traditional term extraction methods: rule-based methods, statistical methods, and machine learning-based methods. The above-mentioned methods are mostly manual or non-manual based on the information in the corpus, and the rate of term extraction is low. For In today's era of information explosion, it is difficult to extract Chinese terms from massive texts by manual or semi-manual methods alone.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Term extraction method and device based on deep learning network and storage medium
  • Term extraction method and device based on deep learning network and storage medium
  • Term extraction method and device based on deep learning network and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0039] see figure 1 , which is a schematic flowchart of a term extraction method based on a deep learning network provided by an embodiment of the present invention. The methods include:

[0040] S100: Perform term labeling on the target text;

[0041] S200: Perform word segmentation processing on the tagged target text to obtain a word segmentation text, and extract keywords from the word segmentation text;

[0042] S300: Train the pre-established RNN deep lear...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a term extraction method and device based on a deep learning network and a storage medium. The method comprises the following steps of labeling a term on a target text; carryingout the word segmentation on the labeled target text to obtain a word segmentation text and extracting keywords; according to the keywords, training the pre-established RNN deep learning network to obtain the term prediction model, and obtaining the term prediction result outputted by the term prediction model; according to the term prediction results and term tagging corresponding to the targettext, training the pre-established CNN depth learning network to obtain the term extraction model, and obtaining the term extraction results outputted from the term extraction model. The method of theinvention integrates the RNN and CNN deep learning network to form a deeper deep learning network. According to the extracted keywords and the terminology labeling result of the target text, the method of the invention predicts and extracts the terminology of the target text, which can effectively improve the extraction speed of the terminology and realize the extraction of the Chinese terminology of the massive text.

Description

technical field [0001] The present invention relates to the technical field of term extraction, in particular to a term extraction method, device and storage medium based on a deep learning network. Background technique [0002] A term represents a profession or a research direction in a field. Term extraction has research significance in the field of natural language processing, especially in machine translation and cross-language information retrieval. [0003] There are three traditional term extraction methods: rule-based methods, statistics-based methods, and machine learning-based methods. The above methods are mostly manual or non-manual based on the information in the corpus, and the rate of term extraction is low. For In today's era of information explosion, it is difficult to extract Chinese terms from massive texts only by manual or semi-manual methods. Contents of the invention [0004] Based on this, the present invention proposes a term extraction method, de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06N3/08
CPCG06N3/08G06F40/284
Inventor 杨旭杜翠凤周善明张添翔叶绍恩梁晓文
Owner GCI SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products