Unlock instant, AI-driven research and patent intelligence for your innovation.

Grain condition named entity recognition method based on new word discovery and Flat-lattice

A technology for named entity recognition and new word discovery, applied in character and pattern recognition, other database retrieval, network data retrieval, etc. Error propagation, improved entity recognition, improved effectiveness

Pending Publication Date: 2021-12-03
HENAN UNIVERSITY OF TECHNOLOGY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a grain situation named entity recognition method based on new word discovery and Flat-lattice, which is used to solve the low recognition rate of proper nouns in the grain situation field, wrong word segmentation affects the effect of entity recognition, and the current lack of follow-up research The structured grain data set and other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Grain condition named entity recognition method based on new word discovery and Flat-lattice
  • Grain condition named entity recognition method based on new word discovery and Flat-lattice
  • Grain condition named entity recognition method based on new word discovery and Flat-lattice

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] To make the objectives, technical solutions and advantages of the present invention will become more apparent hereinafter in conjunction with the accompanying drawings and specific embodiments of the present invention will be further described in detail:

[0020] like figure 1 As shown in this example and found that the grain situation Flat-lattice named entity recognition method, comprising the steps based on the new words:

[0021] Step (A), establishment of grain condition NER text corpus, using climbing python crawler technology related text taken from the food grain situation and HowNet dictionary, and stores it as a txt file. After finishing acquired corpus grain situation, for lack of data redundancy and data quality issues, the establishment of property constraints and integrity constraints remove redundant data filtering and duplicate data.

[0022] Step (B), N-grams algorithm constructing grain situation dictionary, using N-grams algorithm acquisition of new words...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a grain condition named entity recognition method based on new word discovery and Flat-lattice. The method comprises the following steps: firstly, crawling grain condition related data from a known network and a grain dictionary by utilizing a python crawler technology to form a text corpus, and preprocessing corpora; then, obtaining new words from the grain condition text corpus by using an N-grams algorithm, assisting a word segmentation algorithm to perform word segmentation, and constructing a grain condition dictionary by using Word2vec according to a result after word segmentation; then, dividing the dictionary into 15 entity category labels, and performing BIOES labeling on the grain situation corpus according to the labels; thirdly, coding the input characters and all words which can be matched in the dictionary by adopting a Flat-lattice model, and inputting the coded input characters and all words into the model for training; and finally, performing prediction by using the trained deep learning model. According to the grain condition named entity recognition method based on new word discovery and Flat-lattice, grain condition entities can be effectively extracted from multi-source heterogeneous data, and a basis is provided for downstream tasks such as grain condition knowledge graph construction.

Description

Technical field [0001] The present invention is a field of natural language processing, in particular to a new word based on discovery of Flat-lattice and grain condition named entity recognition method. Background technique [0002] With the rapid development of information technology in the food industry, "information explosion" and "lack of knowledge" increasingly serious contradiction. The mass of grain condition record text data, papers and patents grain condition accumulation, efficiently and accurately excavated grain condition data from these entities, for follow-up studies, such as grain condition decision system constructed grain condition and mapping knowledge with for greater convenience. In extracting data from multiple sources in a heterogeneous specific physical process, named entity recognition (Named Entity Recognition, referred NER) is an indispensable technique. By named entity recognition technique, may be extracted from grain condition data showing informatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/295G06F40/242G06F40/117G06F16/33G06F16/951G06K9/62
CPCG06F40/295G06F40/242G06F40/117G06F16/3344G06F16/951G06F18/214
Inventor 肖乐李家馨葛亮吴涛段梦诗岳思雯陈啸林单昕
Owner HENAN UNIVERSITY OF TECHNOLOGY