Open domain conference information named entity identification method and system

A named entity recognition technology, applied in character and pattern recognition, digital data information retrieval, instruments, etc., that addresses the poor portability of rules and the low accuracy of recognizing and extracting named entities from open domain texts

Active Publication Date: 2019-07-02
北京市科学技术研究院
AI Technical Summary

Problems solved by technology

[0004] Early named entity recognition and extraction relied mainly on rule-based methods. However, because named entity types vary widely and are complex across fields, rules built on linguistic knowledge port poorly to new domains and have limited coverage.
In recent years, with the rise of machine learning and deep learning, statistics-based methods have needed only labeled corpora for training. However, the features they derive statistically from a corpus are of low accuracy, and public labeled corpora are lacking in specific fields, so recognition and extraction on open-domain text have low accuracy.

Embodiment Construction

[0088] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0089] The object of the present invention is to provide a method and system for recognizing named entities of open domain conference information that can improve the recognition accuracy of named entities of open domain texts.

[0090] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific...

Abstract

The invention discloses a method and a system for identifying named entities in open domain conference information. The identification method specifically comprises the steps of: obtaining original text information of open domain conference data; converting the original text information into a plurality of numeric sequences, wherein each numeric sequence is one sentence; mapping each numeric sequence into word vectors through a word embedding layer; applying a named entity recognition model to the word vectors to obtain the optimal label combination index at each time step; converting the optimal label indices into tag names through a word list; synthesizing the tag names corresponding to the characters into word tags; and obtaining a conference name named entity and a conference place named entity according to the word tags. Because the method labels text at the character level, marking the first, middle and last characters of each entity type, the character labels can be combined into the label of a whole word, which avoids the influence of new words, different word segmentation tools and word segmentation errors on recognition and extraction.
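The pre- and post-processing steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the tag names (`B-CONF`, `M-CONF`, `E-CONF`, `B-LOC`, ...), the `ID_TO_TAG` table, the toy vocabulary and both function names are assumptions made for the example, and the named entity recognition model itself is not shown (its output is stood in for by a hand-written list of label indices).

```python
from typing import Dict, List, Tuple

# Hypothetical tag table: B-/M-/E- mark the first, middle and last character
# of an entity; "O" marks a character outside any entity.
ID_TO_TAG: Dict[int, str] = {
    0: "O",
    1: "B-CONF", 2: "M-CONF", 3: "E-CONF",   # conference name (assumed label)
    4: "B-LOC", 5: "M-LOC", 6: "E-LOC",      # conference place (assumed label)
}


def encode_sentence(sentence: str, char_to_id: Dict[str, int], unk_id: int = 1) -> List[int]:
    """Convert one sentence into a numeric (digital) sequence, one id per character."""
    return [char_to_id.get(ch, unk_id) for ch in sentence]


def merge_char_tags(
    chars: List[str],
    tag_ids: List[int],
    id_to_tag: Dict[int, str] = ID_TO_TAG,
) -> List[Tuple[str, str]]:
    """Map label indices to tag names, then join character tags into entity spans."""
    entities: List[Tuple[str, str]] = []
    buffer: List[str] = []
    current = None  # entity type of the span being built, if any
    for ch, tag_id in zip(chars, tag_ids):
        tag = id_to_tag[tag_id]
        if tag == "O":
            buffer, current = [], None
            continue
        position, entity_type = tag.split("-", 1)
        if position == "B":
            # Start a new span, dropping any unfinished one.
            buffer, current = [ch], entity_type
        elif buffer and current == entity_type:
            buffer.append(ch)
            if position == "E":
                entities.append(("".join(buffer), entity_type))
                buffer, current = [], None
        else:
            # M or E tag without a matching B tag: discard as malformed.
            buffer, current = [], None
    return entities


# Usage with a toy vocabulary and hand-written label indices.
vocab = {"<pad>": 0, "<unk>": 1, "A": 2, "C": 3, "L": 4}
ids = encode_sentence("ACL!", vocab)

chars = list("ACL in Rome")
predicted = [1, 2, 3, 0, 0, 0, 0, 4, 5, 5, 6]
found = merge_char_tags(chars, predicted)
```

Because every label is attached to a single character, no word segmentation tool is involved at any point, which is the property the abstract credits for robustness to new words and segmentation errors. The sketch only emits spans that begin with a B tag and end with a matching E tag; how single-character entities would be labeled is not stated in the source, so they are not handled here.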

Description

Technical field

[0001] The invention relates to the field of conference information retrieval, in particular to a method and system for identifying named entities of open domain conference information.

Background technique

[0002] With the rapid development of science and technology, there are more and more platforms and methods for academic exchanges among scientific and technological workers. Academic conferences are a platform for scientific and technological workers to introduce and share their scientific research work and achievements, as well as learn about research content and research results in related fields, by conducting academic lectures and publishing academic papers. Through academic conferences, it is possible to track the research directions and research hotspots in related fields, understand the research difficulties and key technical methods in the current research, and obtain instructive conclusions. In addition, tracking relevant information of academi...

Application Information

IPC(8): G06F16/35, G06F16/31, G06K9/62, G06F17/27, G06F16/951, G06F16/953, G06F16/9535
CPC: G06F40/242, G06F18/2411, G06F18/214
Inventor 熊蕊, 吴晨生
Owner 北京市科学技术研究院