Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method and device for extracting entity from text sequence

A text sequence and entity technology, applied in text database query, unstructured text data retrieval, special data processing applications, etc., can solve the problems of semantic information deviation and low entity recognition accuracy, and achieve the effect of improving accuracy

Active Publication Date: 2021-12-17
航天宏康智能科技(北京)有限公司
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of the fact that the existing entity recognition method has the problem that the accuracy of entity recognition is not high and the extracted semantic information has a large deviation, this application provides a method and device for extracting entities from text sequences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting entity from text sequence
  • Method and device for extracting entity from text sequence
  • Method and device for extracting entity from text sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025]In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are only It is a part of the embodiments of this application, not all of them. The components of the embodiments of the application generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments of the application provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments of the present application, every other embodiment obtained by those skilled in the art without...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and a device for extracting an entity from a text sequence. The method comprises the following steps: acquiring the text sequence; calculating a first entity position probability of each character in the text sequence based on the text sequence; determining a probability mean value based on the first entity position probability; comparing each first entity position probability with a probability mean value, determining candidate characters, and adding position identifiers of the candidate characters to a first entity position list; and on the basis of the first entity position list, determining a character appearing at a predetermined reference position so as to extract a first entity from the text sequence. According to the method and the device for extracting the entity from the text sequence, the problem of relatively large deviation of semantic information extraction caused by low entity recognition accuracy is solved, and the entity position probability mean value of the characters of the whole text sequence can be counted based on the entity position probability of each character in the text sequence; therefore, the position of the entity is determined more accurately, and the accuracy of entity extraction is improved.

Description

technical field [0001] The present application relates to the field of natural language processing, and more specifically, relates to a method and device for extracting entities from text sequences. Background technique [0002] With the rapid development of Internet technology, the demand for processing natural language text data has surged, and obtaining valuable semantic information from text data has always been one of the key tasks of research. [0003] In the semantic information processing of text data, it is usually necessary to extract the entities contained in the text data and the relationship information between entities. Here, in natural language processing, an entity refers to a collection of specific things. During the extraction process, it is usually necessary to determine the positions of several entities in the text data and the relationship between entities, so as to obtain semantic information. Therefore, the accuracy of entity extraction will affect th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F40/295G06F40/30
CPCG06F16/3344G06F16/3346G06F40/295G06F40/30
Inventor 郑俊康经小川王潇茵张家华丁醒醒
Owner 航天宏康智能科技(北京)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products