Check patentability & draft patents in minutes with Patsnap Eureka AI!

Text data processing method, apparatus and device, and medium

A technology of text data and processing methods, applied in electronic digital data processing, natural language data processing, instruments, etc., can solve the problems of undetectable relationship, error accumulation, and wrong relationship classification results.

Active Publication Date: 2020-03-24
SOUNDAI TECH CO LTD
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] When using the above relationship classification method for attribute extraction, the object must be an entity that can be recognized by the NER model. If the model cannot recognize a certain type of entity, then the relationship related to this type of entity cannot be detected.
Moreover, errors will accumulate in the NER process. If there is a recognition error, the result of the relationship classification must be wrong. If the traditional machine learning model is adopted, it is also necessary to extract features. The information that can be contained in the features depends on manual experience.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text data processing method, apparatus and device, and medium
  • Text data processing method, apparatus and device, and medium
  • Text data processing method, apparatus and device, and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0094] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some embodiments of the present invention, rather than all embodiments . Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0095] The following is an explanation of some words that appear in the text:

[0096] 1. The term "and / or" in the embodiments of the present invention describes the association relationship of associated objects, indicating that there may be three types of relationships, for example, A and / or B, which may mean: A exists alone, A and B exist simultaneously, and There are three cases of B. The character " / " generally indicates that t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a text data processing method, apparatus and device, and a medium, for improving the accuracy and efficiency of attribute value labeling, reducing the labor cost and avoidingerror accumulation. The text data processing method comprises the steps of converting a target text into a text vector sequence based on a predetermined attribute vector sequence used for representingan attribute sequence in the target text; and inputting the text vector sequence into a sequence labeling model used for labeling word attributes in a text, and labeling attributes of each word in the target text in a label form, the attributes being entity attributes of entities corresponding to subjects in the target text.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to a text data processing method, device, equipment and medium. Background technique [0002] In the context of the large-scale emergence of artificial intelligence technology and applications, the use of triples to represent knowledge is the basis of knowledge graphs and a powerful driving force for the development of artificial intelligence technology. A piece of knowledge expressed in the form of a triple (for example, Zhang San, date of birth, January 18, 1979) expresses the attribute of "date of birth" of the entity "Zhang San", and the three parts of the triple are called the subject , predicate, object. Introductory articles usually describe the same subject entity and densely introduce a large number of attributes about the subject entity. How to extract the attribute values ​​(knowledge triples) of structured representations from the natural la...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/289G06F40/30
Inventor 高丛苏少炜陈孝良常乐
Owner SOUNDAI TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More