Unlock instant, AI-driven research and patent intelligence for your innovation.

Character action related data extraction method, device and equipment, and storage medium

A technology related to data and extraction methods, which is applied in the fields of electronic digital data processing, natural language data processing, and digital data information retrieval, etc. It can solve problems such as not meeting the requirements of data extraction, large errors in extraction methods, and unsatisfactory extraction results.

Pending Publication Date: 2021-04-02
ONE CONNECT SMART TECH CO LTD SHENZHEN
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In recent years, driven by big data and deep learning, natural language processing technology has developed rapidly. At present, there are roughly two types of subject-predicate-object extraction algorithms for text data, one is based on deep learning, and the other is based on language The rule-based method and the method based on deep learning require a large amount of labeled data, and the extraction effect of the language description related to the action of the character is not ideal, while the extraction method based on the language rule has a large error, which does not meet the requirements of the data extraction related to the action of the character. demand, and the extracted data is noisy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character action related data extraction method, device and equipment, and storage medium
  • Character action related data extraction method, device and equipment, and storage medium
  • Character action related data extraction method, device and equipment, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Embodiments of the present invention provide a method, device, device and storage medium for extracting data related to character actions. The Chinese natural language processing HanLP algorithm is used to perform syntax analysis and part-of-speech tagging on text data, and based on the grammatical relationship and modality of subject, predicate and object The verb filters out the relevant data of the action that is taking place, which improves the accuracy of data extraction and reduces the noise of the extracted data set.

[0028] The terms "first", "second", "third", "fourth", etc. (if present) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It is to be understood that data so used may be interchanged under appropriate circumstances so that the embodiments described herein can be practiced in sequences other than those...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of artificial intelligence, and discloses a character action related data extraction method, device and equipment, and a storage medium, which are used for performing syntactic analysis and part-of-speech tagging on text data through a Chinese natural language processing HanLP algorithm and screening out related data of behavior actions occurring currently, thereby improving the accuracy of data extraction, and improving the accuracy of data extraction and reducing the noise of the extracted data set. The character action related data extraction method comprises the steps of obtaining preset text data, performing classification processing on preset text data, and screening out text data containing figure information to obtain initial text data, performingword segmentation processing and part-of-speech tagging on the initial text data to generate intermediate text data, performing dependency syntactic analysis and semantic dependency analysis on the intermediate text data to generate analysis text data, and filtering the analysis text data to generate target text data. In addition, the invention also relates to a blockchain technology, and the target text data can be stored in the blockchain.

Description

technical field [0001] The invention relates to the field of natural language processing, and in particular, to a method, device, device and storage medium for extracting data related to a character's action. Background technique [0002] Natural language processing includes two parts: natural language understanding and natural language generation. Realizing natural language communication between humans and machines means that computers can not only understand the meaning of natural language texts, but also express given intentions and texts in natural language texts. Thoughts, etc., the former is called natural language understanding, the latter is called natural language generation, and natural language processing is an important direction in the field of computer science and artificial intelligence. Among them, the Chinese natural language processing HanLP algorithm is a text data extraction algorithm , including word segmentation, part-of-speech tagging, and entity recog...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/335G06F40/211G06F40/253G06F40/284G06F40/289G06F40/30
CPCG06F16/355G06F16/335G06F40/289G06F40/284G06F40/253G06F40/211G06F40/30
Inventor 蔡壮壮
Owner ONE CONNECT SMART TECH CO LTD SHENZHEN