Modern Chinese literary work author identification system and method

A technology for identifying the authors of modern Chinese written works, applied in character and pattern recognition, natural language translation, text database clustering/classification, etc. It addresses the fact that no mature identification technology has yet been reported, and it achieves high discrimination accuracy and fast discrimination with few training samples.

Pending Publication Date: 2020-11-13
施建军

AI Technical Summary

Problems solved by technology

Existing work consists only of academic explorations; no mature technology for identifying the authors of modern Chinese written works has been reported.



Embodiment Construction

[0039] The present invention is further described below with reference to the accompanying drawings and embodiments.

[0040] As shown in Figure 1, a system for identifying the authors of modern Chinese written works includes: a known-author and unknown-author sample data processing module, a sample writing feature extraction and vectorization module, a multi-layer neural network model training module, and an "anonymous written works" author discrimination module. The known-author and unknown-author sample data processing module processes modern Chinese text works whose authors are known or unknown, labels each sample as known-author or unknown-author, and produces the training sample data and discrimination sample data required by the author identification system: samples with known authors are marked as training data, and samples with unknown authors are marked as data to be discriminated. The writing feature extraction and vectorization modu...
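The labeling step performed by the sample data processing module in [0040] can be illustrated with a minimal sketch. The names `Sample`, `role`, and `split_samples`, and the sample texts, are hypothetical and do not come from the patent; the sketch only assumes that a sample with a known author becomes training data and a sample without one becomes data to be discriminated.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Sample:
    text: str
    author: Optional[str]  # None means the author is unknown (hypothetical convention)

    @property
    def role(self) -> str:
        # Known-author samples are training data; unknown-author samples
        # are data to be discriminated.
        return "train" if self.author is not None else "discriminate"


def split_samples(samples: List[Sample]) -> Tuple[List[Sample], List[Sample]]:
    """Separate training samples from samples awaiting discrimination."""
    train = [s for s in samples if s.role == "train"]
    pending = [s for s in samples if s.role == "discriminate"]
    return train, pending


# Illustrative placeholder corpus, not data from the patent.
corpus = [
    Sample("第一篇作品的正文", "Author A"),
    Sample("第二篇作品的正文", "Author B"),
    Sample("匿名作品的正文", None),
]
train_set, pending_set = split_samples(corpus)
print(len(train_set), len(pending_set))
```

Keeping the label on the sample itself means later stages can treat both kinds of sample uniformly and only branch on `role`.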


Abstract

The invention discloses a system and method for identifying the authors of modern Chinese literary works. The system comprises a known-author and unknown-author sample data processing module, a sample writing feature extraction and vectorization module, a multi-layer neural network model training module, and an "anonymous literary work" author discrimination module. The sample data processing module produces the training sample data and discrimination sample data required by the author identification system. The writing feature extraction and vectorization module extracts language features that reflect an author's writing habits and builds training sample vectors and discrimination sample vectors from them. The multi-layer neural network model training module trains a multi-layer neural network model on the training sample vectors to establish a discrimination model. The discrimination module uses the discrimination model to identify the author of an "anonymous literary work" from its writing-habit language feature vector. According to the invention, the accuracy of identifying the author of an "anonymous literary work" within a specified range of candidate authors is very high and can approach 100%.
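The flow in the abstract (vectorize writing-habit features, train a multi-layer network, then discriminate an anonymous text) can be sketched as follows. Everything concrete here is an assumption made for illustration: the visible text does not say which language features are used, so the sketch uses relative frequencies of five common function characters (`FUNCTION_CHARS`); the network size, learning rate, toy sentences, and author labels are likewise hypothetical. It is a pure-Python toy, not the patented model.

```python
import math
import random
from collections import Counter

# Assumed feature set: relative frequencies of a few function characters.
FUNCTION_CHARS = ["的", "了", "是", "在", "和"]


def vectorize(text):
    """Map a text to the relative frequency of each character in FUNCTION_CHARS."""
    counts = Counter(text)
    total = max(len(text), 1)
    return [counts[c] / total for c in FUNCTION_CHARS]


def softmax(zs):
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    s = sum(exps)
    return [e / s for e in exps]


class TinyMLP:
    """One tanh hidden layer followed by a softmax output layer."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_out)]
        self.b2 = [0.0] * n_out

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        logits = [sum(w * hi for w, hi in zip(row, h)) + b
                  for row, b in zip(self.w2, self.b2)]
        return h, softmax(logits)

    def train_step(self, x, y, lr):
        """One stochastic gradient step on the cross-entropy loss."""
        h, p = self.forward(x)
        # Gradient with respect to the output logits.
        d_out = [pk - (1.0 if k == y else 0.0) for k, pk in enumerate(p)]
        # Backpropagate through tanh (derivative 1 - h^2) before updating w2.
        d_hid = []
        for j, hj in enumerate(h):
            back = sum(d_out[k] * self.w2[k][j] for k in range(len(d_out)))
            d_hid.append((1.0 - hj * hj) * back)
        for k, dk in enumerate(d_out):
            for j, hj in enumerate(h):
                self.w2[k][j] -= lr * dk * hj
            self.b2[k] -= lr * dk
        for j, dj in enumerate(d_hid):
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * dj * xi
            self.b1[j] -= lr * dj

    def predict(self, x):
        _, p = self.forward(x)
        return max(range(len(p)), key=p.__getitem__)


# Toy known-author samples (invented): label 0 favors "的", label 1 favors "了".
AUTHORS = ["Author A", "Author B"]
TRAINING = [
    ("我的书的名字是好的", 0),
    ("他的家的门是红的", 0),
    ("他走了又来了又去了", 1),
    ("天黑了雨停了风也小了", 1),
]

model = TinyMLP(n_in=len(FUNCTION_CHARS), n_hidden=4, n_out=len(AUTHORS))
for _ in range(2000):
    for text, label in TRAINING:
        model.train_step(vectorize(text), label, lr=0.5)

anonymous = "花开了鸟叫了春天来了"
print(AUTHORS[model.predict(vectorize(anonymous))])
```

The discrimination step only ever chooses among the authors seen in training, which matches the abstract's qualification that identification happens "within a specified range".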

Description

Technical field

[0001] The present invention relates to a system and method for identifying the authors of modern Chinese written works within a given range of candidate authors.

Background

[0002] Questions of authorship have surrounded Chinese writing since ancient times. All four of the great classical Chinese novels have disputed authorship, and the case of "A Dream of Red Mansions" is the best known. Who wrote "A Dream of Red Mansions"? Were the first 80 chapters and the last 40 chapters written by the same author? These questions have long been debated in academic circles and have also attracted wide public attention. Over the years, scholars have used various methods to study the authorship of "A Dream of Red Mansions". Among them, the textual research method advocated by Hu Shi and the statistical method advocated by Bernhard Karlgren (Gao Benhan) are the most representative.

[0003] In re...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06F40/44, G06K9/62, G06N3/04
CPC: G06F16/355, G06F40/44, G06N3/045, G06F18/241, G06F18/214
Inventor: 施建军
Owner: 施建军