Chinese reading difficulty grading method and system based on machine learning

A reading difficulty and machine learning technology, applied in the field of artificial intelligence, can solve the problems of not considering the characteristics of language age changes, unable to accurately reflect the structural nature, and language characteristics not enough to reflect the reading comprehension process, etc., and achieve accurate reading difficulty level classification. Effect

Inactive Publication Date: 2017-12-22
北京享阅教育科技有限公司
View PDF4 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The disadvantage of the existing technology is that it does not consider the characteristics of language changes with the times, and thus cannot update the Chinese character difficulty classification table and word frequency table; only using sentence length or word length as an indicator of complexity is too intuitive and cannot accurately reflect its Structural nature; a small number of shallow-level local language features are not enough to reflect the real reading comprehension process; reading difficulty classification technology is only applicable to English, and English itself and Chinese are very different in language features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese reading difficulty grading method and system based on machine learning
  • Chinese reading difficulty grading method and system based on machine learning
  • Chinese reading difficulty grading method and system based on machine learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] In order to enable those skilled in the art to better understand the technical solutions of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0066] Such as figure 1 As shown, a kind of Chinese reading difficulty grading method based on machine learning provided by the embodiment of the present invention comprises the following steps:

[0067] S101. Obtain training text and text to be detected;

[0068] The training text refers to the text used to train the classifier through the features constructed before training; the text to be detected refers to the text that needs to judge the reading difficulty level.

[0069] Such as figure 2 As shown, in step S101, the acquisition of training text includes the following steps:

[0070] S201. Construct a corpus according to multiple reading difficulty levels and updated Chinese texts;

[0071] S202. Retrieve corresponding text from the cor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese reading difficulty grading method and system based on machine learning. In the grading method, training samples can be updated in real time so that the feature that language changes along with times is taken into consideration and therefore the Chinese difficulty grading table and the word frequency table can be updated. The introduction of features such as semantic, sentences, texts and subjects makes it more objective by using the above features, sentence length and word length as the index of complexity and therefore structural property can be accurately reflected. By using feature set to make up for the lack of a few shallow local linguistic features, it can reflect the real process of reading comprehension and classify the reading difficulty level more accurately. By this method, the reading difficulty grading technique can be applied to Chinese, which accords with the language characteristics of Chinese. The grading system comprises a text obtaining unit, a constructing unit and a training and predicting unit, which realizes the same beneficial effects of the grading method for Chinese text reading difficulty.

Description

technical field [0001] The invention relates to the field of artificial intelligence, in particular to a method and system for grading Chinese reading difficulty based on machine learning. Background technique [0002] Artificial intelligence technology refers to understanding the essence of intelligence and producing a new kind of intelligent machine that can respond in a similar way to human intelligence. Artificial intelligence often achieves text reading difficulty classification through the combination of technologies such as natural language processing and machine learning and linguistic research results. [0003] Grading of reading difficulty can essentially be summarized as a measure of text readability. By definition, readability refers to the degree to which a text is easy to read and understand. Usually, highly readable text content matches the reader's background knowledge, and will appropriately paraphrase the above content and provide relevant knowledge; more...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06K9/62G06N99/00
CPCG06N20/00G06F40/211G06F40/216G06F40/253G06F18/2411G06F18/214
Inventor 任易赵梓淳
Owner 北京享阅教育科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products