Chinese parallelism sentence recognition method and system

A recognition method and alignment technology, applied in unstructured text data retrieval, text database clustering/classification, special data processing applications, etc. It addresses omissions in alignment and granularity and supports automatic article scoring and evaluation systems.

Active Publication Date: 2018-04-20
CAPITAL NORMAL UNIVERSITY

AI Technical Summary

Problems solved by technology

However, current related technology in natural language processing has omissions in terms of alignment and granularity.



Embodiment Construction

[0039] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0040] In describing the present invention, it should be understood that the orientations or positional relationships indicated by terms such as "center", "longitudinal", "transverse", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner" and "outer" are based on the orientations or positional relationships shown in the drawings, and are only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the device or element refe...



Abstract

The invention provides a Chinese parallelism sentence recognition method and system. The method comprises the following steps: a target dataset comprising multiple types of articles is selected; every two sentences in the current article form a sentence pair, and features of different dimensions are constructed for the resulting sentence pairs; a classification model is built with the random forest method, taking the constructed features as model input, and the features are screened against preset evaluation indexes to obtain the best group of features for training the model; the trained model then extracts parallelism sentences from a new test set, and the extracted sentence pairs are merged into sentence chunks for evaluation, so that the classification of sentence pairs is extended to the classification of sentence chunks. With this method and system, the parallelism sentences in articles can be effectively recognized and classified, which helps advance automatic article scoring and evaluation systems.
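The pairing and chunk-merging steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the two toy features, and the use of union-find for merging are all assumptions made for illustration. The random forest classifier is not reproduced here; a hand-written list of "positive" pairs stands in for its output.

```python
from itertools import combinations


def make_pairs(sentences):
    """Return every unordered pair of sentence indices within one article."""
    return list(combinations(range(len(sentences)), 2))


def pair_features(a, b):
    """Toy features for one sentence pair (hypothetical, not the patent's feature set)."""
    if not a or not b:
        return {"len_ratio": 0.0, "common_prefix": 0}
    # Ratio of the shorter length to the longer length, in characters.
    len_ratio = min(len(a), len(b)) / max(len(a), len(b))
    # Number of identical leading characters, a crude cue for shared structure.
    common_prefix = 0
    for x, y in zip(a, b):
        if x != y:
            break
        common_prefix += 1
    return {"len_ratio": len_ratio, "common_prefix": common_prefix}


def merge_pairs_into_chunks(n, positive_pairs):
    """Merge pairs judged positive into sentence chunks using union-find.

    n is the number of sentences; positive_pairs is a list of (i, j) index
    pairs. Here it is supplied by hand in place of a trained classifier.
    """
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in positive_pairs:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[max(ri, rj)] = min(ri, rj)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    # Only groups with at least two sentences count as chunks.
    return [g for g in groups.values() if len(g) > 1]


if __name__ == "__main__":
    sentences = ["我爱春天的花", "我爱夏天的雨", "我爱秋天的风", "今天天气很好"]
    pairs = make_pairs(sentences)
    features = [pair_features(sentences[i], sentences[j]) for i, j in pairs]
    # Stand-in for classifier output: pairs (0, 1) and (1, 2) are positive.
    print(merge_pairs_into_chunks(len(sentences), [(0, 1), (1, 2)]))  # [[0, 1, 2]]
```

Merging by shared membership means that two positive pairs with a sentence in common, such as (0, 1) and (1, 2), end up in a single three-sentence chunk. That is one plausible reading of extending sentence-pair classification to sentence-chunk classification.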

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, in particular to a Chinese parallelism sentence recognition method and system.

Background

[0002] Parallelism is a very important rhetorical device. A parallelism sentence is defined as a sentence pair or sentence group composed of two or more sentences that are semantically related, similar in tone, and identical in structure. Using parallelism sentences adds balance and rhythm to speech or writing, making it more vivid and persuasive. It also makes the point of the whole article clearer, because the sentences combined in similar form carry equal weight in the article. Proper use of parallelism sentences can therefore improve the quality of a text.

[0003] Parallelism sentence recognition is a brand-new topic, and there are many factors to consider. Since parallelism sentences have the characteristi...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/35, G06F40/205
Inventor: 宋巍, 刘彤, 刘丽珍
Owner: CAPITAL NORMAL UNIVERSITY