Word Classification Method in Text, Speech Creativity Evaluation Method and System

A word classification method and creativity evaluation technology, applied in text database clustering/classification, natural language data processing, unstructured text data retrieval, etc. It addresses the inability of existing methods to be applied to verbal creative thinking tests, and achieves a friendly interface, accurate classification results, and improved classification accuracy.

Active Publication Date: 2022-03-08
HOHAI UNIV
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

Existing methods cannot be applied to verbal creative thinking, especially divergent thinking tests.

Examples

Embodiment 1

[0034] Figure 1 shows a flow chart of the method for classifying words in a text disclosed by the present invention, which includes the following steps:

[0035] Step 1. Read the text line by line, split each line of text data using regular expressions, and filter out punctuation marks and numbers to obtain phrases and words;

[0036] Splitting each line of text data using regular expressions includes defining specific characters and combinations of specific characters to form a rule string, searching for text that matches one or more rule strings, and filtering the text data;

[0037] Punctuation marks such as ",", ".", and ";" between phrases in the text are automatically converted into spaces to obtain the corresponding phrases.
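Step 1 can be illustrated with a minimal Python sketch. The rule string below is an assumption chosen for illustration, not the pattern used in the patent: it treats every punctuation character, digit, and underscore as a separator, replaces each with a space, and then splits on whitespace. The function names `split_line` and `read_phrases` are hypothetical.

```python
import re

# Hypothetical rule string: any character that is neither a word character
# nor whitespace (i.e. punctuation), plus digits and underscores.
RULE = re.compile(r"[^\w\s]|[\d_]")


def split_line(line):
    """Replace punctuation and digits with spaces, then split into phrases."""
    return RULE.sub(" ", line).split()


def read_phrases(lines):
    """Process text line by line and collect all phrases and words."""
    phrases = []
    for line in lines:
        phrases.extend(split_line(line))
    return phrases
```

Because `\w` is Unicode-aware in Python 3, the same pattern handles full-width Chinese punctuation as well as ASCII punctuation.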

[0038] Step 2. Further segment the phrases and words obtained in step 1 and filter out stop words to obtain simple words. Assume that a total of L simple words are obtained; the word frequency of each simple word is counted;
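The following sketch shows one way Step 2 might look in Python. It is illustrative only: the `segment` function is a whitespace-splitting stand-in for the word segmenter named in the abstract, and the stop-word list is a tiny hypothetical sample.

```python
from collections import Counter

# Hypothetical stop-word list; a real list would be far larger.
STOP_WORDS = {"the", "a", "of", "and"}


def segment(phrase):
    """Stand-in segmenter (assumption): split on whitespace.

    For Chinese text a dictionary-based word segmenter would be used here.
    """
    return phrase.split()


def count_simple_words(phrases):
    """Segment phrases, drop stop words, and count each simple word."""
    words = [
        word
        for phrase in phrases
        for word in segment(phrase.lower())
        if word not in STOP_WORDS
    ]
    return Counter(words)
```

With this representation, L is simply `len(count_simple_words(phrases))`, and `most_common()` on the result lists the simple words in descending order of frequency.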

[0039] In the present invention, phrases and...

Embodiment 2

[0049] Usually, the classification result in Embodiment 1 meets the requirements, and the user does not need to process it further. Where higher accuracy is required, the user can manually intervene in the classification. In this embodiment, after the preliminary classification in step 3, the user can refine the classification: by designing similar-word texts, the M classes of words obtained by the preliminary classification are merged into N classes, N≤M. Step 4 then operates on the refined result: for the result of the user's refinement, select the word with the highest word frequency in each class as the topic of that class.
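A minimal sketch of the merge described in this paragraph follows. The data layout is an assumption: the M preliminary classes are held in a dictionary keyed by a class label, and the user's similar-word text is represented as a list of label groups to be merged. The names `merge_classes` and `pick_topic` are hypothetical.

```python
def merge_classes(classes, similar_groups):
    """Merge M preliminary classes into N <= M classes.

    classes: dict mapping a class label to its list of words.
    similar_groups: user-supplied lists of labels that belong together.
    """
    merged, used = [], set()
    for group in similar_groups:
        words = []
        for label in group:
            if label in classes and label not in used:
                words.extend(classes[label])
                used.add(label)
        if words:
            merged.append(words)
    # Classes the user did not mention are kept unchanged.
    for label, words in classes.items():
        if label not in used:
            merged.append(list(words))
    return merged


def pick_topic(words, freq):
    """Select the word with the highest frequency as the class topic."""
    return max(words, key=lambda word: freq.get(word, 0))
```

Calling `pick_topic` on each merged class corresponds to the modified step 4: the topic is chosen after the user's merge rather than from the preliminary classes.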

[0050] In this embodiment, the similar word text i...

Embodiment 3

[0054] The present invention also discloses a verbal creativity evaluation method that uses the above method for classifying words in a text. It is based on the consensual assessment technique, or consensus evaluation principle, of creativity testing, and scores three aspects: fluency, originality, and flexibility. It includes the following steps:

[0055] (S1) Acquire the speech text input by the user;

[0056] (S2) Classify the speech text using the above method for classifying words in a text;

[0057] (S3) Calculate originality, fluency, and flexibility statistics from the speech text classification results to obtain the user's creativity evaluation result; the frequency of occurrences of words entered by the user group;

[0058] The fluency is the sum of the number of words of all categories in the speech classification result;

[0059] The flexibility is the number of categories in the speech classification results.
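The fluency and flexibility definitions above translate directly into code. The sketch below assumes the classification result is a dictionary mapping each category to the list of words assigned to it, and that only non-empty categories count toward flexibility; both are assumptions. Originality is omitted because its definition is not fully reproduced here.

```python
def fluency(classified):
    """Sum of the number of words over all categories."""
    return sum(len(words) for words in classified.values())


def flexibility(classified):
    """Number of categories that contain at least one word (assumption)."""
    return sum(1 for words in classified.values() if words)
```

For example, a result with three categories holding two, one, and three words gives a fluency of 6 and a flexibility of 3.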

...

Abstract

The present invention discloses a method for classifying words in a text, and a method and a system for evaluating speech creativity. The method for classifying words in a text includes the following steps: 1. Read the text line by line and split each line of text data using regular expressions to obtain phrases and words; 2. Further segment the phrases and words obtained in step 1 using Jieba word segmentation to obtain simple words; 3. Set the classification parameters and obtain candidate topics according to word frequency for preliminary classification; 4. Select the word with the highest word frequency in each class as the topic of that class; 5. For each class, traverse all words in the class and judge whether each belongs to the class topic; if so, assign it to that topic, otherwise assign it to the low-frequency word set; 6. Further divide the low-frequency word set using a word2vec model; 7. Compile statistics on the classification results. This word classification method is suited to scenarios where words or phrases appear independently or in isolation, rather than in the form of passages or sentences.
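The abstract does not spell out how the word vectors are used in step 6. One plausible reading, sketched below under that assumption, is to assign each low-frequency word to the class topic whose vector is most similar by cosine similarity. The `vectors` dictionary is a toy stand-in for a trained word2vec model, and `assign_low_frequency` is a hypothetical name.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def assign_low_frequency(low_freq_words, topics, vectors):
    """Map each low-frequency word to its most similar topic.

    vectors: dict word -> vector, standing in for a trained model.
    Words without a vector are left unassigned (None).
    """
    known_topics = [topic for topic in topics if topic in vectors]
    assignment = {}
    for word in low_freq_words:
        if word not in vectors:
            assignment[word] = None
            continue
        assignment[word] = max(
            known_topics,
            key=lambda topic: cosine(vectors[word], vectors[topic]),
            default=None,
        )
    return assignment
```

A similarity threshold could be added so that words far from every topic stay in a residual set; whether the patent does this is not stated here.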

Description

Technical field

[0001] The invention belongs to the fields of data processing, machine learning, and classification, and in particular relates to a method for classifying words in a text, and a speech creativity evaluation method and system.

Background technique

[0002] Classification, estimation, prediction, affinity grouping or association rules, clustering, description and visualization, and complex data type mining (text, Web, graphic image, video, audio, etc.) all belong to data mining technology. Classification is a fundamental machine learning task. Classifying and analyzing things can determine their categories or the correlations between them, and can divide similar or different things into appropriate categories or groups according to the similarity or dissimilarity of their characteristics.

[0003] In the existing technology, the vector space model text similarity calculation method based on TF-IDF (Term Frequency–Inverse Documen...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/35, G06F40/289, G06F40/216
CPC: G06F40/284
Inventor: 沈汪兵, 邵美玲
Owner: HOHAI UNIV