Unlock instant, AI-driven research and patent intelligence for your innovation.

A Text Classification Method Based on Similarity Matching

A similarity matching and text classification technology, applied in the field of data processing, can solve the problems of poor classification effect and low text classification efficiency, and achieve the effect of improving accuracy

Active Publication Date: 2021-09-07
上海诺柱知识产权服务有限公司
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of the deficiencies in the above-mentioned prior art, the purpose of the present invention is to provide users with a text classification method based on similarity matching, which overcomes the defects of low text classification efficiency or poor classification effect in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Text Classification Method Based on Similarity Matching
  • A Text Classification Method Based on Similarity Matching
  • A Text Classification Method Based on Similarity Matching

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative efforts fall within the protection scope of the present invention.

[0052] The invention discloses a text classification method based on similarity matching, such as figure 1 , the method includes:

[0053] Step S101, the server receives the first text to be classified uploaded by the user.

[0054] The server receives the first text uploaded by the user through the client or directly on the server, and needs to identify the text category of the first text, and classify the text into the recognized text set.

[0055] Concretely, in the prese...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a text classification method based on similarity matching. The server receives the first text to be classified uploaded by the user, performs word frequency statistics on the first text, inputs the word frequency statistical results into the classification model, and identifies Find the first-level text category to which it belongs; According to the first-level text category, obtain multiple second texts corresponding to the first-level text category in the server; the server calculates the first text and each second text in turn The similarity between; determine whether the calculated maximum similarity exceeds the preset threshold; if exceeded, then classify the first text into the second-level text category to which the second text corresponding to the maximum similarity belongs; otherwise , classify the first text into the unrecognized text set. The text classification method disclosed by the invention adds a similar text matching step on the basis of the prior art, thereby improving the efficiency and accuracy of text classification.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a text classification method based on similarity matching. Background technique [0002] Modern society is a society of information explosion, and there are massive amounts of data on the Internet. [0003] In the existing technology, users may have the need to classify and store multiple texts. For example, electronic libraries need to classify texts according to their content for easy search, and patent documents need to classify texts to find and process related files. [0004] The document classification method in the prior art generally summarizes the core idea of ​​the manuscript after reading the manuscript manually, and then summarizes the keywords, and then classifies according to the type of the document, or simply classifies according to the word frequency, the former is inefficient , the latter method is too mechanical to achieve better classification results. [0005...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F16/33G06F40/194
CPCG06F40/194
Inventor 向湘杰
Owner 上海诺柱知识产权服务有限公司