A Chinese and English paper data classification and query method
A query method and data classification technology, which is applied in text database query, text database clustering/classification, unstructured text data retrieval, etc., can solve the problem of word segmentation not achieving the effect, Chinese and English integration is difficult to accurately identify, cross-language query It is difficult to achieve the expected effect and other problems, to achieve the effect of improving retrieval accuracy and improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] Embodiment 1: Aiming at the defects and problems of the current data classification that the word segmentation cannot achieve the effect, the fusion of Chinese and English is difficult to accurately identify, and the cross-language query is difficult to achieve the expected effect, the present invention provides a method based on the construction of how to unify the labels of papers in Chinese and English. Chinese and English paper data classification and query method to improve the accuracy of cross-language query. The method includes the following contents.
[0031] Step 1. First, according to the Chinese and English keywords included in the Chinese papers when they were published, traverse the original data of the Chinese papers and extract the Chinese and English keywords in all Chinese papers.
[0032] Then, exclude the abnormal data in Chinese and English keywords, mainly exclude the lack of Chinese or English keyword data, and aggregate the results of Chinese tra...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


