A Semantic Classification Method of Network Text Based on Baidu Encyclopedia
A Baidu Encyclopedia, network text technology, applied in the field of network text semantic classification, can solve problems such as a large amount of training data, inability to process, and inability to train data exhaustively.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0043] Each open classification of Baidu Encyclopedia entries is a semantic topic. A meaningful Chinese text expresses the specific semantic theme to be expressed through certain phrases. It exists in the form of encyclopedia entries in Baidu Encyclopedia, which are referred to as entries below. By observing and analyzing the relationship between text, lexical entries and semantic topics, we have the following basic points of view:
[0044] Viewpoint 1. Entries are the extension of knowledge relations. The basic unit used to express content in Chinese natural language is the entry. Entries have the characteristics of polysemy, variety, and non-exhaustiveness. They are the extension of knowledge relations and are what the text wants to express. The external representation of meaning. Therefore, the traditional method of training and classifying in the form of statistical entries often requires a large amount of training data, and cannot deal with new and new words that do not...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 