A Topic Detection Method Based on Document Content and Interrelationships
A technique of interrelationship and detection method, which is applied in the direction of unstructured text data retrieval, network data retrieval, and other database retrieval, and can solve problems such as only considering document content
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0032] The present invention will be described in further detail below in conjunction with the examples, but the protection scope of the present invention is not limited thereto.
[0033] The invention relates to a topic detection method based on document content and interrelationships. The method includes the following steps.
[0034] Step 1: Obtain N documents, and preprocess the documents to obtain a document-feature co-occurrence matrix X and a pairwise relationship matrix R.
[0035] In the step 1, the preprocessing includes English text preprocessing and Chinese text preprocessing; the English text preprocessing includes word stem restoration and stop word elimination; the Chinese text preprocessing includes word segmentation and removal of low-frequency words.
[0036] In the present invention, the document-feature co-occurrence matrix X refers to a matrix based on documents and words.
[0037] In the present invention, the pairwise relationship matrix R represents the...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com