E-Science environment-oriented multi-domain Web text feature extracting system and method
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- UNIV OF SCI & TECH BEIJING
- Publication Date
- 2013-12-11
- Estimated Expiration
- Not applicable · inactive patent
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The invention relates to feature extraction of Web text, in particular to a multi-field Web text feature extraction system and method for e-Science environment. Background technique
[0002] Khaled Khelif (2007) proposed an ontology-based information extraction method, aiming to help biologists acquire professional knowledge more effectively. This method relies on semantic annotation of scientific and technological documents, automatically generates domain ontology and provides corresponding information retrieval interface. Tara McIntosh (2007) proposed a full-text information extraction system for the biomedical field to solve the shortcomings of the traditional analysis methods based on literature summarization. ZiyaOzkan Gokturk and Nihan Kesim Cicekli et al. (2007) used web crawler technology to extract and classify web page metadata using pre-set regular expressions. In the experiment, taking the European Cup and the UEFA Champions League as exa...