Method and device for processing text-related structural data
A technology of structured data and processing methods, applied in the field of the Internet, can solve the problems of difficulty in displaying webpages on mobile devices, inability to see webpages, and support.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0078] figure 1 It is a flowchart of a method for processing text-related structured data according to Embodiment 1 of the present invention. Such as figure 1 As shown, the execution subject of the method for processing text-related structured data in this embodiment may specifically be a text-related structured data processing device. The method for processing text-related structured data in this embodiment may specifically include the following steps:
[0079] 100. Perform block processing on the nodes in the Document Object Model (DOM) tree of the web page according to the preset candidate block node types to obtain several candidate block nodes;
[0080] The type of the candidate block node in this embodiment is the node type corresponding to the label used to store the body of the webpage; for example, the label for storing the body of the webpage in the prior art may be a DIV label or a TABLE label, and the corresponding one is used for storage. The node type corresponding t...
Embodiment 2
[0126] Figure 4 It is a schematic structural diagram of an apparatus for processing structured data related to text provided in the second embodiment of the present invention. Such as Figure 4 As shown, the apparatus for processing structured data related to text in this embodiment may specifically include: a block processing module 10, a filtering module 11, a data extraction module 12, and a display module 13.
[0127] The block processing module 10 is used to block the nodes in the DOM tree of the web page according to the preset candidate block node types to obtain several candidate block nodes; the type of the candidate block nodes is used for storage The node type corresponding to the label of the main text of the webpage; the filtering module 11 is connected to the block processing module 10, and the filtering module 11 is used to filter out several candidate block nodes processed by the block processing module 10 that store the main text of the webpage A candidate block...
Embodiment 3
[0131] Figure 5 It is a schematic structural diagram of processing text-related structured data provided in the third embodiment of the present invention. Figure 5 The apparatus for processing structured data related to the text of the illustrated embodiment is described above Figure 4 On the basis of the illustrated embodiment, the following technical solutions may also be included.
[0132] Such as Figure 5 As shown, the apparatus for processing structured data related to the text of this embodiment further includes an integration module 14 and / or a packaging module 15. Figure 5 The illustrated embodiment takes the integration module 14 and the packaging module 15 as an example.
[0133] The integration module 14 can be connected to the segmentation processing module 10 and the filtering module 11; the integration module 14 is used for the segmentation processing module 10 to perform processing on the nodes in the document object model tree of the web page according to the pre...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com
