Method and system for segmenting text paragraphs
A technology for paragraphs and texts, applied in the field of text paragraph slicing methods and systems, can solve problems such as spending a lot of time and difficult to merge small paragraphs into semantic paragraphs, and achieve the effect of accurate semantic information
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0028] Such as figure 1 As shown, the embodiment of the present invention proposes a text paragraph slicing method, including the following steps:
[0029] Step 101, acquire Internet data from a storage system, wherein the Internet data includes HTML (HyperText Mark-up Language, hypertext markup language) text, title, meta and anchor text. HTML is currently the most widely used language on the Internet, and it is also the main language that constitutes web documents. It is a descriptive text composed of HTML commands. HTML commands can explain text, graphics, animations, sounds, tables, links, etc.; the structure of HTML files includes The header (title) and the body (meta) are two parts. The header describes the information required by the browser, while the body contains the specific content to be explained. Anchor text is the text part of a hyperlink on a web page, which is an important factor affecting the ranking of a web page search engine. Anchor text refers to a web p...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 