Extraction method and device for web title

A web page title and title technology, applied in the field of retrieval, can solve problems such as reducing user experience and hindering searchers from obtaining the information content to be retrieved, so as to improve user experience, improve user experience, and solve technical problems.

Inactive Publication Date: 2013-02-13
ALIBABA GRP HLDG LTD
View PDF2 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when the search term appears after the truncation of the webpage title, there is no "red" information in the webpage title, and these webpage titles without "red" processing will be sorted to the rear of the entire search results, hindering searchers Quickly obtain the information content to be retrieved, greatly reducing the user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extraction method and device for web title
  • Extraction method and device for web title
  • Extraction method and device for web title

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 500

[0051] The several embodiments described above are all embodiments of the method of the present invention, and accordingly, the present invention also provides an embodiment of a device for extracting a web page title. See attached Figure 5 , the embodiment 500 of the web page title extraction device provided by the present invention includes: a search word position determining unit 501, a judging unit 502, a sentence break search unit 503, a first matching unit 504 and a result returning unit 505, wherein:

[0052] A search word position determining unit 501, configured to determine the position of the search word in the title of the webpage;

[0053] Judging unit 502, used to judge whether the length between the first character of the web page title and the last character of the search term is less than or equal to the preset title presentation length, if yes, trigger the result return unit; if not, trigger the sentence break search unit ;

[0054] Sentence breaker search...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an extraction method for a web title. The method comprises the following steps of: determining the position of a search term in the web title; judging whether a length from a first character of the web title to the last character of the search term is less than or equal to a preset title display length or not, searching for a segmentation character if the length from the first character of the web title to the last character of the search term is not less than or equal to the preset title display length, and returning corresponding characters as results when the character length of a certain segment of characters in the web title is less than or equal to the preset title display length and the characters comprise the integral search term and the found segmentation character. The invention also provides an extraction device for the web title. The displayed web title is high in readability, comprises much reserved core information, and has red marks, so that retrieved contents can be obtained quickly and conveniently by a retriever.

Description

technical field [0001] The invention relates to the technical field of retrieval, in particular to a method and device for extracting a web page title. Background technique [0002] With the development of Internet technology, network information is growing explosively. In the information ocean, people often rely on information retrieval technology to obtain specific information. By inputting the search term of the information to be understood, the search engine can present the content containing the search term in front of the searcher. The presentation form usually displays each search result item in the form of a web page title, and a paragraph containing the search term is attached under the title of the web page. Short text that allows people to click on the title to easily link to a detailed page containing the search term. In order to speed up the search and make it easier to read, the title of the webpage usually also "marks red" the search terms. However, as the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 陈宏杰张小洵薛贵荣
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products