Label automatic generation method based on meta-search engine

A meta-search engine and automatic generation technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problems of mixed information resources and search engines that cannot meet the individual needs of users, so as to ensure the recall rate, Guaranteed precision effect

Inactive Publication Date: 2017-05-17
HUNAN UNIV OF SCI & ENG
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In recent years, with the rapid development of the Internet industry and the maturity of search engines, various search engines have become a useful tool for people to obtain information. Wit

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Label automatic generation method based on meta-search engine
  • Label automatic generation method based on meta-search engine
  • Label automatic generation method based on meta-search engine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Based on the existing TextRank algorithm, the present invention proposes an improved TextRank algorithm to generate labels. This method consists of three stages, which are text preprocessing optimization, information amount calculation, and label extraction.

[0036] Algorithm improvement ideas: firstly, text preprocessing optimization, while performing Chinese word segmentation, retain the basic information of words, including part of speech, word position, word frequency, to form a five-tuple; secondly, word filtering, remove stop words, and perform part-of-speech filtering. Retain nouns, verbs, and gerunds based on experience to reduce noise interference; recalculate word information again, calculate word position score, word frequency, and word span through the statistical basic information of words, and calculate the comprehensive score as the weight of words; finally calculate words The similarity between them is used as the weight of the edge in the TextRank algo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a label automatic generation method based on a meta-search engine. The method comprises the steps that firstly, text preprocessing optimization is conducted, Chinese word segmentation is conducted and meanwhile, basis information of words is saved and the basic information comprises part of speech, word position, word frequency of which quintuple is composed; secondly, the words are filtered, stop words are removed, part of speech filtration is conducted, and according to experience, noun, verb and gerund are kept and noise disturbance is reduced; word information quantity is recalculated again, by counting the word basic information, word position score, word frequency and word span are calculated and comprehensive score is calculated as weight of the words; finally, the similarity between words is calculated as edge weight in TextRank algorithm and the TextRank algorithm is used for calculating TextRank value of each word. According to the label automatic generation method based on the meta-search engine, the meta-search engine technology and automatic generation label are used, the automatic label technology is applied to the search engine and therefore recall ratio and precision ratio are guaranteed.

Description

technical field [0001] The invention relates to a method for obtaining tags, in particular to a method for automatically generating tags based on a meta search engine. Background technique [0002] In recent years, with the rapid development of the Internet industry and the maturity of search engines, various search engines have become a useful tool for people to obtain information. With the increase of users, the amount of information generated by the Internet has exploded. Resources are often mixed with various noises, and search engines cannot meet the individual needs of users. In order to make more effective use of these information resources, researchers have introduced "tag" technology, allowing users to search for the desired results more accurately, and screening effective information from massive information has gradually become a research hotspot. With the maturity of "labeling" technology, automatic labeling technology has also attracted the attention of scholar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 唐雅媛罗恩韬唐亚纯高傲
Owner HUNAN UNIV OF SCI & ENG
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products