A method for extracting people's interest tags based on social networks

A social network and tag extraction technology, applied in the field of tag extraction, can solve the problems of ignoring the representativeness of candidate words, not considering the structure of document text, ignoring phrases, etc., to achieve the effect of accurate hobbies

Active Publication Date: 2021-10-08
SUZHOU UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Most of the existing interest tag extraction algorithms use single words as interest tags, while ignoring phrases and unique topic tags in social networks
In addition, the TFIDF algorithm mentioned above only considers the frequency of words in documents and document libraries, but does not consider the text structure of documents.
On the contrary, the TextRank algorithm only considers the role of candidate words in the document structure, but ignores the representation of candidate words in the entire corpus, which is easily affected by meaningless words (such as stop words, etc.)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for extracting people's interest tags based on social networks
  • A method for extracting people's interest tags based on social networks
  • A method for extracting people's interest tags based on social networks

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments, so that those skilled in the art can better understand the present invention and implement it, but the examples given are not intended to limit the present invention.

[0031] to combine Figure 1 to Figure 3 As shown, the present invention discloses a method for extracting interest tags of people based on social networks, including the following steps: step A: data preprocessing; step B: derivation of candidate tags; step C: extraction of interest tags.

[0032] The step A: data preprocessing, which is used to clean, filter and replace the social network data of the person to form a set including multiple words; the data preprocessing includes case conversion, word segmentation, part-of-speech marking, and deletion in turn. Stop words, remove slang, remove links, remove emoticons, remove retweets. Wherein, the case conversion includes: uniforml...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for extracting interest tags of people based on social networks, which includes the following steps: Step A: data preprocessing, which is used to clean, screen and replace the social network data of people to form a set including multiple words; Step B: Derivation of candidate tags, read in and judge the words in the set in turn, and form a candidate tag set including topic tags, word candidate tags and phrase candidate tags; Step C: Extraction of interest tags, including candidate tags Determination of TF value; calculation of candidate tag IDF value; sorting according to the TFIDF value of candidate tags, exporting some topic tags to the interest tag set; weight calculation between candidate tags; score calculation of candidate tags; acquisition of interest tag set . The present invention has at least the following advantages: it not only considers the frequency of the interest tags in the document library and documents, but also considers the impact of the document structure on the interest tags, and can obtain more accurate effects.

Description

technical field [0001] The invention relates to the technical field of tag extraction, in particular to a method for extracting tags of people's interests based on social networks. Background technique [0002] With the rapid development of Internet applications, social networks have an increasing influence on users. People are increasingly relying on social networks for information exchange and sharing, which has brought about an explosive growth of Internet data. At the same time, users' demand for personalization is becoming stronger and stronger, such as recommending users' favorite products, games, music, movies or News and more. Character interest tags are usually used to describe the identity attributes and interest attributes of characters, which are very helpful for character retrieval and recommendation, character behavior analysis, discovery of character hobbies and character portrait models. [0003] Commonly used interest tag extraction technologies include TF...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/9536G06F16/35G06F40/289G06F40/216G06Q50/00
CPCG06F16/9535G06F40/289
Inventor 韩月辉赵雷
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products