Unlock instant, AI-driven research and patent intelligence for your innovation.

Topic portrait system and topic portrait method based on Zhihu

A topic and portrait technology, applied in the field of artificial intelligence systems and data mining, can solve problems such as single information dimension, no structure and dissemination mechanism, and research object staying.

Active Publication Date: 2021-07-09
NANJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Judging from the existing research results, although the network Q&A community information has been used in user behavior research, information quality research, knowledge dissemination research and decision support research, most of the research objects are still in the first generation of keyword search Q&A. Among them, the research of information science and technology focuses on algorithm optimization, and the research of social science is mainly qualitative and experience summarization. There is no content structure and dissemination mechanism for Zhihu, and systematic and universal data mining around topics. method
At the same time, in the research of the second-generation social Q&A community, topic research mainly focuses on topic identification, focusing on natural language processing, including technologies such as network information capture, natural language segmentation, and keyword extraction. There are also the following limitations : 1. Research mainly focuses on methods such as topic and keyword extraction, sentiment analysis, etc., providing information with a single dimension and not being associated with specific application scenarios
2. The application of natural language processing technology is still immature, and many shortcomings such as complex Chinese semantics and lack of training corpus make the actual performance of natural language processing technology poor
3. Most of the text data in the online question-and-answer community is a collection of short texts, which has the characteristics of fragmentation, colloquial expression, and sparse data sets, which brings new challenges to natural language processing technology

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Topic portrait system and topic portrait method based on Zhihu
  • Topic portrait system and topic portrait method based on Zhihu
  • Topic portrait system and topic portrait method based on Zhihu

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0082] The preferred embodiments of the present invention will be further described below in conjunction with the drawings and specific embodiments.

[0083] This invention is based on the content creation and dissemination mechanism of Zhihu community, such as image 3 As shown, it can help those skilled in the art to understand the background of the present invention.

[0084] figure 1 It is a schematic structural diagram of the Zhihu topic portrait system and method based on an embodiment of the present invention, including: "data preprocessing module 11", "topic portrait module 12", and "user graphical interface module 13". The following describes each of the embodiments of the present invention Modules are described in detail.

[0085] Data preprocessing modules such as figure 1 As shown, it is suitable for extracting, cleaning and preprocessing data suitable for the topic portrait module from the website. The data preprocessing module 11 includes a user data crawling...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a topic portrait system and topic portrait method based on Zhihu data. The system includes a data preprocessing module for extracting, cleaning and preprocessing data from a website, a topic portrait module for accurately portraiting a topic, and A user graphical interface module for visually presenting the results of the topic portrait module and downloading reports; the topic portrait method includes the following steps: (1) extracting, cleaning, and preprocessing data from the website, specifically including topic data crawling, data cleaning and Preprocessing; (2) Precise portrait of the topic, specifically including data statistical analysis, user portrait analysis, network data analysis, text data analysis, labeling the specified features of the topic and comparative analysis with similar topics; (3) Presentation of the user image interface, specifically Including visual presentation of analysis results and downloading of analysis reports; the present invention presents portrait results through an interactive and visualized user image interface, which broadens the mining and application of Zhihu data.

Description

technical field [0001] The present invention relates to an artificial intelligence system and a data mining method, in particular to a topic portrait system based on Zhihu, and also to a topic portrait method based on Zhihu. Background technique [0002] Zhihu is an emerging online question-and-answer community in recent years, with the concept of sharing each other's professional knowledge and experience, and maintaining a rigorous and rational community atmosphere. As of September 2017, Zhihu has more than 100 million individual registered users and 18 billion monthly views. Zhihu integrates social elements on the basis of the question-and-answer community, redefines the relationship between people and information, and establishes a new content creation and dissemination mechanism. Its high-quality community content has gradually become an important way for Internet users to acquire knowledge. [0003] Zhihu topic is a kind of social tagging (Social Tagging). Users create...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/9535G06F16/955G06F16/332G06F16/338
CPCG06F16/3329G06F16/338G06F16/9535G06F16/955
Inventor 王飞翔王友国
Owner NANJING UNIV OF POSTS & TELECOMM