Research hotspot analysis method based on expert paper big data

A research hotspot analysis method using big data technology, applied in the field of data processing. It addresses problems of existing tools such as the inability to support professional analysis, limits on how much data can be downloaded, and the lack of deeper analysis functions. Its effects are to compensate for sampling randomness and small data samples, and to achieve high authenticity and accuracy.

Inactive Publication Date: 2019-05-21
SOUTH CHINA UNIV OF TECH +1
4 Cites, 5 Cited by

AI Technical Summary

Problems solved by technology

[0003] However, HowNet's own search engine requires manual clicking for every operation, which makes it very laborious to use for data analysis.
The amount of data that can be manually selected and downloaded is limited, and labor costs usually rule out downloading all of it, so an overall, comprehensive analysis of the data is difficult.
Moreover, its analysis function is limited to remembering and sorting the keywords entered by users; it offers no deeper analysis.
CiteSpace also relies on manual operation and is inefficient; incomplete data causes analysis errors, so it cannot solve professional analysis problems.

Examples

Embodiment

[0023] As shown in Figure 1, this embodiment provides a research hotspot analysis method based on big data of expert papers, comprising the following steps:

[0024] S1. Search for papers by keyword, using a knowledge database as the data source, and capture open data such as title, publication time, author, and data source;

[0025] S2. Perform word segmentation on the titles of the downloaded papers; delete structural words such as conjunctions, prepositions, and pronouns; remove verbs and adjectives; and keep only nouns to obtain a hot word list;

[0026] S3. Delete everyday words from the hot word list by matching it against a corpus of everyday words commonly used in professional papers and excluding the matches, to obtain a professional vocabulary list;

[0027] S4. Perform word frequency analysis on the professional vocabulary list, sort it in descending order of word frequency, and select the top 100 entries to obtain the general table of professional vocabulary w...
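
Steps S2 to S4 can be sketched in Python roughly as follows. This is an illustrative sketch under stated assumptions, not the patented implementation: it assumes each title has already been segmented and part-of-speech tagged by some external segmenter, and the tag letters, the function names, and the sample data are all hypothetical.

```python
from collections import Counter

# Assumption: each title arrives as a list of (word, tag) pairs produced by
# an external segmenter. The tag letters are illustrative only:
# "n" noun, "v" verb, "a" adjective, "c" conjunction, "p" preposition.

def extract_nouns(tagged_title):
    """S2: keep only the tokens tagged as nouns."""
    return [word for word, tag in tagged_title if tag == "n"]

def drop_everyday_words(words, everyday_corpus):
    """S3: exclude words that match the everyday-word corpus."""
    return [word for word in words if word not in everyday_corpus]

def top_terms(tagged_titles, everyday_corpus, limit=100):
    """S4: count the remaining terms and return the `limit` most frequent,
    in descending order of frequency, as (term, count) pairs."""
    counts = Counter()
    for title in tagged_titles:
        nouns = extract_nouns(title)
        counts.update(drop_everyday_words(nouns, everyday_corpus))
    return counts.most_common(limit)

# Hypothetical sample data, in English for readability.
sample_titles = [
    [("research", "n"), ("on", "p"), ("reverberation", "n"),
     ("in", "p"), ("theater", "n")],
    [("measuring", "v"), ("reverberation", "n"), ("and", "c"), ("noise", "n")],
    [("analysis", "n"), ("of", "p"), ("urban", "a"), ("noise", "n")],
]
sample_everyday = {"research", "analysis"}

if __name__ == "__main__":
    for term, count in top_terms(sample_titles, sample_everyday):
        print(term, count)
```

Keeping the everyday-word corpus as a set makes each exclusion check in S3 a constant-time lookup, which matters once the number of captured titles grows large.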

Abstract

The invention discloses a research hotspot analysis method based on expert paper big data, comprising the following steps: S1, searching for papers by keyword with a knowledge database as the data source, and capturing open data such as paper title, publication time, author, and data source; S2, performing word segmentation on the titles of the downloaded papers, deleting structural words such as conjunctions, prepositions, and pronouns, removing verbs and adjectives, and keeping only nouns to obtain a hot word list; S3, deleting everyday words from the hot word list by matching it against a corpus of everyday words commonly used in professional papers and excluding the matches, to obtain a professional vocabulary list; S4, performing word frequency analysis on the professional vocabulary list, sorting it in descending order of word frequency, and selecting the top 100 entries to obtain a general word frequency table of professional vocabulary; and S5, adding year data and, on the basis of the general word frequency table, listing a word frequency score table for each year, so as to obtain the trend of the research focus over time.
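
Step S5 can be illustrated with a short Python sketch. It is a hedged example rather than the inventors' implementation: it assumes each paper has already been reduced to a (year, terms) pair, uses raw counts in place of whatever "word frequency score" the patent defines, and the names `yearly_frequency_table` and `term_trend` are hypothetical.

```python
from collections import Counter, defaultdict

def yearly_frequency_table(records, vocabulary):
    """Build {year: {term: count}} for terms in the general frequency table.

    records: iterable of (year, terms) pairs, one per paper (assumed input).
    vocabulary: set of terms kept in the general word frequency table.
    """
    table = defaultdict(Counter)
    for year, terms in records:
        table[year].update(term for term in terms if term in vocabulary)
    # Return plain dicts with years in chronological order.
    return {year: dict(table[year]) for year in sorted(table)}

def term_trend(table, term):
    """Return [(year, count)] for one term, with 0 for years it is absent."""
    return [(year, counts.get(term, 0)) for year, counts in table.items()]

# Hypothetical sample data.
sample_records = [
    (2017, ["noise"]),
    (2016, ["noise", "reverberation"]),
    (2017, ["noise", "theater"]),
]
sample_vocabulary = {"noise", "reverberation"}

if __name__ == "__main__":
    table = yearly_frequency_table(sample_records, sample_vocabulary)
    print(term_trend(table, "noise"))
```

Reading one term's counts across the sorted years gives the time-axis trend that S5 describes; a real score might instead normalize by the number of papers per year, which the source does not specify.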

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to a research hotspot analysis method based on big data of expert papers.

Background

[0002] HowNet has its own search engine, which supports searching by title, author, keywords, and so on; the search results can also be exported and shared with software such as NoteExpress. CiteSpace, a Java application for analyzing and visualizing co-citation networks, can analyze the development process and structural relationships of scientific knowledge. After data such as titles is exported from HowNet, CiteSpace can perform scientific knowledge mapping analyses such as keyword analysis and author relationship analysis.

[0003] However, the defect of HowNet's existing search engine is that every operation requires manual clicking, which makes it very laborious for data analysis. The amount of manually selected and downloaded data is limi...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/2457, G06F16/248, G06F17/27
Inventors: 黄翼, 吴硕贤
Owner: SOUTH CHINA UNIV OF TECH