Academic article processing method and search processing method and apparatus for academic articles

An article and academic technology, applied in the computer field, can solve the problems of uneven source quality, difficulty in achieving high accuracy and recall at the same time, and achieve the effect of improving accuracy and recall

Active Publication Date: 2015-09-09
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In the process of realizing the disambiguation of academic authors, there are at least the following problems: because the difficulty of disambiguation and the quality o

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Academic article processing method and search processing method and apparatus for academic articles
  • Academic article processing method and search processing method and apparatus for academic articles
  • Academic article processing method and search processing method and apparatus for academic articles

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0038] figure 1 It is a schematic flowchart showing the computer-implemented processing method for academic articles according to an exemplary embodiment of the present invention.

[0039] refer to figure 1 , the academic article processing method implemented by computer in this embodiment specifically includes:

[0040] In step S110, multiple articles with the same author name feature are obtained.

[0041] Specifically, the purpose of this step is to extract multiple articles with the same author name (possibly the same name) and gather them together.

[0042] In step S120, the multiple articles are clustered according to the author's institution characteristics to obtain multiple first clusters.

[0043] Specifically, the purpose of this step is to group articles with the same or similar authors' institutions. "Author's name + author's institution" is used as an identification method for an author entity. However, due to institutional changes and an author holding posit...

Embodiment 2

[0076] figure 2 It is a schematic flowchart showing a method for searching and processing academic articles in an exemplary embodiment of the present invention.

[0077] refer to figure 2 , the search processing method of the academic article of the present embodiment specifically includes:

[0078] In step S210, the user's search terms for academic articles are sent to the server.

[0079] In step S220, a plurality of academic article search result items are received from the server, and the academic article search result items include article titles, author information, and cluster identifiers corresponding to the articles.

[0080] Specifically, the author information may include the author's name, the institution to which the author belongs, and the like. The cluster identifier is the cluster identifier of the third cluster in the first embodiment.

[0081] In step S230, displaying the search result item of the academic article on the user interface;

[0082] In ste...

Embodiment 3

[0091] image 3 It is a schematic flowchart showing a method for searching and processing academic articles in an exemplary embodiment of the present invention.

[0092] refer to image 3 , the search processing method of the academic article of the present embodiment specifically includes:

[0093] In step S310, the user's search terms for academic articles are received from the client.

[0094] In step S320, a plurality of search result items of academic articles corresponding to the search words are obtained according to the search words, and the search result items of academic articles include article titles, author information, and cluster identifiers corresponding to the articles.

[0095] Specifically, the author information may include the author's name, the institution to which the author belongs, and the like. The cluster ID is figure 1 Cluster ID for the third cluster in the illustrated embodiment.

[0096] In step S330, the plurality of academic article search...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an academic article processing method and a search processing method and apparatus for academic articles. The academic article processing method comprises: acquiring a plurality of articles with the same author name characteristic; clustering the plurality of articles according to the author subsidiary organ characteristics of the articles to obtain a plurality of first clusters; clustering the plurality of first clusters according to the cooperator characteristics and the first semantic characteristics of the articles to obtain a plurality of second clusters; and clustering the plurality of second clusters according to the author subsidiary organ characteristics and the second semantic characteristics of the articles to obtain a plurality of third clusters, wherein the set of the second semantic characteristics is the subset of the set of the first semantic characteristics. According to the academic article processing method and the search processing method and apparatus for the academic articles, which are provided by the invention, the accuracy rate and recall rate of the articles corresponding one author entity are improved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method for processing academic articles and a method and device for searching and processing academic articles. Background technique [0002] With the rapid increase in the number of electronic publications (papers, books, patents, etc.), the same author appears in multiple names (alias, abbreviation, etc.), and the situation of multiple authors with the same name is becoming more and more serious. [0003] Imagine the following scenario: When a graduate student in a certain field is reading related literature in this field, he finds an article of particular interest. The first author of the article is "Zhang San". They all came to read. But even in the field of scientific research, there may be a large number of scholars named "Zhang San", and even in the same segment, there will be many cases of the same name. So, how to find out all the articles published by "Zhang San" ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/93
Inventor 高一鸣李浩张晓婧
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products