International Symposium on Grids & Clouds 2018 (ISGC 2018) in conjunction with Frontiers in Computational Drug Discovery (FCDD)

Name: International Symposium on Grids & Clouds 2018 (ISGC 2018) in conjunction with Frontiers in Computational Drug Discovery (FCDD)
Start: 2018-03-16T08:00:00+08:00
End: 2018-03-23T18:00:00+08:00
Location: Academia Sinica

16-23 March 2018

Academia Sinica

Asia/Taipei timezone

Support

stella.shen@twgrid.org

Authorship recognition and disambiguation of scientific papers using a neural networks approach

21 Mar 2018, 16:30

30m

Media Conference Room, BHSS (Academia Sinica)

Media Conference Room, BHSS

Academia Sinica

Oral Presentation Humanities, Arts, and Social Sciences (HASS) Application Humanities, Arts & Social Sciences Session

Dr Luca Tomassetti (University of Ferrara and INFN) Dr Sebastiano Fabio Schifano (University of Ferrara and INFN)

One of the main issues affecting the quality and reliability of bibliographic records retrieved from digital libraries -- such as Web of Science, Scopus, Google Scholar and many others -- is the autorship recognition and author names disambiguation. So far these problems have been faced using methods mainly based on text-pattern-recognition for specific datasets, with high-level degree of errors. In this paper, we propose an approach using neural networks to learn features automatically for solving authorship recognition and disambiguation of author names. The network learns for each author the set of co-writers, and from this information recovers authorship of papers. In addition, the network can be trained taking into account other features such as author affiliations, keywords, projects and research areas. The network has been developed using the TensorFlow framework, and run on recent Nvidia GPUs and multi-core Intel CPUs. Test datasets have been selected from records exported in RIS format from the Scopus digital library, for several groups of authors working in the fields of computer science, environmental science and physics. The proposed methods achieves accuracies above 99% in authorship recognition and is able to effectively disambiguate homonyms. We have taken into account several network parameters, such as training-set size and batch size, number of levels and hidden units, threshold and weight initialization, back-propagation algorithms, and analyzed the impact on accuracy of results. This approach can be easily extended to any dataset and any bibliographic records provider.

Dr Luca Tomassetti (University of Ferrara and INFN) Dr Sebastiano Fabio Schifano (University of Ferrara and INFN) Mr Tommaso Sgarbanti (University of Ferrara)

There are no materials yet.

International Symposium on Grids & Clouds 2018 (ISGC 2018) in conjunction with Frontiers in Computational Drug Discovery (FCDD)

Support

Authorship recognition and disambiguation of scientific papers using a neural networks approach

Media Conference Room, BHSS

Academia Sinica

Speakers

Description

Primary authors

Presentation materials