Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses

A deep understanding about a field of research is valuable for academic researchers. In addition to technical knowledge, this includes knowledge about subareas, open research questions, and social communities (networks) of individuals and organizations within a given field. With bibliometric analyse...

Full description

Bibliographic Details
Published in:Future Internet
Main Authors: Klaus Kammerer, Manuel Göster, Manfred Reichert, Rüdiger Pryss
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Subjects:
Online Access:https://www.mdpi.com/1999-5903/13/8/203
_version_ 1850110376327249920
author Klaus Kammerer
Manuel Göster
Manfred Reichert
Rüdiger Pryss
author_facet Klaus Kammerer
Manuel Göster
Manfred Reichert
Rüdiger Pryss
author_sort Klaus Kammerer
collection DOAJ
container_title Future Internet
description A deep understanding about a field of research is valuable for academic researchers. In addition to technical knowledge, this includes knowledge about subareas, open research questions, and social communities (networks) of individuals and organizations within a given field. With bibliometric analyses, researchers can acquire quantitatively valuable knowledge about a research area by using bibliographic information on academic publications provided by bibliographic data providers. Bibliometric analyses include the calculation of bibliometric networks to describe affiliations or similarities of bibliometric entities (e.g., authors) and group them into clusters representing subareas or communities. Calculating and visualizing bibliometric networks is a nontrivial and time-consuming data science task that requires highly skilled individuals. In addition to domain knowledge, researchers must often provide statistical knowledge and programming skills or use software tools having limited functionality and usability. In this paper, we present the ambalytics bibliometric platform, which reduces the complexity of bibliometric network analysis and the visualization of results. It accompanies users through the process of bibliometric analysis and eliminates the need for individuals to have programming skills and statistical knowledge, while preserving advanced functionality, such as algorithm parameterization, for experts. As a proof-of-concept, and as an example of bibliometric analyses outcomes, the calculation of research fronts networks based on a hybrid similarity approach is shown. Being designed to scale, ambalytics makes use of distributed systems concepts and technologies. It is based on the microservice architecture concept and uses the Kubernetes framework for orchestration. This paper presents the initial building block of a comprehensive bibliometric analysis platform called ambalytics, which aims at a high usability for users as well as scalability.
format Article
id doaj-art-e00a27fdb7c74633a77a5598aaaa5d66
institution Directory of Open Access Journals
issn 1999-5903
language English
publishDate 2021-08-01
publisher MDPI AG
record_format Article
spelling doaj-art-e00a27fdb7c74633a77a5598aaaa5d662025-08-19T23:59:53ZengMDPI AGFuture Internet1999-59032021-08-0113820310.3390/fi13080203Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network AnalysesKlaus Kammerer0Manuel Göster1Manfred Reichert2Rüdiger Pryss3Institute of Clinical Epidemiology and Biometry, University of Würzburg, 97080 Würzburg, GermanyInstitute of Clinical Epidemiology and Biometry, University of Würzburg, 97080 Würzburg, GermanyInstitute of Databases and Information Systems, Ulm University, 89081 Ulm, GermanyInstitute of Clinical Epidemiology and Biometry, University of Würzburg, 97080 Würzburg, GermanyA deep understanding about a field of research is valuable for academic researchers. In addition to technical knowledge, this includes knowledge about subareas, open research questions, and social communities (networks) of individuals and organizations within a given field. With bibliometric analyses, researchers can acquire quantitatively valuable knowledge about a research area by using bibliographic information on academic publications provided by bibliographic data providers. Bibliometric analyses include the calculation of bibliometric networks to describe affiliations or similarities of bibliometric entities (e.g., authors) and group them into clusters representing subareas or communities. Calculating and visualizing bibliometric networks is a nontrivial and time-consuming data science task that requires highly skilled individuals. In addition to domain knowledge, researchers must often provide statistical knowledge and programming skills or use software tools having limited functionality and usability. In this paper, we present the ambalytics bibliometric platform, which reduces the complexity of bibliometric network analysis and the visualization of results. It accompanies users through the process of bibliometric analysis and eliminates the need for individuals to have programming skills and statistical knowledge, while preserving advanced functionality, such as algorithm parameterization, for experts. As a proof-of-concept, and as an example of bibliometric analyses outcomes, the calculation of research fronts networks based on a hybrid similarity approach is shown. Being designed to scale, ambalytics makes use of distributed systems concepts and technologies. It is based on the microservice architecture concept and uses the Kubernetes framework for orchestration. This paper presents the initial building block of a comprehensive bibliometric analysis platform called ambalytics, which aims at a high usability for users as well as scalability.https://www.mdpi.com/1999-5903/13/8/203system architecture designbibliometric analysiscommunity detection
spellingShingle Klaus Kammerer
Manuel Göster
Manfred Reichert
Rüdiger Pryss
Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses
system architecture design
bibliometric analysis
community detection
title Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses
title_full Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses
title_fullStr Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses
title_full_unstemmed Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses
title_short Ambalytics: A Scalable and Distributed System Architecture Concept for Bibliometric Network Analyses
title_sort ambalytics a scalable and distributed system architecture concept for bibliometric network analyses
topic system architecture design
bibliometric analysis
community detection
url https://www.mdpi.com/1999-5903/13/8/203
work_keys_str_mv AT klauskammerer ambalyticsascalableanddistributedsystemarchitectureconceptforbibliometricnetworkanalyses
AT manuelgoster ambalyticsascalableanddistributedsystemarchitectureconceptforbibliometricnetworkanalyses
AT manfredreichert ambalyticsascalableanddistributedsystemarchitectureconceptforbibliometricnetworkanalyses
AT rudigerpryss ambalyticsascalableanddistributedsystemarchitectureconceptforbibliometricnetworkanalyses