A new approach for scientific data dissemination in developing countries: a case of Indonesia
This short paper is intended as an additional progress report to share our experiences in Indonesia on collecting, integrating and disseminating both global and local scientific data across the country through the web technology. Our recent efforts are exerted on improving the local public access to global scientific data, and on the other hand encouraging the local scientific data to be more accessible for the global communities. We have maintained well-connected infrastructure and some web-based information management systems to realize such objectives. This paper is especially focused on introducing the ARSIP for mirroring global as well as sharing local scientific data, and the newly developed Indonesian Scientific Index for integrating local scientific data through an automated intelligent indexing system.
💡 Research Summary
The paper presents a practical solution to the chronic problem of scientific data accessibility in a developing country, using Indonesia as a case study. The authors identify three major obstacles: insufficient network infrastructure, a shortage of skilled human resources, and relatively low levels of scientific activity. While the first issue is gradually improving thanks to recent telecom privatization, the remaining challenges require internal initiatives by the scientific community itself. To address these, the authors have built two complementary systems: ARSIP (Archive Server for Scientific Data) and ISI (Indonesian Scientific Index).
ARSIP is a centralized mirroring service that replicates major global scientific repositories—such as NASA’s Astrophysics Data System, the Particle Data Group at LBL, and SLAC’s SPIRES—on servers located within Indonesia. By negotiating a Memorandum of Understanding with the national ISP Indosat Mega Media, ARSIP enjoys unlimited bandwidth both to the international backbone and to the Indonesian Internet Exchange (IIX). This architecture eliminates the bottleneck that would otherwise arise when many local users simultaneously access foreign servers, and it also provides a reliable, high‑speed gateway for Indonesian researchers to retrieve large datasets (e.g., protein structures, high‑energy physics data). In a novel twist, ARSIP also mirrors locally generated scientific data (e.g., magnetic observations from LAPAN, astronomical images from the Bosscha Observatory) so that international users can download these resources without overloading the originating institutions’ limited international links. The mirroring approach thus turns the traditional “decentralization” principle on its head, using a single, well‑provisioned hub to serve both global and local data efficiently.
ISI tackles the second major challenge: the collection and integration of scattered local scientific outputs. Earlier attempts, such as the DBRiptek portal, required each institution to manually enter data twice—once on its own website and again on the national database—leading to inconsistencies and a heavy administrative burden. ISI replaces this with an automated web‑crawling system that periodically visits predefined institutional sites. Unlike generic crawlers, ISI’s back‑end is manually configured for each target, allowing it to recognize and extract structured information such as publications, author profiles, bibliographies, and institutional descriptions. The resulting index provides precise, field‑specific search results, far surpassing the capabilities of a simple keyword search. Because participation only requires institutions to make their existing web pages publicly accessible, no additional data‑entry work or format standardization is imposed, preserving the autonomy of each research group while still delivering a unified national search platform.
The authors report that both systems are operational: ARSIP has been mirroring global datasets since 2003 and began hosting Indonesian data in 2007, while ISI is currently in a beta phase with promising performance. Future work includes embedding an Online Calculator for Scientific Performance (OCSP) into ISI to automatically generate metrics for individuals, institutions, and projects, thereby supporting transparent decision‑making and policy formulation.
Overall, the paper demonstrates that, even with limited resources, a developing nation can achieve substantial improvements in scientific data dissemination by centralizing critical infrastructure, leveraging partnerships with national telecom providers, and employing intelligent, low‑cost software solutions. The ARSIP and ISI models are presented as scalable templates that could be adapted by other countries facing similar constraints, ultimately narrowing the information gap between the Global North and South.
Comments & Academic Discussion
Loading comments...
Leave a Comment