e-Infrastructures have become critical platforms that integrate computational resources, facilities, and repositories globally. The coordination and harmonization of advanced e-Infrastructure projects developed with partners from Europe, Latin America, Arabia, Africa, China, and India have contributed to developing interoperable platforms based on identity federation and science gateway technologies. This paper presents these technologies as support for key services in the development of the Arabia networking and services platform for research and education. The platform provides scientists, teachers, and students with seamless access to a variety of advanced resources, services, and applications available at regional e-Infrastructures in Europe and elsewhere. Users simply enter the credentials provided by their home institutions to get authenticated and do not need digital certificate-based mechanisms. Twenty applications from five scientific domains were deployed and integrated. Results showed that on average about 35,000 jobs run each month, for a total of about 17,500 CPU wall-clock hours. Seamlessly integrated e-Infrastructures for regional e-Science activities are therefore important resources that support scientists, students, and faculty with computational services and linkage to global research communities.
Research and education are now globalized and require efficient communication and innovative services, commonly referred to as e-Infrastructure or cyber-infrastructure. An e-Infrastructure ranges from the physical supply of research networks to the provision of data access for virtual research communities. It includes organizations and services as diverse as national and international multi-purpose grids, supercomputer infrastructure, data grids and repositories; tools for visualization, simulation, data management, storage, analysis, and collection; tools that support methods or analysis; and remote access to research instruments and very large research facilities.
Today, research has become increasingly computational, data-intensive, and interdisciplinary. New tools are therefore needed to analyze, model, and visualize diverse datasets in support of a new science paradigm, and to develop complex applications that solve pressing problems. In general, these tools are made accessible on dedicated research and education networks that connect universities and research institutes at large. Providing seamless access to globally distributed tools, applications, and repositories, as well as sharing grid, cloud, and high-performance computing resources, has become a global trend. Virtual research communities have been formed on e-infrastructures to address new scientific challenges through collaboration. Unfortunately, a variety of e-infrastructures are still isolated and context insensitive. Their interoperability to create larger computational capacities is therefore seen as critical to address the requirements of cross-continental research communities, and for the development of research and education and the advancement of science and technology, worldwide, [
Interoperability is a property referring to the ability of diverse systems and organizations to work and interoperate together. It defines standards and policy guidelines for services and applications between regional e-infrastructures and integration of operations, including: storage and hosting of content delivery; services/applications for research and educational community; network communication tools and resources; and virtual learning environments. With interoperability, research communities in computationally intensive scientific areas such as genomics, climate change, and medical diagnostics can easily interact and take advantage of the distributed computing resources: scientific applications and tools, data repositories, CPUs and storage disks.
Science gateways [
Arabia has been developing its e-Infrastructure, which is made up of networks and services, through close coordination with its European counterparts. Its ultimate objective is to build a more cohesive platform for robust scientific collaboration. Interoperability and coordination of these e-Infrastructures, to interlink across continents and to create larger computational capacities, has been identified as a critical mission in the development of research and education and for the socio-economic welfare of the region. A number of Arabian countries interconnect with regional e-Infrastructures either using the commodity Internet or have links to Europe with limited capacity [
This paper contributes to the development of tools to harmonize ubiquitous interoperation between regional e-infrastructures and help research communities to benefit from these infrastructures. The results of this paper build on the collaboration models that have been established by many projects and initiatives between the different e-infrastructures in Europe, Latin America, China, India, Africa, and Arabia in the framework of the EU FP7 funded projects [
Arabia is highly regarded as a region of emerging economy and is host to many western universities and businesses, which require substantial linkage and networking with their counterparts elsewhere. It comprises 22 countries with about 350 million inhabitants and over 10 thousand institutions. Networking these institutions has become critical to take advantage of today's advanced research and education resources and to integrate with research communities at the global level on issues of common concern. The development of Arabia regional network and its global interoperation through a variety of European funded projects, including [
Regional e-Infrastructures exist to connect national research and education high-speed communication networks. The European GÉANT, US Internet2, Canadian CANARIE, Asia-Pacific APAN, and Latin American CLARA are examples of regional networks. So far, the coordination between the regional e-Infrastructure efforts has been restricted to basic operational, organizational, and technological know-how and exchanges. The upsurge of other paradigms, such as virtualization and cloud computing, represents an additional trend in the global e-Infrastructure landscape. In Arabia, the evolution of e-Infrastructures, the foreseen cross-border connectivity, the established international linkages and capacities, and the creation of Arabian Global Exchange points in London and Fujairah are seen as major developments. These are enhanced with federated identity and science gateway platforms, which provide the key services needed to sustain the Arabia e-Infrastructure. The interoperable e-Infrastructure allows young scientists to seamlessly use a variety of applications, services, and repositories in a large-scale ubiquitous computing environment.
In Europe, GÉANT connects 3900 institutions in more than 40 countries and provides education, science, and research services to more than 30 million students, teachers, and researchers. The Asia-Pacific Advanced Network (APAN) connects several countries and supports interlinks and services for many scientists. In Latin America, RedCLARA has played an important role in developing the interconnection between the education and research networks of 16 Latin American countries. UbuntuNet provides coordination for network infrastructure in the Southeastern region of Africa.
Interoperations between regional e-Infrastructures allow global access to shared resources, computing services, and data repositories. CHAIN-REDS has enabled five regional operation centers worldwide to interoperate with the European Grid Infrastructure (EGI). The objective is to support the use of advanced technologies and resources and to interface with the EGI. It has provided links to data repositories, open access document repositories, and open access education repositories [
A number of research communities have used these services and resources intensively and demonstrated their potential in a variety of domains. Examples of these communities are:
• The African Population and Health Research Center
• The Latin America Giant Observatory
• Genome and protein structure prediction using TreeThreader
• Bio-molecules and molecular dynamics simulation using GROMACS
• Quantum chemistry and physics of materials using ABINIT calculations based on density functional theory
Arabia is highly regarded as a region of emerging economy and is host to many western universities and businesses. The aim of the Arabia e-infrastructure is to provide faculties, researchers, and students with ubiquitous and reliable services for networking and computing as well as the open access to e-Science environments and European data resources. As the access to data-driven and computer-intensive resources and services is the basis of innovation improvement and the advancement of knowledge in the academic communities, Arabia e-infrastructure targets:
• Connect education networks to provide scalable inter-domain services, grid and cloud infrastructures. Virtual access to services and data that are lacking in Arabian institutions will enable researchers to join modern research fields such as bioinformatics.
• Create the protocols needed to identify priorities and to provide researchers in Arabian academic institutions with processing at supercomputing facilities upon request.
• Create an open and trusted infrastructure to hold highly accessed scientific information, to encourage research data sharing, and to conduct joint research activities.
• Create mechanisms, strategies, and business models for inter-commercial services to optimize and sustain related investments.
• Create a shared cross-countries multi-disciplinary innovation environment.
• Create data strategies, standards, certification schemes, and ontologies to manage data usage and to harmonize the procedures regulating the delivery of virtual research services. Accordingly, partnership of research organizations with industry can be reinforced.
• Support trans-national software implementations and trainings.
• Propose flexible trans-national business models to ensure financial sustainability.
• Develop and distribute standard codes and know-how among the participating research communities.
• Improve trans-national competitiveness and productivity between participating institutes and companies.
• Promote a change of culture of research communities towards open data.
• Assemble a mass of people, knowledge, and investment that contributes to national and regional economic development.
Identity federation is based on facilitating user identities across several domains through single sign-on [
The Research and Education Identity Federations—REFEDS—represents stakeholders from NRENs, industry, business, and research and education communities [
eduGAIN is a service available to enable seamless exchange of information related to identity, authentication and authorization between the GÉANT Partners' identity federations [
Science gateways are becoming popular tools used by research communities to interact with e-Infrastructures and to access shared data, software, educational resources, and computational services [
The generic architecture of Catania science gateways is presented in
Interoperability is facilitated through science gateways. It allows diverse systems and organizations to work and interoperate together. Standards and policy guidelines are defined for services and applications between regional e-Infrastructures and integration of operations, including: Storage and hosting of content delivery; Services/applications for research and educational community; Network communication tools and resources; Grid computing and coordination; and Virtual learning environments.
With interoperability, research communities in computationally intensive areas such as genomics, climate change, and medical diagnostics can easily interact and take advantage of distributed computing resources: scientific applications and tools, data repositories, CPUs and storage disks.
High-level interfaces are provided to users for quick access to distributed computing and storage resources. They provide a set of well-defined, domain-specific applications for smooth access, while preserving the security imposed by the distributed e-Infrastructure and the topology of the sensitive information managed. Several web and Grid technologies have been adopted and deployed for different VRCs to ensure compliance with authentication and authorization requirements. The highest component in the authorization/authentication hierarchy is integrated in the Science Gateway and supports a Single Sign On mechanism across all services a given user is entitled to use. SAML is used to communicate credentials, on the basis of the Shibboleth System [
When a user signs into the science gateway and is authenticated and authorized to run an application, a proxy certificate is issued to secure Grid transactions. Robot certificates are used and managed on the multi-threaded eToken server, stored in different USB eToken PRO 32/64 KB smart cards [
A number of use cases for e-Infrastructure services in different regions, covering different sets of user requirements, have been identified, mainly: Molecular dynamics—GROMACS, Materials science—ABINIT, Astronomy—LAGO, Population and health—APHRC, and Proteomics—TreeThreader [
We give here insights on the preliminary development of the Pan-Arab e-Infrastructure through identity federation and science gateway. Deployment is taking place at sites in Jordan, Egypt, Algeria and Morocco.
architectural layout of the deployment. The science gateway server installs Liferay with federations made through idp.asrenorg.net and authorization through ldap.asrenorg.net. Users log in to sgw.asrenorg.net and, once authenticated and authorized, receive from the ASREN eToken server the proxy certificate needed for Grid transactions. The core of the eToken server is a “lightweight” grid crypto. It holds the web services to access the smart cards and interacts with the automatic proxy renewal server. A Java multi-platform client is configured for inter-service communication via HTTPS. The eToken server is built on top of the Apache Tomcat Application Server and is configured to accept authorized requests.
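The login flow described above can be illustrated with a minimal sketch. Only the three hostnames come from the text; the `User` type, the `request_proxy` function, and the returned label are hypothetical stand-ins, and no real Shibboleth, LDAP, or eToken call is made.

```python
from dataclasses import dataclass

# Hostnames taken from the text; everything else below is hypothetical.
IDP_HOST = "idp.asrenorg.net"      # federated authentication
LDAP_HOST = "ldap.asrenorg.net"    # authorization directory
GATEWAY_HOST = "sgw.asrenorg.net"  # science gateway entry point


@dataclass(frozen=True)
class User:
    username: str
    authenticated: bool  # assumed outcome of the federated login
    roles: frozenset     # assumed roles found in the directory


def request_proxy(user, application, required_role):
    """Walk the three steps in order: authenticate, authorize, issue proxy."""
    if not user.authenticated:
        raise PermissionError(f"{user.username}: not authenticated by {IDP_HOST}")
    if required_role not in user.roles:
        raise PermissionError(
            f"{user.username}: not authorized by {LDAP_HOST} for {application}"
        )
    # Placeholder only: in the deployment the eToken server would issue
    # a proxy certificate at this point.
    return f"proxy-for:{application}"
```

The point of the ordering is that the proxy is requested only after both checks succeed, so the gateway never asks the eToken server for credentials on behalf of an unauthenticated or unauthorized user.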
The ASREN identity federation is based on the concept of setting up a common framework for the Arab education and research institutions to manage access to online resources, services and repositories. Researchers, users, scientists, and students use their own credentials at their home institutions (identity providers) for access. The federation framework facilitates authentication and authorization using the Shibboleth implementation of the SAML standard, to allow interoperation. It serves as the basis of the pan-Arab e-Infrastructure development. In this framework, we propose federations at the national level aggregating local Identity providers and service providers within universities and research institutions. The Arabian countries will establish the federations. In the meantime, when there is no national federation initiative, ASREN federation plays the role of an aggregate federation for all.
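The fallback rule in this framework (use a national federation where one exists, otherwise the ASREN aggregate federation) can be sketched as a simple lookup. The function name, the dictionary layout, and the example federation name are assumptions made for illustration and do not describe an existing registry.

```python
AGGREGATE_FEDERATION = "ASREN"  # aggregate federation named in the text


def resolve_federation(country, national_federations):
    """Return the federation that aggregates identity providers for a country.

    `national_federations` is a hypothetical mapping from country name to
    the name of its national federation.
    """
    return national_federations.get(country, AGGREGATE_FEDERATION)


# Example with a placeholder federation name (not a real federation).
example = {"Jordan": "example-national-federation"}
print(resolve_federation("Jordan", example))
print(resolve_federation("Morocco", example))
```

With this rule, institutions in a country without a national initiative can join immediately and be moved to a national federation later by adding one entry to the mapping.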
The ASREN Science Gateway builds on the EUMEDGRID Science Gateway, which has been implemented using the Catania Science Gateway Framework, to give users access to the distributed computing environment. Its basic elements are developed using standard portlets in the common Liferay portal framework, supported by web 2.0 interfaces that integrate easily with many technologies. Users access the portal with specific roles and privileges and are allowed to run applications embedded in the Science Gateway. Applications are interfaced to the underlying e-infrastructure via a set of independent middleware services and are accessible through decoupled authentication and authorization processes. The Identity Federation based on SAML 2.0 provides authentication, while authorization is governed by a set of agreements. Several applications have been made available to scientists with seamless access. These include ASTRA, BES, ClustalW, CMSquares, GATE, Octave, Phylogenetics, and Sonification.
Testing has been carried out using the EMI-gLite-based e-Infrastructure layout at INFN Catania. A Java multi-platform client has been developed and configured for inter-service communication via HTTPS. To improve performance, the server is built on top of the Apache Tomcat Application Server and configured to accept requests only from a set of authorized “clients” (i.e., the Science Gateways). The grid proxies are generated by the server on request and are accessible through a Representational State Transfer (REST) API. The adoption of Apache Tomcat as an Application Server supports scalability and high performance, with a cache mechanism implemented at the eToken Server.
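The two server-side behaviors mentioned here, accepting requests only from authorized clients and caching generated proxies, can be sketched as follows. This is a generic illustration in Python, not the eToken server's code: the class name, the time-to-live policy, and the `issue_proxy` callback are assumptions, and the text does not state how the actual cache expires entries.

```python
import time


class ProxyCache:
    """Allowlist check plus a time-limited cache in front of proxy issuance."""

    def __init__(self, authorized_clients, ttl_seconds, issue_proxy,
                 clock=time.monotonic):
        self._authorized = set(authorized_clients)
        self._ttl = ttl_seconds
        self._issue = issue_proxy   # hypothetical callback that creates a proxy
        self._clock = clock         # injectable clock, useful for testing
        self._entries = {}          # key -> (proxy, expiry time)

    def get(self, client, key):
        # Reject any caller that is not on the allowlist.
        if client not in self._authorized:
            raise PermissionError(f"client {client!r} is not authorized")
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and entry[1] > now:
            return entry[0]         # still valid: reuse the cached proxy
        proxy = self._issue(key)    # missing or expired: issue a new one
        self._entries[key] = (proxy, now + self._ttl)
        return proxy
```

Caching in this way avoids regenerating a proxy for every job submission, which is one plausible reading of why a cache helps performance at the eToken Server.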
The original pilot test-bed, built in the first phase of EUMEDGRID, has smoothly evolved into a production service counting 38 sites, for a total of about 4000 CPU cores and 600 TB of disk storage. Twenty applications from five scientific domains have been deployed on the EUMEDGRID infrastructure to be integrated in ASREN science gateway. The results of the application test show that about 35,000 jobs are running on average each month for a total of about 17,500 CPU wall clock hours.
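As a quick reading of these figures, the reported monthly totals imply an average job length. The short calculation below uses only the two numbers given above; it assumes the totals are directly comparable and says nothing about how job lengths are distributed.

```python
MONTHLY_JOBS = 35_000              # average jobs per month, from the text
MONTHLY_WALL_CLOCK_HOURS = 17_500  # CPU wall-clock hours per month, from the text

hours_per_job = MONTHLY_WALL_CLOCK_HOURS / MONTHLY_JOBS
minutes_per_job = hours_per_job * 60

print(f"{minutes_per_job:.0f} minutes per job on average")  # prints "30 minutes per job on average"
```

An average of about half an hour per job suggests that the workload consists mostly of short jobs, although individual applications may differ substantially.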
The ultimate goal is to implement, manage, and extend sustainable Pan-Arab e-Infrastructures dedicated to research and education communities. The e-Infrastructure provides vital resources for the deployment of services with authenticated access for a large group of Arab scientists in more than 1000 institutions. Several efforts can be made to stimulate interest and increase usage of e-Infrastructures across the region. These include: 1) a region-wide awareness campaign on the computing resources, services, and applications that are available to scientific communities; 2) a top-down approach to support decision making towards integration in the global research and education networks, in order to share experiences, learn from best practice models, and enhance collaboration with other regional communities; 3) stimulation of government spending on research, with funding focused on computationally intensive projects that address problems of regional importance.
This paper presents an interoperable platform using identity federation and science gateway models for developing a pan-Arab e-Infrastructure to provide seamless access to e-Science resources, applications, and services. An architectural layout is given with details on the initial implementation setup. Twenty applications from five scientific domains have been deployed on the EUMEDGRID infrastructure and have been integrated in the ASREN Science Gateway. The results of the application test show that about 35,000 jobs are running on average each month for a total of about 17,500 CPU wall clock hours. Seamlessly integrated e-Infrastructures are therefore becoming critical resources for e-Science activities in the region. A pan-Arab regional e-Infrastructure has evolved, building on the results of the EC funded CHAIN-REDS project, which was concluded in 2015. A new phase of development of the ASREN Science Gateway and Identity Federation is planned so that it scales to support scientists, students, and faculty across the region with resources and services.
The authors would like to acknowledge the financial support of the European Commission in the context of the EUMEDGRID, EUMEDCONNECT, and CHAIN-REDS projects. Special thanks go to Yousef Torman, Ola Samara, Ashraf Alhuseini, Ramez Qunaibi, Mohamad Alshami, and Dr. Ahmad Bargash for their valuable contributions.
The authors declare no conflicts of interest regarding the publication of this paper.
Al-Agtash, S. and Barbera, R. (2019) Interoperable e-Infrastructure Services in Arabia. Journal of Computer and Communications, 7, 29-41. https://doi.org/10.4236/jcc.2019.75003