 
    PEARC25
- Columbus, OH
Join a Globus session at PEARC25, stop by the Globus booth #6. Have a question? Join Tuesday’s Globus Office Hours.
Date: July 21, 2025, 9:00 am-12:30 pm ET
Speakers: Vas Vasiliadis, Rachana Ananthakrishnan
Location: Room A215
From First Byte to Publication: Instrument Science Enabled by Globus
The proliferation of instruments, e.g., cryogenic electron microscopes and nanopore sequencers, is driving the need for automated solutions to manage generated data throughout its lifecycle—especially as resolutions and datasets, continue to grow. The Globus platform is used in diverse scenarios to build solutions that increase instrument throughput and researcher productivity, and ensure these expensive devices remain highly utilized.
This tutorial focuses on integrating multiple services in the Globus platform to build a scalable solution for automating instrument data management and computation, from the time the first byte of data is captured, to distribution and publication of final data and findings. Building on the success of our tutorial “Scaling Instrument Science in the FAIR Age” at PEARC24, we will present an overview of the relevant services and engage participants in a series of hands-on exercises to create and run automated flows that process data coming from an instrument.
The material is geared primarily towards research data and computing professionals, including system administrators and research software engineers. Participants will develop an understanding of the various solution components and leave with a fully-working, small scale system— including an actual instrument(!)—that can serve as a starting point for their own development efforts. The tutorial is intended for research computing and data (RCD) professionals. The material is mostly at the intermediate and advanced levels, suitable for those with an understanding of data- and compute-intensive research tasks, with some exposure to development of tools and applications to support research.
Date: July 21, 2025, 1:30-5:00 pm ET
Speaker: Kyle Chard
Location: Room A215
Globus Compute: Federated Function as a Service for the Computing Continuum
Growing data volumes, new computing paradigms, and increasing hardware heterogeneity are driving the need to execute computational tasks across a continuum of distributed computing resources. Such needs are motivated by the desire to compute closer to data acquisition sources, exploit specialized computing resources (e.g., hardware accelerators), provide real-time processing of data, reduce energy consumption (e.g., by matching workload with hardware), and scale simulations beyond the limits of a single computer. Globus Compute addresses these needs by delivering a hybrid cloud platform implementing the Function-as-a-Service (Faas) paradigm. Researchers first register their desired function with the cloud-hosted Globus Compute service, they can then request invocation of that function with arbitrary input arguments to be executed on remote cyberinfrastructure. Globus Compute manages the reliable and secure execution of the function, provisioning resources, staging function code and inputs, managing safe and secure execution (optionally using containers), monitoring execution, and asynchronously returning results to users via the cloud platform. Functions are executed by the Globus Compute endpoint software—an agent that may be installed by administrators and offered to user communities or installed by users anywhere they have access. The endpoint effectively turns any existing resource (e.g., laptop, cloud, cluster, supercomputer, or container orchestration cluster) into a FaaS endpoint. Over the last three years, Globus Compute has been used by thousands of researchers around the world to execute more than 50M tasks across more than 15,000 distributed computing endpoints.
This tutorial builds upon the success of a similar tutorial hosted at PEARC 2024. That tutorial was attended by 20-25 people and was the first tutorial to close registration due to all slots being filled. The tutorial will discuss opportunities for FaaS in research computing, approaches for portable execution across endpoints, and the benefits of this approach (e.g., performance, energy efficiency). Further, it will directly relate to modern approaches in CI, for example enabling fine-grained and portable allocations in NSF ACCESS and as a common interface for remote computing in DOE’s integrated research infrastructure. The tutorial will extend existing tutorial materials that have been delivered at many international venues.
Date: July 22, 2025, 11:00 am - 3:30 pm
Location: Room A222
Globus Office Hours
An increasingly large segment of the PEARC community is using the Globus platform for research data management and remote computation. This meeting is an opportunity for current and prospective users to engage with the Globus team and get answers to  questions related to their specific use cases and Globus implementation on campus. We will be available to answer ad hoc questions and do “deep dives” into specific services to help users better understand best practices and what is possible.
Date: July 22, 2025, 4:15 pm - 5:15 pm
Location: A211-A212
Birds of a Feather
Authors: Daniel Madren and Geoffrey Lentner, Purdue
Navigating Open-Source Software Commercialization: From Infancy to Maturity Open-source software (OSS) has become the foundation of modern research computing and scientific collaboration. However, transitioning an OSS project from a single-maintainer project to long-term sustainability remains a challenge for developers and institutions. This Birds of a Feather (BOF) will bring together thought leaders from across the research computing landscape to discuss sustainability and commercialization in OSS. Our panel will feature experts from various stages of this journey: HyperShell (Purdue), a newly emerging project in its infancy; Open OnDemand (OSU), a portal solution navigating funding and sustainability models; Globus (UChicago), a highly successful model of OSS sustainability; and Spack (LLNL), the HPC package manager joining a foundation. We will also highlight the role of organizations like the Software Sustainability Institute (SSI), NumFOCUS, and others in providing structured onramps for many OSS projects, helping to navigate governance, funding, and long-term sustainability.
Representatives:
Globus - Rachana Ananthakrishnan, Executive Director & Head of Products, Globus, University of Chicago
Open OnDemand - Alan Chalker, OSC, Director of Strategic Programs, Open OnDemand PI
Spack - Todd Gamblin, LLNL, Distinguished Member of Technical Staff, Creator of Spack
Software Sustainability Institute - Neil Chue Hong, University of Edinburgh, Founder of SSI
Poster Sessions:
July 23, 2025 - 5:30 pm -7:30pm ET
Poster#17: SHARED: Secure Hub for Access, Reliability, and Exchange of Data
Authors: Hakizumwami Birali Runesha, Torsten Reimer and Gustavo Garcia Rondina
The University of Chicago’s Secure Hub for Access, Reliability, and Exchange of Data, (SHARED), serves as a resource for data-driven research and a platform for integrated data management. Funded by the NSF, SHARED provides federated data storage for projects across disciplines, promoting collaboration and best practices in data management. Beyond managing ‘active’ data, SHARED integrates with UChicago’s institutional repository to support data sharing in line with FAIR principles. Twenty percent of the storage capacity will be accessible through the NSF PATh Open Science Data Federation, Globus, and Rucio, extending SHARED’s storage reach across research institutions. Data will also be available to U.S. researchers via the OSG’s OSPool. The initiative is led by the University Library and Research Computing Center, in collaboration with several university departments.  |  SHARED aligns with data lifecycle strategy, ensuring data access, analysis, publication, distribution, and long-term archiving. It supports diverse scientific applications, from cosmological studies to linguistic research, with a Ceph-based storage platform. Projects include dark matter searches, cosmic reionization, and cognitive process studies. SHARED fosters interdisciplinary research and offers educational opportunities through collaborations with other institutions and K-12 student engagement. This poster will showcase SHARED’s approach to integrating active storage with long-term data sharing and preservation.
Poster#19: The Nebraska Research Data Storage: NRDStor
Authors:  Derek Weitzel and Caughlin Bohn
The Nebraska Research Data Storage (NRDStor) service, provided by the University of Nebraska, aims to simplify and accelerate researchers’ access to computing resources, including campus computing clusters. Researchers can access NRDStor through two convenient methods: directly through the file system mounted on compute worker nodes or utilizing the SMB protocol from any campus-networked computer. Furthermore, NRDStor enhances the ease of sharing research data and results through integration with the Open Science Data Federation (OSDF) and Globus.