Workshop Overview Videos#

Welcome to the workshop. This section provides a quick video-based introduction to the key topics and additional context around Surya.


Videos#


Welcome from NASA - Madhulika Guhathakurta#

What to expect: Introduction from Madhulika Guhathakurta to the Surya Science workshop, focused on integrating the Surya foundation model into heliophysics through rigorous testing and collaboration. This workshop is a shift away from typical product demonstrations toward a framework where AI is interrogated as a scientific object and its errors are analyzed for deeper physical insights. This introduction outlines a clear progression from understanding model foundations to validating results, emphasizing that success lies in generating sharper research questions rather than just achieving better technical metrics.

Welcome from NASA-IMPACT - Rahul Ramachandran#

What to expect: Rahul Ramachandran provides a strategic overview of NASA’s AI for Science initiative, framing the Surya model within the broader historical mission of distilling knowledge from massive space science datasets. This talk explores how foundation models represent a paradigm shift from labor-intensive, bespoke modeling to scalable, self-supervised systems that can be adapted for diverse scientific applications. The presentation uses the success of the Prithvi Earth science model as a blueprint, challenging participants to move beyond traditional classification tasks toward innovative explorations of the model’s underlying data representations.

Foundation models for science: Johannes Schmude | IBM#

Q&A#

What to expect: Johannes Schmude from IBM highlights the transformative potential of foundation models in science by drawing parallels to recent breakthroughs in atmospheric physics and weather forecasting. This includes a discussion on how these models serve as powerful new tools in the scientific quiver, offering massive computational speedups and data efficiency without replacing traditional physics-based methods.

The talk emphasizes that scientific foundation models are defined by their generalization across tasks and instruments, illustrated through examples like partial differential equation emulation and cross-modality satellite mapping. Schmude concludes that trust in these systems is built through a continuous, rigorous process of community validation and challenging the model’s limits rather than through a single, static explainability metric.

Surya 101: Sujit Roy | NASA-IMPACT#

Q&A#

What to expect: Sujit Roy from NASA-IMPACT, introduces Surya as a 366-million parameter foundation model trained on nearly 260 terabytes of high-resolution solar data. The video details an innovative architecture that uses spectral blocks and sliding window attention to process massive 4K images while remaining flexible enough to be fine-tuned on standard consumer hardware.

The talk highlights how Surya outperforms traditional models in certain cases, such as solar flare and wind forecasting by leveraging self-supervised temporal learning rather than relying on human-labeled datasets. The session concludes by framing the model not just as a labeling tool, but as a gateway for scientists to interrogate physical processes and test hypotheses directly within a latent embedding space.

What can Surya do for heliophysics? Andrés Muñoz-Jaramillo | SwRI#

What to expect: Andrés Muñoz-Jaramillo from Southwest Research Institute, introduces the potential applications of Surya in heliophysics. He transitions the workshop from high-level theory to the practical realities of building downstream applications, framing the process as a disciplined experimental exercise. He emphasizes that while Surya provides a massive repository of distilled knowledge from its time-advancement pretext task, success in fine-tuning requires researchers to establish clear baselines and value-added metrics. By using timestamps as a universal index, scientists can connect Surya’s internal representations to specific supervised tasks like flare forecasting or instrument translation, starting with small datasets to observe how the model scales.

The session also highlights the technical modularity of the architecture—including tokenizers, spectral blocks, and attention mechanisms—which users can manipulate to adapt the model to new domains. Crucially, the human researcher is often a risk factor in the AI loop, urging the community to maintain strict discipline with hold-out data to avoid self-deception. He concludes that the goal is not just to outperform existing models, but to use Surya as a catalyst for a more sophisticated, collaborative way of doing heliophysics.

Introduction to the SuryaBench dataset: Dinesha Vasanta Hegde | University of Alabama in Huntsville#

Q&A#

What to expect: Dinesha Vasanta Hegde from the University of Alabama in Huntsville introduces SuryaBench, the curated 360-terabyte dataset of native-resolution SDO data that serves as the essential fuel for the Surya foundation model. The talk details the rigorous preprocessing pipeline—including temporal synchronization, instrument degradation correction, and spatial homogenization—required to transform raw solar observations into a standardized, ML-ready resource spanning 14 years of activity. In addition to the core 13-channel dataset, the video highlights six auxiliary datasets designed for specific tasks like flare forecasting and active region segmentation, positioning SuryaBench as an open-source, high-fidelity pillar for the broader heliophysics community.

Downstream science demos#

Berkay Aydin - Solar flare forecasting#

Shah Bahauddin - Solar EUV irradiance modelling#

Vishal Upendran - Solar wind forecasting#

Downstream science Q&A#

HelioAI - what’s next?#

What to expect: Introduction of the Helio AI Portal, a strategic shift from “project-based” AI, where models often disappear after a single paper, to a permanent, scalable scientific infrastructure. This platform is designed to provide the community with analysis-ready data, documented evaluations, and essential scientific metadata, ensuring that models like Surya are not just isolated demonstrations but shared resources that can be stress-tested and reused by everyone.

What to expect: Walkthrough of HelioAI.org, a prototype web application designed to move heliophysics beyond “development in isolation” by centralizing the community’s rapidly expanding library of AI artifacts. Unlike a static website, the portal features a content management system that allows researchers to upload projects, document the performance of downstream use cases, and provide the essential metadata required for both human discovery and agentic AI applications.

What to expect: Exploration of the critical role of the SPASE (Space Physics Archive Search and Extract) metadata model, which has spent three decades standardizing how space science data is documented to ensure long-term reusability. The talk shares how this mature framework, already the de facto standard for NASA, ESA, and JAXA, is being extended to specifically describe AI/ML resources like the Surya model.

Panel Sessions#

Industry perspectives: Dan Crichton (JPL) & Massy Mascaro (Google)#

Agency perspectives: Lika Guhathakurta (NASA), Rahul Ramachandran (NASA-IMPACT), Robert Redmon (Director of NOAA Center for AI), Therese Moretto Jorgensen (NASA)#

Science perspectives: Raul Ramos Pollan, David Fouhey (NYU), Tzu-Wei Fang (NOAA)#