HomeAbout TANet2News & EventsProjectsNet OperationLint to UsSite Map
April 21, 2000
TANet2 Day Technical Issues and Applications of the Next Generation Internet

June 25, 1999
NSC-NSF Joint Workshop on NGI Applications

September 20, 1999
Summer Institute in Taiwan
 

Upcoming Meetings

April 21, 2000

TANet2 Day

Technical Issues and Applications of the Next Generation Internet

Computational and Data Grids:

Distributed High-Performance Computing and Large-Scale Data Management for Science and Engineering

William E. Johnston
Lawrence Berkeley National Laboratory and NASA Ames Research Center


We use the term "Grid" to refer to the services and infrastructure needed to support distributed, high performance computing and data handling systems that incorporate geographically and organizationally dispersed, heterogeneous resources that are persistent and supported.

Several science and engineering scenarios define the type of Grids that we are working on in NASA's Information Power Grid and DOE's Science Grid:

  • multi-disciplinary science and engineering design computational modeling
  • large-scale scientific data analysis
  • real-time data analysis, e.g. for on-line scientific instruments
  • coupling instruments/experiments and computational models
  • generation and management of large, complex data archives
These scenarios all require some sort of resource aggregation and/or advance scheduling of resources, and much of the Grid technology is targeted at these advanced capabilities. However, the biggest near-term impact of the Grid may be a combination of a service delivery model that promotes remote use of single resources, and an implementation that provide a uniform interface to many different resources.

The resulting Grid architecture includes:

  • Tools for constructing collaborative, application oriented Problem Solving Environments / Frameworks (the primary user interfaces for Grids)
  • Programming environments, tools, and services providing a variety of approaches for building applications that use aggregated computing and storage resources, and federated data sources
  • A comprehensive and consistent set of location independent tools and services for accessing and managing and providing uniform access to dynamic collections of widely distributed resources: heterogeneous computing systems, storage systems, real-time data sources and instruments, human collaborators, and communications systems
  • Operational infrastructure including management tools for distributed systems and distributed resources, user services, accounting and auditing, strong and location independent user authentication and authorization, and overall system security services

The vision for science computation and data Grids is that they will provide significant new capabilities to scientists and engineers by facilitating routine construction of information based problem solving environments / frameworks. Such Grids will knit together widely distributed computing, data, instrument, and human resources into just-in-time systems that can address complex and large-scale computing and data analysis problems.

This talk will discuss the motivation and architecture of science Grids, and implementation progress of NASA's IPG: A prototype production Grid.


Last updated: