[communities] GGF Proposal Submission

ctjordan at sdsc.edu ctjordan at sdsc.edu
Wed Nov 30 17:55:05 CST 2005


proposers_name: Chris Jordan 
 
affiliation: San Diego Supercomputer Center 

email: ctjordan at sdsc.edu 

proposed_title: A Wide-Area Parallel Filesystem for the TeraGrid: A Case Study 

session_type: Presentation (two presenters) 

proposed_duration: 60 mins 

target_audience: Users/Engineers 

num_attendees: 50-75 

abstract: TeraGrid deployed the first production, grid-enabled high performance parallel file system using 500 TB of disk and IBM General Parallel File System (GPFS). Challenges such as latency, which were overcome in the development of the deployment model are explained, with notes on the strategies taken in overcoming these challenges. The use of existing Grid technologies to enable UID-mapping between administrative domains is described in detail, along with the relationship of this work to evolving Grid standards work. The experience of pilot projects such as NVO, ENZO, and BIRN, and the production workflows enabled by the global filesystem are related. 

synopsis: Session Goals:
After the session is completed, attendees should understand the need for wide-area, Grid-enabled filesystems, and the particular approach to the problem taken in this project. Attendees should understand the relationship of the GPFS-WAN work to the evolution of Grid standards and existing Grid technologies. Attendees who are users should understand how a Grid file system can enable new workflows, and attendees who are engineers should understand the specific technical challenges inherent in large Grid-based file system deployment.

Outline:

Introduction to TeraGrid
Motivation - TeraGrid user needs
Brief Introduction to GPFS
Previous work on Grid file systems at SDSC
Security Requirements
Performance Requirements
Reliability Requirements

Initial Hardware Resources and Configuration
Cluster Authentication and Authorization
User Identification and UID-mapping
GSI-based UID-mapping

Special Considerations for Grid File System design
Metadata and Management servers
Data Replication Options
Final Hardware Configuration

Network Performance Issues
OS-Specific and GPFS Network tuning
Local and Remote performance benchmarks
Network Reliability Issues

Pilot User Projects - BIRN and NVO
Overview of BIRN/lddmm characteristics
Overview of NVO/2MASS characteristics
ENZO use of GPFS-WAN
BIRN use of GPFS-WAN
2MASS use of GPFS-WAN
Multi-site task workflows

GPFS-WAN and Grid Standards
Access mechanisms/Interface issues
Policies for Grid File Systems
GFS WG/RNS
OGSA Data WG
Challenges for Grid File System Standards

Summary - Achievements and Goals
GPFS-WAN as a Production Grid Resource
User Adventures
Future Work
 

tech_requirements: None 

prereq_participants: Basic knowledge of parallel file system concepts and the Globus Toolkit security layer will be helpful. 

advertise_suggestion: The work described is relevant to the OGSA-Data, GFS, GSM, and ByteIO working groups, at least, and each of these WG mailing lists should be informed of the presentation, should it be accepted. TeraGrid news items and mailing lists could also be used to inform interested individuals of the presentation. 





More information about the communities mailing list