Data Management in Grids
           
Event Type Start Time End Time Rm # Chair  

 

Paper 10:30AM 11:00AM 38-39 Gregor von Laszewski (Argonne National Laboratory)
 
Title:

A Metadata Catalog Service for Data Intensive Applications
  Speakers/Presenter:
Gurmeet Singh (Information Sciences Institute, University of Southern California), Shishir Bharathi (Information Sciences Institute, University of Southern California), Ann Chervenak (Information Sciences Institute, University of Southern California), Ewa Deelman (Information Sciences Institute, University of Southern California), Carl Kesselman (Information Sciences Institute, University of Southern California), Mary Manohar (Information Sciences Institute, University of Southern California), Sonal Patil (Information Sciences Institute, University of Southern California), Laura Pearlman (Information Sciences Institute, University of Southern California)

 

Paper 11:00AM 11:30AM 38-39 Gregor von Laszewski (Argonne National Laboratory)
 
Title:

Grid-Based Galaxy Morphology Analysis for the National Virtual Observatory
  Speakers/Presenter:
Ewa Deelman (ISI), Raymond Plante (NCSA), Carl Kesselman (USC/ISI), Gurmeet Singh (USC/ISI), Mei Su (USC/ISI), Gretchen Greene (Space Telescope Science Institute), Robert Hanisch (Space Telescope Science Institute), Niall Gaffney (Space Telescope Science Institute), Antonio Volpicelli (Space Telescope Science Institute), James Annis (Fermi Lab), Vijay Sekhri (Fermi Lab), Tamas Budavari (John Hopkins University), Maria Nieto-Santisteban (John Hopkins University), William O'Mullane (John Hopkins University), David Bohlender (Canadian Astrophysical Data Center), Tom McGlynn (NASA), Arnold Rots (Simthsonian Astrophysical Observatory), Olga Pevunova (NASA)

 

Paper 11:30AM 12:00PM 38-39 Gregor von Laszewski (Argonne National Laboratory)
 
Title:

The Livny and Plank-Beck Problems: Studies in Data Movement on the Computational Grid
  Speakers/Presenter:
Matthew S. Allen (University of California, Santa Barbara), Rich Wolski (University of California, Santa Barbara)
             

 

     
  Session: Data Management in Grids
  Title: A Metadata Catalog Service for Data Intensive Applications
  Chair: Gregor von Laszewski (Argonne National Laboratory)
  Time: Thursday, November 20, 10:30AM - 11:00AM
  Rm #: 38-39
  Speaker(s)/Author(s):  
  Gurmeet Singh (Information Sciences Institute, University of Southern California), Shishir Bharathi (Information Sciences Institute, University of Southern California), Ann Chervenak (Information Sciences Institute, University of Southern California), Ewa Deelman (Information Sciences Institute, University of Southern California), Carl Kesselman (Information Sciences Institute, University of Southern California), Mary Manohar (Information Sciences Institute, University of Southern California), Sonal Patil (Information Sciences Institute, University of Southern California), Laura Pearlman (Information Sciences Institute, University of Southern California)
   
  Description:
  Today's advances in computational, storage and network technologies as well as middleware such as the Globus Toolkit, allow scientists to expand their scientific horizons and develop sophisticated data intensive application. These applications produce and analyze terabytes and petabytes of data that are distributed in millions of files or objects. In order to efficiently manage the large data sets, the metadata about the data needs to be managed. There are various types of metadata, and it is likely that a range of metadata services will exist in grid environments that are specialized for different types of metadata cataloguing and discovery. In this paper, we present a design of a Metadata Catalog Service (MCS) that provides a mechanism for storing and accessing descriptive metadata and allows users to query for data items based on desired attributes. We describe our experience in using the MCS with several applications and present a scalability study of the service.
  Link: Download PDF
   

 

     
  Session: Data Management in Grids
  Title: Grid-Based Galaxy Morphology Analysis for the National Virtual Observatory
  Chair: Gregor von Laszewski (Argonne National Laboratory)
  Time: Thursday, November 20, 11:00AM - 11:30AM
  Rm #: 38-39
  Speaker(s)/Author(s):  
  Ewa Deelman (ISI), Raymond Plante (NCSA), Carl Kesselman (USC/ISI), Gurmeet Singh (USC/ISI), Mei Su (USC/ISI), Gretchen Greene (Space Telescope Science Institute), Robert Hanisch (Space Telescope Science Institute), Niall Gaffney (Space Telescope Science Institute), Antonio Volpicelli (Space Telescope Science Institute), James Annis (Fermi Lab), Vijay Sekhri (Fermi Lab), Tamas Budavari (John Hopkins University), Maria Nieto-Santisteban (John Hopkins University), William O'Mullane (John Hopkins University), David Bohlender (Canadian Astrophysical Data Center), Tom McGlynn (NASA), Arnold Rots (Simthsonian Astrophysical Observatory), Olga Pevunova (NASA)
   
  Description:
  As part of the development of the National Virtual Observatory (NVO), a data grid for astronomy, we have developed a prototype science application to explore the dynamical history of galaxy clusters by analyzing the galaxies' morphologies. The purpose of the prototype is to explore how grid-based technologies can be used to provide specialized computational services within the NVO environment. Although we describe the scientific goals of the application, this paper focuses on the key technology components, particularly Chimera and Pegasus which are used to create and manage the computational workflow. We illustrate how the components were connected and driven from the application's portal.
  Link: Download PDF
   

 

     
  Session: Data Management in Grids
  Title: The Livny and Plank-Beck Problems: Studies in Data Movement on the Computational Grid
  Chair: Gregor von Laszewski (Argonne National Laboratory)
  Time: Thursday, November 20, 11:30AM - 12:00PM
  Rm #: 38-39
  Speaker(s)/Author(s):  
  Matthew S. Allen (University of California, Santa Barbara), Rich Wolski (University of California, Santa Barbara)
   
  Description:
  Over the last few years the Grid Computing research community has become interested in developing data intensive applications for the Grid. These applications face significant challenges because their widely distributed nature makes it difficult to access data with reasonable speed. In order to address this problem, we feel that the Grid community needs to develop and explore data movement challenges that represent problems encountered in these applications. In this paper, we will identify two such problems that we have dubbed the Livny Problem and the Plank-Beck Problem. We will also present data movement scheduling techniques that we have developed to address these problems.
  Link: Download PDF