HPC Challenge Presentations
           
Event Type Start Time End Time Rm # Chair  

 

HPC Challenge 10:30AM 12:00PM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

HPC Challenge Presentations
  Speakers/Presenter:
Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)

 

HPC Challenge 10:30AM 10:45AM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

Global Data-Intensive Grid Collaboration
  Speakers/Presenter:
Kim Branson (Walter and Eliza Hall Institure for Medical Research), Rajkumar Buyya (University of Melbourne), Susumu Date (Osaka University), Benjamin Khoo (IBM Global Services), Baden Hughes (University of Melbourne), Rafael Moreno-Vozmediano (Complutense University of Madrid), Jon Smillie (Australian National University), Srikumar Venugopal (University of Melbourne), Jia Yu (University of Melbourne), Lyle Winton (University of Melbourne)

 

HPC Challenge 10:45AM 11:00AM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

Using Virtual Joins in DataSpace to Mine and Visualize Distributed Data
  Speakers/Presenter:
Robert L Grossman, University of Illinois at Chicago and Open Data), Dave Hanley (University of Illinois at Chicago), Steve Eick (University of Illinois at Chicago and SSS Software), Yunhong Gu (University of Illinois at Chicago), Swapna Guddanti (University of Illinois at Chicago), Xinwei Hong (University of Illinois at Chicago), Pavan Kasturi (University of Illinois at Chicago), Parasarathy Krishnamurthy (University of Illinois at Chicago), Joel Mambretti (Northwestern University), Rajeet Nair (University of Illinois at Chicago), Alex Szalay (John Hopkins), Steve Wallace (Indiana University), Michael Welge (University of Illinois at Urbana Champaign)

 

HPC Challenge 11:00AM 11:15AM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

Global Analysis of Arthropod Evolution
  Speakers/Presenter:
Rainer Keller (HLRS (University of Stuttgart), Craig Stewart (UITS, Indiana University), John Colbourne (Center for Genomics and Bioinformatics, Indiana University), Matthias Hess (HLRS, U. Stuttgart), David Hart (UITS, Indiana U.), Jennifer Steinbachs (Center for Genomics and Bioinformatics, Indiana University), Uwe Woessner (HLRS, U. Stuttgart), Donald Berry (UITS, Indiana U.), Richard Repasky (UITS, Indiana U.), Matthias Mueller (HLRS, U. Stuttgart), Huian Li (UITS, Indiana U.), Gary W. Stuart (Center for), Michael Resch (HLRS, U. Stuttgart), Huian Li (UITS, Indiana U.), Eric Wernert (UITS, Indiana U.), Markus Buchhorn (Australia National University), Hiroshi Takemiya (National Institute of Advanced Industrial Science and Technology, Japan), Rim Belhaj (ISET'Com, Tunesia), Wolfgang E. Nagel (Center for High Performance Computing (ZHR), Technical University of Dresden), Sergui Sanielevici (Pittsburgh Supercomputing Center), Sergio takeo Kofuji, LCCA/CCE-USP), David Bannon (Victorian Partnership for Advanced Computing, Australia), Norihiro Nakajima (Japan Atomic Energy Research Institute), Rosa Badia (CEPBA-IBM Research Institute), Mark A. Miller (San Diego Supercomputer Center), Hyungwoo Park (Korea Institute of Science and Technology Information), Rick Stevens (Argonne National Laboratory), Fang-Pang Lin (National Center for High Performance Computing), John Brooke (Manchester Computing), David Moffett (Purdue University), Tan Tin Wee (National University of Singapore), Greg Newby (Arctic Region Supercomputer Center), J.C.T. Poole (CACR, Cal-Tech), Ramched Hamza (Sup'com, Tunesia), Mary Papakhian (UITS, Indiana U.), Leigh Grundhoefer (UITS, Indiana U.), Peter Cherbas (Center for Genomics and Bioinformatics, Indiana U.), John Trueman (Australia National University)

 

HPC Challenge 11:15AM 11:30AM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

Real-time Collaborative Knowledge Discovery Services on the Grid
  Speakers/Presenter:
Moustafa Ghanem (Discovery Net, Department of Computing, Imperial College London), Yike Guo (Discovery Net, Dept of Computing, Imperial College London), John Hassard (Discovery Net, Dept of Physics, Imperial College London), Jian Guo Liu (Discovery Net, Dept of Earh Sciences, Imperial College London), Moustafa Ghanem (Discovery Net, Dept of Computing, Imperial College London), Patrick Wendel (Discovery Net, Dept of Computing, Imperial College London), Filippia Sofia Emmanouil (Discovery Net, Dept of Computing, Imperial College London), Vasa Curcin (Discovery Net, Dept of Computing, Imperial College London), Jameel Syed (Discovery Net, Dept of Computing, Imperial College London), Anthony Rowe (Discovery Net, Dept of Computing, Imperial College London), Michelle Osmond (Discovery Net, Dept of Computing, Imperial College London), Mark Richards (Discovery Net, Dept of Physics, Imperial College London), Matthew Howard (Deltadot Ltd.), Martin Kohler (InforSense Ltd.), Dimitirios Sideris (Discovery Net, Dept of Physics, Imperial College London), Stuart Hassard (Deltadot Ltd)

 

HPC Challenge 11:30AM 11:45AM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

Transcontinental RealityGrids for Interactive Collaborative Exploration of Parameter Space (TRICEPS)
  Speakers/Presenter:
Stephen M Pickles (University of Manchester), Peter V Coveney (University College London), Bruce M Boghosian (Tufts University)

 

HPC Challenge 11:45AM 12:00PM 40-41 Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
 
Title:

Astronomical Object Survey on Grid Datafarm
  Speakers/Presenter:
Naotaka YAMAMOTO (GTRC,AIST), Osamu TATEBE (GTRC,AIST), Satoshi SEKIGUCHI (GTRC,AIST)
             

 

     
  Session: HPC Challenge Presentations
  Title: HPC Challenge Presentations
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 10:30AM - 12:00PM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
   
  Description:
  The HPC Challenge honors participants for innovative uses of high performance computing resources. It provides opportunities for contestants to showcase applications and platforms. The categories are:

Most Innovative Data-Intensive Application: With the increasing ability to create, store, and re-access larger and larger datasets, one thing remains constant: the importance of mining such data to glean useful pieces of knowledge. The award will be presented to the entry that uses the most novel and/or inventive approaches in mining data, visualizing data, or a combination of these tasks.

Most Geographically Distributed Application: As the Grid continues to decrease the virtual distance between computers around the world, the ability to solve challenging computational problems with combinations of diverse system architectures is continuing to strengthen. The award will be presented to the team with the most geographically distributed application to solve a significantly complex problem.

This session will allow teams to present their projects to the SC03 audience and the HPC Challenge judges. The actual award will be made at the Awards plenary session on Thursday.
  Link: --
   

 

     
  Session: HPC Challenge Presentations
  Title: Global Data-Intensive Grid Collaboration
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 10:30AM - 10:45AM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Kim Branson (Walter and Eliza Hall Institure for Medical Research), Rajkumar Buyya (University of Melbourne), Susumu Date (Osaka University), Benjamin Khoo (IBM Global Services), Baden Hughes (University of Melbourne), Rafael Moreno-Vozmediano (Complutense University of Madrid), Jon Smillie (Australian National University), Srikumar Venugopal (University of Melbourne), Jia Yu (University of Melbourne), Lyle Winton (University of Melbourne)
   
  Description:
  We plan to demonstrate various applications from natural language processing (Melbourne University) and particle physics (Belle collaboration: KEK-Japan and School of Physics@Melbourne) to portfolio analysis (Complutense University of Madrid). For each application area, we developed a data catalogue directory and remote databases (e.g., protein data banks) access mechanism along with a brokering system (Gridbus Data-Grid scheduler).

Applications will make use of real-world distributed data-sets. Eg, high-energy physics demonstration will utilise datasets generated from the Belle experiment particle generator/detector based at the KEK Laboratory in Japan.
  Link: --
   

 

     
  Session: HPC Challenge Presentations
  Title: Using Virtual Joins in DataSpace to Mine and Visualize Distributed Data
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 10:45AM - 11:00AM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Robert L Grossman, University of Illinois at Chicago and Open Data), Dave Hanley (University of Illinois at Chicago), Steve Eick (University of Illinois at Chicago and SSS Software), Yunhong Gu (University of Illinois at Chicago), Swapna Guddanti (University of Illinois at Chicago), Xinwei Hong (University of Illinois at Chicago), Pavan Kasturi (University of Illinois at Chicago), Parasarathy Krishnamurthy (University of Illinois at Chicago), Joel Mambretti (Northwestern University), Rajeet Nair (University of Illinois at Chicago), Alex Szalay (John Hopkins), Steve Wallace (Indiana University), Michael Welge (University of Illinois at Urbana Champaign)
   
  Description:
  DataSpace is an example of a data web, a web services based framework for exploring, visualizing, analyzing, and mining remote and distributed data. DataSpace employs a specialized internet protocol called the DataSpace Transfer Protocol or DSTP for accessing data, metadata, and keys, and for common operations such as range queries, sampling etc.

We show that this mechanism is practical and provides a good foundation for distribued data mining and visualization by demonstrating DataSpace applications involving bioinformatics data, census data, and astronomical data. We also show that DSTP can be integrated easily easily into third party data analysis and data mining applications such as R and D2K.

Finally, we show that this approach can scale to very large remote and distributed data sets. We have integrated our open source DSTP Servers with specialized high performance protocols, such as SABUL, which can work effectively with data over networks with high bandwidth delay products.
  Link: --
   

 

     
  Session: HPC Challenge Presentations
  Title: Global Analysis of Arthropod Evolution
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 11:00AM - 11:15AM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Rainer Keller (HLRS (University of Stuttgart), Craig Stewart (UITS, Indiana University), John Colbourne (Center for Genomics and Bioinformatics, Indiana University), Matthias Hess (HLRS, U. Stuttgart), David Hart (UITS, Indiana U.), Jennifer Steinbachs (Center for Genomics and Bioinformatics, Indiana University), Uwe Woessner (HLRS, U. Stuttgart), Donald Berry (UITS, Indiana U.), Richard Repasky (UITS, Indiana U.), Matthias Mueller (HLRS, U. Stuttgart), Huian Li (UITS, Indiana U.), Gary W. Stuart (Center for), Michael Resch (HLRS, U. Stuttgart), Huian Li (UITS, Indiana U.), Eric Wernert (UITS, Indiana U.), Markus Buchhorn (Australia National University), Hiroshi Takemiya (National Institute of Advanced Industrial Science and Technology, Japan), Rim Belhaj (ISET'Com, Tunesia), Wolfgang E. Nagel (Center for High Performance Computing (ZHR), Technical University of Dresden), Sergui Sanielevici (Pittsburgh Supercomputing Center), Sergio takeo Kofuji, LCCA/CCE-USP), David Bannon (Victorian Partnership for Advanced Computing, Australia), Norihiro Nakajima (Japan Atomic Energy Research Institute), Rosa Badia (CEPBA-IBM Research Institute), Mark A. Miller (San Diego Supercomputer Center), Hyungwoo Park (Korea Institute of Science and Technology Information), Rick Stevens (Argonne National Laboratory), Fang-Pang Lin (National Center for High Performance Computing), John Brooke (Manchester Computing), David Moffett (Purdue University), Tan Tin Wee (National University of Singapore), Greg Newby (Arctic Region Supercomputer Center), J.C.T. Poole (CACR, Cal-Tech), Ramched Hamza (Sup'com, Tunesia), Mary Papakhian (UITS, Indiana U.), Leigh Grundhoefer (UITS, Indiana U.), Peter Cherbas (Center for Genomics and Bioinformatics, Indiana U.), John Trueman (Australia National University)
   
  Description:
  fastDNAml is a well-established parallel program for inference of phylogenetic relationships based on DNA sequences. PACX is a tool for linking and geographically distributing MPI codes (http://www.hlrs.de/organization/pds/projects/pacx-mpi/) that permits use of distributed MPI applications across heterogeneous systems with the use of multiple, vendor- or system-specific optimized MPI libraries. We will use with Dimemas to predict the performance of the application implemented across the grid created for this project, and will attempt to match the pattern of work distribution to variations in communication speeds across the network to optimize the overall effectiveness of the calculations.

Our approach will be to link many supercomputers and workstations across as many continents as possible, using PACX to link fastDNAml so as to solve problems that would otherwise be too large to solve in a reasonable amount of time. Our plan, then, is to simultaneously demonstrate a global-scale distributed computing application and at the same time perform meaningful analysis of an important biological problem.
  Link: --
   

 

     
  Session: HPC Challenge Presentations
  Title: Real-time Collaborative Knowledge Discovery Services on the Grid
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 11:15AM - 11:30AM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Moustafa Ghanem (Discovery Net, Department of Computing, Imperial College London), Yike Guo (Discovery Net, Dept of Computing, Imperial College London), John Hassard (Discovery Net, Dept of Physics, Imperial College London), Jian Guo Liu (Discovery Net, Dept of Earh Sciences, Imperial College London), Moustafa Ghanem (Discovery Net, Dept of Computing, Imperial College London), Patrick Wendel (Discovery Net, Dept of Computing, Imperial College London), Filippia Sofia Emmanouil (Discovery Net, Dept of Computing, Imperial College London), Vasa Curcin (Discovery Net, Dept of Computing, Imperial College London), Jameel Syed (Discovery Net, Dept of Computing, Imperial College London), Anthony Rowe (Discovery Net, Dept of Computing, Imperial College London), Michelle Osmond (Discovery Net, Dept of Computing, Imperial College London), Mark Richards (Discovery Net, Dept of Physics, Imperial College London), Matthew Howard (Deltadot Ltd.), Martin Kohler (InforSense Ltd.), Dimitirios Sideris (Discovery Net, Dept of Physics, Imperial College London), Stuart Hassard (Deltadot Ltd)
   
  Description:
  We present the world's first fully Grid-enabled Knowledge Discovery Platform based on:

a) Integrating both Globus (GT3) and Unicore services into the Discovery Net infrastructure,
b) Providing a higher level of workflow definition allowing end users to describe abstract knowledge discovery workflows.
c) Providing an efficient execution middleware that translates the end users'_ abstract workflows into high performance grid-enabled workflows.
d) Providing the infrastructure for distributed workflow warehousing, indexing, querying and retrieval.

The end result is an unprecedented collaborative and distributed knowledge discovery platform allowing users to dynamically connect to and integrate data from remote data sources, find and locate suitable data analysis and mining components, schedule the execution of distributed knowledge discovery workflows using the distributed components using standardised grid services, collect and interactively analyse the distributed results, publish the results of the analysis over the grid, and more importantly publish both the abstract and executable workflows that generated such results as grid services that can be modified and executed by their co-workers.
  Link: --
   

 

     
  Session: HPC Challenge Presentations
  Title: Transcontinental RealityGrids for Interactive Collaborative Exploration of Parameter Space (TRICEPS)
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 11:30AM - 11:45AM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Stephen M Pickles (University of Manchester), Peter V Coveney (University College London), Bruce M Boghosian (Tufts University)
   
  Description:
  In this collaborative project, many simulations will be spawned and steered by scientists geographically distributed across two continents. We rely on several unique features of RealityGrid that have not been demonstrated before on this scale. Using our malleable checkpoint/restart capabilities, we can checkpoint a simulation running on one system on the Grid, launch clones of this simulation on other systems of different architecture (potentially on a different number of processors), steer the spawned simulations into different regions of parameter space, then monitor and compare their subsequent divergent evolution through on-line visualization. Our integration of RealityGrid visualizations with Access Grid will make it possible for scientists (and audiences) at geographically distant sites to share visualizations of the underlying physical systems and discuss them even while the simulations are in progress. We also showcase the use of Grid services for computational steering, based on our novel implementation of the Open Grid Services Infrastructure (OGSI::Lite).
  Link: --
   

 

     
  Session: HPC Challenge Presentations
  Title: Astronomical Object Survey on Grid Datafarm
  Chair: Christine E. Cuicchi (NAVOCEANO/DoD MSRC) and Radha Nandkumar (NCSA)
  Time: Wednesday, November 19, 11:45AM - 12:00PM
  Rm #: 40-41
  Speaker(s)/Author(s):  
  Naotaka YAMAMOTO (GTRC,AIST), Osamu TATEBE (GTRC,AIST), Satoshi SEKIGUCHI (GTRC,AIST)
   
  Description:
  We will construct world-wide data analysis environment for the astronomical field using the Gfarm Data Grid middleware. The Gfarm provides world-wide large-scale virtual file system with replica management called the Gfarm file system, and distributed and parallel data analysis environment. Ten terabyte-scale public archival data taken by the SUBARU telescope will be stored in the file system, which will be calibrated and analyzed in parallel to find new solar system objects using a set of FITS file access libraries (WCSTools), data analysis tools such as data reduction (NEKO soft), and a source extractor tool (SExtractor).
  Link: --