[GRIDCPR-WG] CPR Document - final call

Nathan Stone stone at psc.edu
Thu Mar 29 09:18:25 CDT 2007


	Indeed, it seems well enough "contained" that it could be added as a 
separate item.  It also would seem to make the Use Cases document more 
current (i.e. more relevant).

FWIW,
	Nathan.

Thilo Kielmann wrote:
> All,
> 
> I am not sure what to do with this comment. (And I am not sure if we can
> do this, formally, at this stage of the process.)
> 
> Anyway, in the interest of doing something reasonable, I am in favour of
> adding this use case.
> 
> 
> Thilo
> 
> 
> On Thu, Mar 29, 2007 at 03:23:58PM +0200, Eduardo Huedo Cuesta wrote:
>> From: Eduardo Huedo Cuesta <ehuedo at fdi.ucm.es>
>> To: Andre Merzky <andre at merzky.net>
>> Cc: gridcpr-wg at ogf.org, SAGA RG <saga-rg at ogf.org>
>> Subject: Re: [GRIDCPR-WG] CPR Document - final call
>>
>> Dear All,
>>
>> On behalf of the GridWay team, we would like to propose another consumer 
>> use case that we think is within the scope of GridCPR. See below.
>>
>> Best Regards,
>>
>> Eduardo Huedo.
>>
>> --------------------------------------------------------------------
>>
>> GridWay Metascheduler
>> =====================
>>
>> The GridWay Metascheduler [1, 2], now a Globus project, adapts job 
>> execution to changing grid conditions by providing fault recovery 
>> mechanisms, dynamic scheduling, on-request migration and opportunistic 
>> migration [3]. Migration is implemented by restarting the job on the new 
>> candidate host. The job should therefore generate restart files at 
>> regular intervals so that it can continue execution from a given point. 
>> If no restart files are provided, the job is restarted from the 
>> beginning. GridWay periodically transfers the architecture-independent 
>> restart files to the client machine or to a checkpoint server (GridFTP 
>> URL).
>>
>> Jobs submitted with GridWay could benefit from GridCPR systems providing 
>> standard and uniform APIs and services for portable checkpoint 
>> generation and storage.
>>
>> Functional requirements
>> . API for application state writing and reading.
>> . Services for failure notification.
>> . Services for checkpoint data management.
>>
>> [1] GridWay Metascheduler. http://www.gridway.org/.
>> [2] E. Huedo, R. S. Montero, I. M. Llorente: A framework for adaptive 
>> execution on grids. Software - Practice and Experience 34(7): 631-651, 
>> 2004.
>> [3] E. Huedo, R. S. Montero, I. M. Llorente: Evaluating the reliability 
>> of computational grids from the end user's point of view. Journal of 
>> Systems Architecture 52(12): 727-736, 2006.
>>
>> Andre Merzky wrote:
>>> Hi groups, 
>>>
>>> as discussed earlier, we put some effort into the GridCPR
>>> documents to get them back into the editor pipeline.
>>> Thanks to Nathan and others, all public comments on both the
>>> CPR use case document and the CPR architecture document have
>>> now been addressed, and both are to be submitted to the OGF editor.
>>>
>>> The docs are supposed to represent group consensus after
>>> submission, so this mail is a one-week final call on the
>>> mailing list: please review the documents and comment on
>>> them! "Speak now or forever hold your peace ..." :-)
>>>
>>> Cheers, Andre.
>>>
>>>   
>>> ------------------------------------------------------------------------
>>>
>>> --
>>>   gridcpr-wg mailing list
>>>   gridcpr-wg at ogf.org
>>>   http://www.ogf.org/mailman/listinfo/gridcpr-wg
>> -- 
>>
>> GridWay, Meta-scheduling Technologies for the Grid! http://www.gridway.org
>>
>> **************************************************
>>
>> Dr. Eduardo Huedo Cuesta
>> Departamento de Arquitectura de Computadores
>>   y Automática
>> Facultad de Informática
>> Universidad Complutense de Madrid
>> C/ Prof. García Santesmases s/n
>> 28040 Madrid
>> Spain
>>
>> Tel:   +34 91 394 76 03
>> Fax:   +34 91 394 75 27
>> Email: ehuedo at fdi.ucm.es
>>
>> **************************************************
> 
> 
> 

-- 
  +-----------------------------+----------------------------------+
  | Nathan T.B. Stone, Ph.D.    | Pittsburgh Supercomputing Center |
  | Advanced Systems Group      | 300 S. Craig St.                 |
  +-----------------------------+ Pittsburgh, PA 15213             |
  | mailto:stone at psc.edu        | phone: 412-268-4367              |
  | http://www.psc.edu/~nstone/ | fax:   412-268-5832              |
  +-----------------------------+----------------------------------+