[DRMAA-WG] Meeting Minutes - Conference call -Mar 31th -

Peter Tröger peter at troeger.eu
Wed Apr 1 04:07:40 CDT 2009


Participants in the call: Hrabri, Daniel T., Peter

Meeting minutes from Feb 17th were accepted.

> 1. Meeting secretary for this meeting?

Peter.

> 2. Job sub-state data type - final voting:
> http://www.ogf.org/pipermail/drmaa-wg/2009-February/001007.html

Decision for object / void pointer approach. The returned data  
structure can be defined by the language binding or the  
implementation, as long as the jobStatus() signature contains a  
generic pointer type.

> 3. Partial time stamp replacement:
> http://www.ogf.org/pipermail/drmaa-wg/2009-February/001008.html

Decision for replacement by RFC822 strings.

> 4. TERMINATED vs. FAILED state discussion:
> http://www.ogf.org/pipermail/drmaa-wg/2009-March/001012.html

Option 2 from the original mail is now highly preferred. TERMINATED  
state should express that an external entity (e.g. user or DRM system)  
stopped the job before finishing. For POSIX-aligned systems, this  
could be formulated as reception of a signal by "the job". In  
contrast, FAILED state now expresses that the application stopped on  
its own before finishing. For POSIX-aligned systems, this could be  
formulated as reception of a signal "by the job's application process".

We ask for comments from PBS and LSF experts (FedStage ?!?). Do these  
systems provide enough error information to distinguish between these  
two states  ? For SGE and Condor, Dan and Peter already agreed.

This decision also has some implications on the JobInfo structure, the  
job state flow and the error conditions for job templates.

> 5. Discussion Kick-Off: Remodeling the JobInfo interface

New attribute for job state at the time of querying, since  
"terminatingSignal" now only makes sense in the FAILED state.



More information about the drmaa-wg mailing list