[Pgi-wg] PGI Job State Model for discussions (UML: zargo + xmi + png)

Oxana Smirnova oxana.smirnova at hep.lu.se
Thu May 28 15:16:11 CDT 2009


Hi,

I actually tend to agree with Etienne on the point that job interruption should be treated independently of the reason that caused it.

A user may want to kill a lower priority job in order to e.g. let other jobs to go ahead in the queue, and then re-start the killed job. Or if a user happens to know that the input file is not available yet, she may want to stop the job in order not to waste the resources on download attempts, and restart the job a bit later, when input appears. Or a user may suddenly realize that she had a too short (or too long) proxy, and would like to restart a job with a newer one, without changing anything else. In any such case a job is restartable, and even may be killed exactly with the purpose to restart later.

Technically, it's not quite correct to assume that users kill jobs because they never _want_ to restart them. They probably do.

Cheers,
Oxana


Etienne URBAH wrote:
> Aleksandr,
> 
> Thank you very much for studying and criticizing my OGF PGI Job State
> Model (based on yours) :
> 
> I agree that a job 'failed because of failure detected during job
> processing/execution may lead to restartable job if reason of failure
> may be eleminated by user action.'
> 
> In that case, the job needs User action, so its state should be one of
> the following :
> -  Pre-processing-Hold
> -  Delegated-Hold
> -  Post-processing-Hold
> 
> Besides, I suggest that when a User cancels a job which is NOT Hold, the
> Execution Service simply handles that as a job failure.  That permits to
> avoid 4 supplemental transitions toward Failed-Cancelled.
> 
> Please continue to criticize ...
> 
> Best regards.
> 
> -----------------------------------------------------
> Etienne URBAH         LAL, Univ Paris-Sud, IN2P3/CNRS
>                       Bat 200   91898 ORSAY    France
> Tel: +33 1 64 46 84 87      Skype: etienne.urbah
> Mob: +33 6 22 30 53 27      mailto:urbah at lal.in2p3.fr
> -----------------------------------------------------
> 
> 
> On Thu, 28 May 2009, Aleksandr Konstantinov wrote:
>> On Wednesday 27 May 2009 22:04, Etienne URBAH wrote:
>>> Morris, Aleksandr and All,
>>>
>>> Concerning the OGF PGI Job State Model (based on Aleksandr's) :
>>>
>>> I agree that :
>    -  Pre-processing includes Automatic Stage-In.
> 
>    -  Delegated includes Running.
> 
>    -  Post-processing includes Automatic Stage-Out.
> 
>>>
>>> I suggest that :
>>>
>>> - Pending should be renamed as Submitted
>>>
>>> - Deep internal states do NOT need to be exposed, but states
>>> requiring User action do.
>>>
>    -  Pre-processing-Hold  includes Failed-Recoverable and Manual Stage-In.
> 
>    -  Delegated-Hold       includes Failed-Recoverable.
> 
>    -  Post-processing-Hold includes Failed-Recoverable and Manual
> Stage-Out.
> 
>>> - Cancellation by the User and Failure detected by the Execution
>>> Service lead to the same result, so the same state.
>>>
>>
>> From my point of view job canceled by user is canceled for good. But
>> one failed because of failure detected during job processing/execution
>> may
>> lead to restartable job if reason of failure may be eliminated by user
>> action.
>>
>>
>> A.K.
>>
>>
>>> Therefore, using the free ArgoUML tool, I have created my own
>>> proposal, which is available at
>>> http://forge.gridforum.org/sf/go/doc15655?nav=1 with the ZARGO, XMI
>>> and PNG formats.
>>>
>>> Please criticize and improve it !
>>>
>    The ArgoUML tool is open source and available for Windows, Mac OS X
> and Linux at http://argouml.tigris.org/
> 
>>> Best regards.
>>>
>>> -----------------------------------------------------
>>> Etienne URBAH         LAL, Univ Paris-Sud, IN2P3/CNRS
>>>                        Bat 200   91898 ORSAY    France
>>> Tel: +33 1 64 46 84 87      Skype: etienne.urbah
>>> Mob: +33 6 22 30 53 27      mailto:urbah at lal.in2p3.fr
>>> -----------------------------------------------------
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> Pgi-wg mailing list
> Pgi-wg at ogf.org
> http://www.ogf.org/mailman/listinfo/pgi-wg

-- 
______________________________________________________________________

 Dr. Oxana Smirnova * http://www.hep.lu.se * oxana.smirnova at hep.lu.se
   Institute of Physics, Dept. for Experimental High Energy Physics
           Lund University Box 118, S-22100 Lund, SWEDEN
   Tel. +46(46)222.76.99, +46(709)22.46.57 * Fax: +46(46)222.40.15


More information about the Pgi-wg mailing list