[saga-rg] Fwd (hupfeld at zib.de): Re: SAGA Strawman API Version 1.0
Thorsten Schuett
schuett at zib.de
Mon Jun 20 02:10:14 CDT 2005
On Saturday 18 June 2005 17:21, John Shalf wrote:
> Hi, the quoted conversation its just one side of a conversation in AIM.
> As such, I think many of these statements are likely to get
> misinterpreted because it does not include Andre's side of the
> conversation (every other paragraph was an exchange between us).
>
> So, in order to avert any flame wars on this list, the focus here is
> 1) Its probably inappropriate for SAGA to definine a consistency
> model,
> given it is only an API. However, it should be able to accomodate
> consistency models
> that exist in the underlying implementations. This is not to say
> consistency models are not important, but that the API can't be used
> to impose a consistency model. Its the other way around... the
> consistency model is going to impose conditions on the API.
>
> 2) At GGF it would be worthwhile to walk through some existing
> consistency models and find out how they map into the API (are there
> error states or features that are not accomodated?)
>
> 3) Felix is correct that such a model potentially exposes the
> programmer to the worst-case scenario in terms of consistency models.
> This is one of the risks involved in anything that carries the moniker
> "simple".
I see the point that it is hard guarantee a specific consistency model in SAGA
or make it a configuration option to choose one. But if you completely ignore
this topic in the specs you will end up with a good spec of the syntax but
without semantics.
My 0.02€,
Thorsten
>
> On Jun 18, 2005, at 4:26 AM, Andre Merzky wrote:
> > Hi,
> >
> > in a chat with John Shalf, he offered following opinion to
> > the topic of consistence, which he agreed to let me post to
> > the list. The paper John refers to is the paper cited by
> > Jon McLaren (see quoting below). SSI here means Single
> > System Image.
> >
> > John:
> >
> > I think SAGA has no business defining a consistency
> > model, but it should be able to accomodate consistency
> > models that exist in the underlying implementations.
> >
> > SAGA is just the API. We should lay out some existing
> > consistency models and make sure the error handling
> > supports it. The problem with POSIX+NFS that was pointed
> > out in that paper was that they could not extend the
> > error codes that were available to POSIX.
> >
> > Felix is right on that account.
> >
> > However, most grid technologies are trying their best to
> > create an SSI consistency model. The existence proof is
> > there (WAN-GPFS provides the same consistency model
> > semantics as a local FS connection).
> >
> > So I reject any claim that a "Grid" filesystem neccessitates
> > some bizzaro consistency model that is not the same as
> > Single System Image (SSI).
> >
> > I think any remote-filesystem strategy should have SSI
> > as its goal ultimately. I think it is somewhat of a
> > religious battle as to whether a remote consistency
> > model must be radically different than the local one.
> > Certainly, we need to deal with different kinds of
> > failures (eg. an open filehandle that suddenly becomes
> > unavailable).
> >
> > It should be discussed at GGF [...].
> >
> > But just to keep the discussion focused, the core comment
> > is "SAGA has no business defining a consistency model,
> > but it should be able to accomodate consistency models
> > that exist in the underlying implementations." and also
> > to say that "Felix is right about the programmer
> > potentially having to worry about the worst-case."
> >
> > [...]
> >
> >
> > Best Regards,
> >
> > Andre.
> >
> > Quoting [Jon MacLaren] (Jun 15 2005):
> >>>> <snip>
> >>>> If SAGA choses to give single-system like guarantees, this must be
> >>>> explicititely stated. All interfaces that deal with data are
> >>>> unusable
> >>>> without a
> >>>> specification of consistency guarantees.
> >>
> >> I don't believe that it is possible, or even desirable, to try to
> >> make distributed systems look like they are not distributed. For
> >> example, I don't think you should provide POSIX behaviour on a
> >> distributed filesystem. If you look at AFS, it doesn't fit the POSIX
> >> model. Most people write code that ignores what the filesystem might
> >> be, and assume POSIX. How many people check the failure status on a
> >> file close? With AFS, you can get "Host not found" when you do a
> >> file close. You can wait, and try again. If you quit, your changes
> >> are lost. (As a library writer, you can try and "squash" the errors
> >> by putting a clever layer of code between the app and the filesystem
> >> that know tricks like this. The Condor people do this, I seem to
> >> recall.)
> >>
> >> The point here isn't that developers should never assume a POSIX
> >> filesystem, it is that they should know what kind of filesystem they
> >> are dealing with, so that they can write appropriate code. When you
> >> go distributed, there are a whole new set of error conditions that
> >> can occur. I don't think that there is anything to be gained from
> >> pretending that remote objects are the same as local objects, so that
> >> people's code can stay the same. If the code doesn't know it's
> >> dealing with something that is remote, rather than local, then at
> >> best (i.e. if there is lots of error checking) it will fail far more
> >> often. Probably though, it won't be robust.
> >>
> >> It might be worth looking at the following paper, which says
> >> eloquently what I'm grasping for.
> >>
> >> "A note on distributed computing" by Jim Waldo et al., available
> >> from: http://research.sun.com/techrep/1994/abstract-29.html
> >>
> >> Cheers,
> >>
> >> Jon.
> >
> > --
> > +-----------------------------------------------------------------+
> >
> > | Andre Merzky | phon: +31 - 20 - 598 - 7759 |
> > | Vrije Universiteit Amsterdam (VU) | fax : +31 - 20 - 598 - 7653 |
> > | Dept. of Computer Science | mail: merzky at cs.vu.nl |
> > | De Boelelaan 1083a | www: http://www.merzky.net |
> > | 1081 HV Amsterdam, Netherlands | |
> >
> > +-----------------------------------------------------------------+
More information about the saga-rg
mailing list