[ogsa-d-wg] RE: Data federation definition

Allen Luniewski luniew at almaden.ibm.com
Fri Jan 27 17:40:45 CST 2006


Neil,

I understand the definitions you used in your note.  But I still can not 
see how a federation service and an integration service are, in practice, 
any different.  Consider:

To do federation, a federation service is going to have to do schema 
integration.  Thus it acts as, or at least uses in your terms, an 
integration service.  If it does not do this, I do not see how it can 
perform its federation function.

In the definition of an integration service, I ask myself: what are the 
operations I can ask an integration service to do?  The only interesting 
answer I can come up with is: access data using the integrated schema. But 
this is just what a federation service does.

So I end  up where I started: the terms "federation" and "integration" 
are, in practice, indistinguishable.  I am willing to believe that the two 
are different but I am still looking for a way to tell them apart.

Allen




"neil p chue hong" <N.ChueHong at epcc.ed.ac.uk> 
Sent by: owner-ogsa-d-wg at ggf.org
01/27/2006 04:44 AM
Please respond to
N.ChueHong


To
"'Dave Berry'" <daveb at nesc.ac.uk>, "'Treadwell, Jem'" 
<jem.treadwell at hp.com>, <ogsa-d-wg at gridforum.org>
cc

Subject
RE: [ogsa-d-wg] RE: Data federation definition







Hi All,

I dredged up a response to a similar question I asked someone doing 
research
in the area a while back:

---
Federation means data sources that are autonomously managed and are
generally heterogeneous. It is also logical to say that the sources are
distributed (as a good cause for their autonomy and heterogeneity - but it
doesn't have to be the case as you can run multiple data servers on your
laptop.) 

Integration targets local and centralized managed data sources. 

There is also a difference between a global schema, federated schema, and 
an
integrated schema.
 - Global schema: result of integrating local schema into one global view
 - Federated schema: result of federating distributed schemas into one
global view
 - Integrated schema: integration of schemas "in preparation" to build a
global/federated schema. The main difference between integrated schema and
(global/federated) schema is in the integrated schema the elements "are 
NOT
YET" being mapped to their original data sources using (LaV, GaV, etc).

Check papers by Maurizio Lenzerini - "Data Integration: A theoretical
perspective" and others.
---

In general, I think we could say that a "data federation" service presents 
a
single consistent front end to a number of autonomously managed data
sources, whilst a "data integration" service uses some schema integration
tactic to map between data sources.

In that sense, the phrasing that Allen uses sounds like a data federation
service.

My 2p,
neil 


> -----Original Message-----
> From: owner-ogsa-d-wg at ggf.org 
> [mailto:owner-ogsa-d-wg at ggf.org] On Behalf Of Dave Berry
> Sent: 26 January 2006 10:11
> To: Treadwell, Jem; ogsa-d-wg at gridforum.org
> Subject: [ogsa-d-wg] RE: Data federation definition
> 
> Folks,
> 
> We need a definition of how we are using the term "data 
> federation", for the OGSA glossary.  Fortunately we don';t 
> have to find a definition that covers all the ways the term 
> is used in the world, just how we use it in our document. 
> Following a short discussion at the OGSA F2F, Jem (who is 
> keeper of the glossary) suggested the following;
> 
> The integration of multiple data resources so that they can 
> be accessed as if they were a single resource.
> 
> Allen suggested that as we are accessing data via services, 
> this would be better phrased as follows (see the attached 
> message for Allen's explanation in his own words):
> 
> The integration of multiple services or data resources so 
> that they can be accessed as if they were a single service.
> 
> We discussed this on yesterday's call and the consensus was 
> that I should post to the list and ask for your comments.
> 
> We briefly discussed whether we should separately define 
> "data federation" and "data integration".  One view was that 
> "integration"
> didn't necessarily involve distributed resources while "federation"
> didn't necessarily involve integrating the resources into a 
> single view.
> The contrasting view was that integration almost always 
> involves distributed data in practice, and especially so in a 
> Grid context, while federation typically requires some way of 
> accessing the distributed data as a whole.  So I'm leaning 
> towards treated the terms as synonyms within our documents.
> 
> What do you think?
> 
> Dave.
> 
> 
> 
>   _____ 
> 
>                From: Treadwell, Jem [mailto:jem.treadwell at hp.com] 
>                Sent: 19 January 2006 23:17
>                To: Dave Berry
>                Subject: Data federation definition
> 
> 
>                Dave: This is (very slightly) modified from your 
> document - though you don't have the glossary entry filled in :0)
> 
>                The integration of multiple data resources to so that 
> they can be accessed as if they were a single resource.
> 
>                - Jem
> 
>   _____ 
> 
> Jem Treadwell
> Hewlett-Packard Company
> 6000 Irwin Road
> Mount Laurel, NJ 08054 
> Phone:                 856-638-6021 
> Fax:           856-638-6190 
> E-mail:                 Jem.Treadwell at hp.com <
mailto:Jem.Treadwell at hp.com> 
> 
> 
> 


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.ogf.org/pipermail/ogsa-d-wg/attachments/20060127/f5715f7c/attachment.html 


More information about the ogsa-d-wg mailing list