[ogsa-d-wg] RE: Data federation definition

neil p chue hong N.ChueHong at epcc.ed.ac.uk
Fri Jan 27 06:44:11 CST 2006


Hi All,

I dredged up a response to a similar question I asked someone doing research
in the area a while back:

---
Federation means data sources that are autonomously managed and are
generally heterogeneous. It is also logical to say that the sources are
distributed (as a good cause for their autonomy and heterogeneity - but it
doesn't have to be the case as you can run multiple data servers on your
laptop.) 

Integration targets local and centralized managed data sources. 

There is also a difference between a global schema, federated schema, and an
integrated schema.
 - Global schema: result of integrating local schema into one global view
 - Federated schema: result of federating distributed schemas into one
global view
 - Integrated schema: integration of schemas "in preparation" to build a
global/federated schema. The main difference between integrated schema and
(global/federated) schema is in the integrated schema the elements "are NOT
YET" being mapped to their original data sources using (LaV, GaV, etc).

Check papers by Maurizio Lenzerini - "Data Integration: A theoretical
perspective" and others.
---

In general, I think we could say that a "data federation" service presents a
single consistent front end to a number of autonomously managed data
sources, whilst a "data integration" service uses some schema integration
tactic to map between data sources.

In that sense, the phrasing that Allen uses sounds like a data federation
service.

My 2p,
neil 


> -----Original Message-----
> From: owner-ogsa-d-wg at ggf.org 
> [mailto:owner-ogsa-d-wg at ggf.org] On Behalf Of Dave Berry
> Sent: 26 January 2006 10:11
> To: Treadwell, Jem; ogsa-d-wg at gridforum.org
> Subject: [ogsa-d-wg] RE: Data federation definition
> 
> Folks,
>  
> We need a definition of how we are using the term "data 
> federation", for the OGSA glossary.  Fortunately we don';t 
> have to find a definition that covers all the ways the term 
> is used in the world, just how we use it in our document.  
> Following a short discussion at the OGSA F2F, Jem (who is 
> keeper of the glossary) suggested the following;
>  
> The integration of multiple data resources so that they can 
> be accessed as if they were a single resource.
>  
> Allen suggested that as we are accessing data via services, 
> this would be better phrased as follows (see the attached 
> message for Allen's explanation in his own words):
>  
> The integration of multiple services or data resources so 
> that they can be accessed as if they were a single service.
>  
> We discussed this on yesterday's call and the consensus was 
> that I should post to the list and ask for your comments.
>  
> We briefly discussed whether we should separately define 
> "data federation" and "data integration".  One view was that 
> "integration"
> didn't necessarily involve distributed resources while "federation"
> didn't necessarily involve integrating the resources into a 
> single view.
> The contrasting view was that integration almost always 
> involves distributed data in practice, and especially so in a 
> Grid context, while federation typically requires some way of 
> accessing the distributed data as a whole.  So I'm leaning 
> towards treated the terms as synonyms within our documents.
>  
> What do you think?
>  
> Dave.
>  
> 
> 
>   _____  
> 
> 	From: Treadwell, Jem [mailto:jem.treadwell at hp.com] 
> 	Sent: 19 January 2006 23:17
> 	To: Dave Berry
> 	Subject: Data federation definition
> 	
> 	
> 	Dave: This is (very slightly) modified from your 
> document - though you don't have the glossary entry filled in :0)
> 	 
> 	The integration of multiple data resources to so that 
> they can be accessed as if they were a single resource.
> 	 
> 	- Jem
> 	 
>   _____  
> 
> Jem Treadwell
> Hewlett-Packard Company
> 6000 Irwin Road
> Mount Laurel, NJ 08054	  	  	
> Phone: 	856-638-6021	
> Fax: 	856-638-6190	
> E-mail:	 Jem.Treadwell at hp.com <mailto:Jem.Treadwell at hp.com> 	
> 	 
> 
> 





More information about the ogsa-d-wg mailing list