BIRD Development

From Wikili

Origin of the BIRD System

BIRD is based on the main principles of the Saada project [1].

  SAADA - Système d'Archivage Automatique des Données Astronomiques (Automatic Archiving System for Astronomical Data)
  First goal: archiving and exploitation of data from the European XMM-Newton satellite [2]. The 2XMM catalogue of X-ray sources, 
    the largest of its kind ever, has now been released.
  Developed within the PhD work of Dr. NGUYEN (2002-2005, prototype Saada V.1.3) at the University of Strasbourg I, supported by 
    CNES [www.cnes.fr] and the Alsace Region, and supervised by Dr. Michel and Dr. Motch.


Image:saada_bird.jpg

Conceptual Data Model

To integrate heterogeneous data automatically, we designed several business data models, each corresponding to the actual format of a data bank. The figure below illustrates the conceptual data model of the BIRD system. The system can host several bank types simultaneously, and each type can cover several user-defined banks that share the same format. With this conceptual model, BIRD can host different versions of a given data bank and manage them so that programs launched on a computing grid within one application use the same data version throughout their computation.

Image:model1.jpg
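
The hierarchy described above (bank type, user-defined bank, version) can be sketched in Java as follows. This is only an illustration under assumptions: none of these class names, nor the idea of "pinning" the latest version when a computation starts, is taken from the BIRD source code; they are hypothetical names chosen to show how one application could keep working on the same data version while newer versions are loaded.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical names throughout: BIRD's real classes are not shown on this page.

/** One loaded version (release) of a user-defined bank. */
final class BankVersion {
    final String label;
    BankVersion(String label) { this.label = label; }
}

/** A user-defined bank, e.g. "Genbank-EST", holding one or more versions. */
final class UserBank {
    final String name;
    private final List<BankVersion> versions = new ArrayList<>();

    UserBank(String name) { this.name = name; }

    /** Registers a newly loaded version and returns it. */
    BankVersion addVersion(String label) {
        BankVersion v = new BankVersion(label);
        versions.add(v);
        return v;
    }

    /** Most recently added version; throws if none has been loaded. */
    BankVersion latest() {
        if (versions.isEmpty()) {
            throw new IllegalStateException("no version loaded for " + name);
        }
        return versions.get(versions.size() - 1);
    }
}

/** A bank type (format), e.g. "Genbank", shared by several user-defined banks. */
final class BankType {
    final String format;
    private final Map<String, UserBank> banks = new LinkedHashMap<>();

    BankType(String format) { this.format = format; }

    /** Returns the bank with this name, creating it on first use. */
    UserBank bank(String name) {
        return banks.computeIfAbsent(name, UserBank::new);
    }
}

/** A computation keeps the version it started with, even if newer ones arrive. */
final class ComputationSession {
    final BankVersion pinned;
    ComputationSession(UserBank bank) { this.pinned = bank.latest(); }
}
```

In this sketch, a session created while "v1" is the latest version of Genbank-EST still sees "v1" after "v2" has been added; the version labels are placeholders.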


The data model of a data bank is predefined in an XML configuration file. This metadata is used to generate the Java and SQL code. Code generation is launched when the BIRD data bank is configured, or when data are loaded or reloaded. In the example given in Figure 3, the Genbank metadata are used to create the Genbank-EST and Genbank-Refseq banks.
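
As a rough illustration of metadata-driven code generation, the sketch below reads a small XML description and emits a CREATE TABLE statement and the source of a Java class. The XML element and attribute names (`field`, `name`, `type`), the two supported types, and the `BankCodeGenerator` class are all assumptions made for this example; the actual BIRD configuration schema and generator are not documented here. Only the standard JDK XML parser is used.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical sketch: the <field name=".." type=".."/> layout is an assumption,
// not BIRD's actual configuration schema.
final class BankCodeGenerator {

    /** One field of the bank model: a name and a simple type keyword. */
    static final class Field {
        final String name;
        final String type; // "string" or "int" in this sketch
        Field(String name, String type) { this.name = name; this.type = type; }
    }

    /** Reads all <field> elements below the root element of the metadata. */
    static List<Field> parseFields(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            NodeList nodes = doc.getDocumentElement().getElementsByTagName("field");
            List<Field> fields = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                Element e = (Element) nodes.item(i);
                fields.add(new Field(e.getAttribute("name"), e.getAttribute("type")));
            }
            return fields;
        } catch (Exception e) {
            throw new IllegalArgumentException("invalid bank metadata", e);
        }
    }

    /** Emits a CREATE TABLE statement for one user-defined bank. */
    static String toSql(String table, List<Field> fields) {
        StringBuilder sb = new StringBuilder("CREATE TABLE " + table + " (");
        for (int i = 0; i < fields.size(); i++) {
            Field f = fields.get(i);
            if (i > 0) sb.append(", ");
            sb.append(f.name).append(' ')
              .append(f.type.equals("int") ? "INTEGER" : "VARCHAR(255)");
        }
        return sb.append(")").toString();
    }

    /** Emits Java source for the matching business class. */
    static String toJava(String className, List<Field> fields) {
        StringBuilder sb = new StringBuilder(
                "public class " + className + " extends ObjectPersistence {\n");
        for (Field f : fields) {
            sb.append("    private ")
              .append(f.type.equals("int") ? "int" : "String")
              .append(' ').append(f.name).append(";\n");
        }
        return sb.append("}\n").toString();
    }
}
```

With the same parsed field list, calling `toSql("GENBANK_EST", fields)` and `toSql("GENBANK_REFSEQ", fields)` would produce one table per user-defined bank from a single bank-type description. The table names are illustrative.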

Image:model2.jpg


This figure illustrates the business model for Genbank. Each bank can have several associated entries, and each entry has its own associated information such as Dbref, SEQData, FTSource, and so on. In our design, the Java classes of the business models are generated automatically by BIRD. Only instances of classes that inherit from the ObjectPersistence superclass are recognized by the BIRD API. This superclass holds the attributes and methods common to all generated classes.
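
The role of the common superclass can be sketched as below. Only the name ObjectPersistence comes from the text; its `oid` attribute, the `GenbankEntry` class, and the `BirdStore` stand-in for the API are hypothetical and serve only to show how an API can accept generated classes while rejecting everything else.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: only the name ObjectPersistence comes from the text above.
// Its fields, GenbankEntry and BirdStore are illustrative assumptions.

/** Common base of all generated business classes. */
abstract class ObjectPersistence {
    private long oid = -1; // assumed common attribute: a database identifier

    long getOid() { return oid; }
    void setOid(long oid) { this.oid = oid; }
    boolean isPersisted() { return oid >= 0; }
}

/** Example of what a generated class for a Genbank entry might look like. */
final class GenbankEntry extends ObjectPersistence {
    final String accession;
    GenbankEntry(String accession) { this.accession = accession; }
}

/** Stand-in for the API: it accepts only ObjectPersistence instances. */
final class BirdStore {
    private final List<ObjectPersistence> saved = new ArrayList<>();

    /** Assigns an identifier, or rejects objects outside the hierarchy. */
    long save(Object candidate) {
        if (!(candidate instanceof ObjectPersistence)) {
            throw new IllegalArgumentException(
                    "not a BIRD business object: " + candidate);
        }
        ObjectPersistence obj = (ObjectPersistence) candidate;
        obj.setOid(saved.size());
        saved.add(obj);
        return obj.getOid();
    }
}
```

Putting shared state in one abstract base class means a generator only has to emit the bank-specific fields, which is consistent with the design described above.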

Query Engine

Data Integration

Image:integration1.jpg

The creation of a database goes through three main stages. First, the relational schema of the system (the meta-model) is created when BIRD is installed. Second, the configuration module creates the business data model, including the SQL and Java code corresponding to the predefined metadata given in the XML configuration. Third, the system analyzes a set of integration rules to select the data files, then converts them and loads them into the relational tables.
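
The last stage (rule-driven file selection before conversion and loading) might look like the sketch below. The text does not say how BIRD expresses its integration rules, so the rule format used here (one file-name regular expression per target bank) and the class names are assumptions; the conversion and loading steps are reduced to a textual plan.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical sketch: a rule is assumed to be a file-name regex tied to a
// target bank. BIRD's real rule format is not described on this page.
final class IntegrationRule {
    final String targetBank;
    private final Pattern fileNamePattern;

    IntegrationRule(String targetBank, String fileNameRegex) {
        this.targetBank = targetBank;
        this.fileNamePattern = Pattern.compile(fileNameRegex);
    }

    /** Keeps only the data files whose full name matches this rule. */
    List<String> select(List<String> fileNames) {
        List<String> selected = new ArrayList<>();
        for (String name : fileNames) {
            if (fileNamePattern.matcher(name).matches()) {
                selected.add(name);
            }
        }
        return selected;
    }
}

final class IntegrationPipeline {
    /** Lists, in rule order, which files would be converted and loaded where. */
    static List<String> plan(List<IntegrationRule> rules, List<String> fileNames) {
        List<String> steps = new ArrayList<>();
        for (IntegrationRule rule : rules) {
            for (String file : rule.select(fileNames)) {
                steps.add("convert+load " + file + " -> " + rule.targetBank);
            }
        }
        return steps;
    }
}
```

A real loader would replace each planned step with a parser for the bank format and batched inserts into the generated tables; files that match no rule are simply ignored in this sketch.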

Architecture

Image:bird_arch.jpg

Key Technologies

Relational Core store

  IBM DB2 WareHouse V9.1 
  WebSphere Federation Server 

WEB Server & Services

  IBM WebSphere Application Server (main portal)
  Tomcat Server (services, non-graphical)
  Hibernate (object-relational mapping)
  JSF - JavaServer Faces (web components)

XML & JAVA

Project Distribution

  Not yet available to the public