Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg, Informationsfusion und -Integrität: Grid-Erweiterungen zum Datenmanagement Project Leader: Prof. Dr.-Ing. Erhard Rahm Institut für Informatik, Universität Leipzig Project Partners Infrastructure: TU Dresden Universität Leipzig TU München Application: AstroGrid-D MediGrid TextGrid Forschungszentrum Rossendorf
Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg, Main Goals: 1. Data Fusion o semantical correct Combination of Data Sources o Support of Analysis over distributed Data o Ontology-based Mapping of Data Sources 2. Data Integrity o Data Lineage o based on Grid Security Infrastructure (GSI) 3. Dynamic Data Distribution (see following slides) o Optimization of data intensive Queries o effective and efficient Usage of decentral Main Memory Ressources Services for Augmenting Data Management in DGI
Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg, HiSbase: Main Memory P2P Database System
Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg, HiSbase – The Challenge Histogram-based peer-to-peer main memory database for locality-aware data processing Example: distributed archives (astrophysics) Correlation of different catalogs Skewed data distribution Region-based queries Right ascension (longitude) Declination (latitude)
Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg,
Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg, Distribute by Region – not by Archive! Highly distributed information management Distributed Hashtable peer-to-peer architecture High performance query processing Main memory database Semantic clustering, spatial locality Equi-depth histograms
Lehrstuhl Informatik III: Datenbanksysteme AstroGrid-D Meeting Heidelberg,