FG 1306/1: Stratosphere - Information Management on the Cloud (TP D)

The Collaborative Research Unit Stratosphere aims at advancing the state of the art in data processing on parallel, adaptive architectures. Stratosphere explores the power of massively parallel computing for complex information management applications. We will develop a novel, database-inspired approach to analyze, aggregate, and query very large collections of either textual or (semi-)structured data on a virtualized, massively parallel cluster architecture. Stratosphere will conduct research in the areas of massively parallel data processing engines, a programming model for parallel data programming, robust optimization of declarative data flow programs, continuous re-optimization and adaptation of the execution, data cleansing, and text mining. The unit will validate its work through a benchmark of the overall system performance and through demonstrators in the areas of climate research, the biosciences, and linked open data.

Principal investigator
Leser, Ulf Prof. Dr.-Ing. (Wissensmanagement in der Bioinformatik)

Participating organizational units of HU

Funding body
DFG: Research Units (Forschergruppen)

Duration
Project start: 09/2010
Project end: 10/2015

Research areas
Big Data, Query Optimization, Text Mining

Last updated 2020-01-06 at 18:53