• German
German

Main Navigation

Noll/Teubner/2020a: Shared Load(ing): Efficient Bulk Loading into Optimized Storage

Bibtype Inproceedings
Bibkey Noll/Teubner/2020a
Author Stefan Noll and Jens Teubner and Norman May and Alexander Böhm
Title Shared Load(ing): Efficient Bulk Loading into Optimized Storage
Booktitle Proc. of the 10th Annual Conference on Innovative Data Systems Research (CIDR)
Address Amsterdam
Abstract Bulk loading into the optimized storage of a database system is a performance-critical task for data analysis, replication, and system integration. Depending on the storage layout, it may entail complex data transformations, making it also an expensive task that can disturb other workloads running in parallel.
In this work, we demonstrate that for a commercial, in-memory columnar system with compression-optimized storage, data transformation dominates the cost of bulk loading. The transformations may cause resource contention on a stressed system, resulting in poor and unpredictable performance for both bulk loading and query processing. To mitigate this problem, we propose Shared Loading, a distributed bulk loading mechanism that enables dynamically offloading deserialization and data transformation to the machine where the input data resides. In our evaluation we demonstrate that, for different network bandwidths and data sets, Shared Loading accelerates bulk loading into compressionoptimized storage and improves the performance and predictability of queries running concurrently.
Month jan
Year 2020
Projekt SFB876-C5
 
Bibtex Here you can get this literature entry as BibTeX format.