Exacution: Enhancing Scientific Data Management for Exascale
Document Type
Conference Proceeding
Publication Date
7-13-2017
Abstract
As we continue toward exascale, scientific data volume is continuing to scale and becoming more burdensome to manage. In this paper, we lay out opportunities to enhance state of the art data management techniques. We emphasize well-principled data compression, and using it to achieve progressive refinement. This can both accelerate I/O and afford the user increased flexibility when she interacts with the data. The formulation naturally maps onto enabling partitioning of the progressively improving-quality representations of a data quantity into different media-type destinations, to keep the highest priority information as close as possible to the computation, and take advantage of deepening memory/storage hierarchies in ways not previously possible. Careful monitoring is requisite to our vision, not only to verify that compression has not eliminated salient features in the data, but also to better understand the performance of massively parallel scientific applications. Increased mathematical rigor would be ideal,to help bring compression on a better-understood theoretical footing, closer to the relevant scientific theory, more aware of constraints imposed by the science, and more tightly error-controlled. Throughout, we highlight pathfinding research we have begun exploring related these topics, and comment toward future work that will be needed.
Identifier
85027248804 (Scopus)
ISBN
[9781538617915]
Publication Title
Proceedings International Conference on Distributed Computing Systems
External Full Text Location
https://doi.org/10.1109/ICDCS.2017.256
First Page
1927
Last Page
1937
Recommended Citation
Klasky, Scott; Suchyta, Eric; Ainsworth, Mark; Liu, Qing; Whitney, Ben; Wolf, Matthew; Choi, Jong; Foster, Ian; Kim, Mark; Logan, Jeremy; Mehta, Kshitij; Munson, Todd; Ostrouchov, George; Parashar, Manish; Podhorszki, Norbert; Pugmire, David; and Wan, Lipeng, "Exacution: Enhancing Scientific Data Management for Exascale" (2017). Faculty Publications. 9431.
https://digitalcommons.njit.edu/fac_pubs/9431
