papers
Skyhook: Towards an Arrow-Native Storage System
With the ever-increasing dataset sizes, several file formats such as Parquet, ORC, and Avro have been developed to store data …
Jayjeet Chakraborty, Ivo Jimenez, Sebastiaan Alvarez Rodriguez, Alexandru Uta, Jeff LeFevre, Carlos Maltzahn
Zero-Cost, Arrow-Enabled Data Interface for Apache Spark
Distributed data processing ecosystems are widespread and their components are highly specialized, such that efficient interoperability …
Sebastiaan Alvarez Rodriguez, Jayjeet Chakraborty, Aaron Chu, Ivo Jimenez, Jeff LeFevre, Carlos Maltzahn, Alexandru Uta
Mapping Scientific Datasets to Programmable Storage
Access libraries such as ROOT and HDF5 allow users to interact with datasets using high level abstractions, like coordinate systems and …
Aaron Chu, Jeff LeFevre, Carlos Maltzahn, Aldrin Montana, Peter Alvaro, Dana Robinson, Quincey Koziol
Scale-out Edge Storage Systems with Embedded Storage Nodes to Get Better Availability and Cost-Efficiency At the Same Time
In the resource-rich environment of data centers most failures can quickly failover to redundant resources. In contrast, failure in …
Jianshen Liu, Matthew Leon Curry, Carlos Maltzahn, Philip Kufeldt
SkyhookDM: Data Processing in Ceph with Programmable Storage
Jeff LeFevre, Carlos Maltzahn
Towards Physical Design Management in Storage Systems
In the post-Moore era, systems and devices with new architectures will arrive at a rapid rate with significant impacts on the software …
Kathryn Dahlgren, Jeff LeFevre, Ashay Shirwadkar, Ken Iizawa, Aldrin Montana, Peter Alvaro, Carlos Maltzahn
Skyhook: Programmable storage for databases
Ceph is an open source distributed storage system that is object-based and massively scalable. Ceph provides developers with the …
Jeff LeFevre, Noah Watkins, Michael Sevilla, Carlos Maltzahn