Revealing the Unstable Foundations of eBPF-Based Kernel Extensions
Shawn Zhong (University of Wisconsin-Madison); Jing Liu (Microsoft Research); Andrea
Arpaci-Dusseau (University of Wisconsin-Madison); Remzi Arpaci-Dusseau (University of
Wisconsin–Madison)
Dandelion Hashtable: Cracking the Billion Memory Requests Per Second Barrier —
Effortlessly
Antonios Katsarakis, Vasilis Gavrielatos (Huawei Research); Nikos Ntarmos (Huawei Research
UK)
Abstract
MEPipe: Democratizing LLM Training with Memory-Efficient Slice-Level Pipeline Scheduling
on Cost-Effective Accelerators
Zhenbo Sun, Shengqi Chen, Yuanwei Wang (Tsinghua University); Jian Sha (Ant Group); Guanyu
Feng (Zhipu AI); Wenguang Chen (Tsinghua University)
Towards Interference-aware Application Co-locations
Ioannis A. Papadakis (National Technical University of Athens); Nectarios Koziris (National
Technical University of Athens, Greece); Georgios Goumas (National Technical University of
Athens)
Abstract
Beyond Layers: Container Registries for Files Distribution and On-Demand Image
Partitioning
Giovanni Bartolomeo, Navidreza Asadi, Wolfgang Kellerer (Technical University of Munich);
Jorg Ott (Technische Universität München); Nitinder Mohan (TU Delft)
Abstract
Boosting Rematerialization Training via Execution Mode Splitting Modeling on Convex
Optimized Dynamic Programming
Yu Tang, Lujia Yin (National University of Defense Technology); Qiao Li, Yiming Zhang
(Xiamen University)
Abstract
The LAW Behind ALRs: Redefining Crash-Tolerant Reads
Antonios Katsarakis (Huawei Research); Emmanouil Giortamis (TU Munich); Vasilis Gavrielatos
(Huawei Research); Pramod Bhatotia (TU Munich); Aleksandar Dragojevic (None); Boris Grot
(University of Edinburgh); Vijay Nagarajan (University of Utah); Panagiota Fatourou (FORTH
ICS and University of Crete, Greece)
Abstract
DPack: Efficiency-Oriented Privacy Budget Scheduling
Pierre Tholoniat, Kelly Kostopoulou (Columbia University)
Adios to Busy-Waiting for Microsecond-scale Memory Disaggregation
Wonsup Yoon, Jisu Ok, Sue Moon, Youngjin Kwon (KAIST)
Reproducible Fault Injection at the Operating System Level
Sebastião Amaro, Miguel Matos (IST Lisbon & INESC-ID); Pedro Fonseca (Purdue
University)
Abstract
HawkSet: Automatic, Application-Agnostic, and Efficient Concurrent PM Bug
Detection
João Oliveira (INESC-ID, IST); João Gonçalves (INESC-ID & IST U. Lisboa); Miguel Matos (IST
Lisbon & INESC-ID)
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge
Fusion
Jiayi Yao (The Chinese University of Hong Kong, Shenzhen); Hanchen Li, Yuhan Liu, Siddhant
Ray, Yihua Cheng (University of Chicago); Qizheng Zhang (Stanford University); Kuntai Du
(University of Chicago); Shan Lu (University of Chicago / Microsoft Research); Junchen Jiang
(University of Chicago)
Evolving XFS with Zoned Storage and Intelligent Data Placement
Hans Holmberg, Christoph Hellwig (Western Digital Research)
Abstract
NeuStream: Bridging Deep Learning Serving and Stream Processing
Haochen Yuan (Peking University); Yuanqing Wang (Peking University and Microsoft Research);
Wenhao Xie (Peking University); Yu Cheng (Peking University and Microsoft Research); Ziming
Miao, Lingxiao Ma, Jilong Xue (Microsoft Research); Zhi Yang (Peking University)
GraphGen+: Advancing Distributed Subgraph Generation and Graph Learning On Industrial
Graphs
Yue Jin, Yongchao Liu, Chuntao Hong (Ant Group)
Abstract
CAPSys: Contention-aware task placement for data stream processing
Yuanli Wang, Lei Huang, Zikun Wang, Vasiliki Kalavri, Abraham Matta (Boston
University)
SeBS-Flow: Benchmarking Serverless Cloud Function Workflows
Larissa Schmid (Karlsruhe Institute of Technology); Marcin Copik (ETH Zurich); Alexandru
Calotoiu (ETH Zurich, Switzerland); Laurin Brandner (ETH Zürich); Anne Koziolek (KIT
Karlsruhe Institute of Technology); Torsten Hoefler (ETH Zurich)
Towards VM Rescheduling Optimization Through Deep Reinforcement Learning
Xianzhong Ding (University of California, Merced); Yunkai Zhang (University of California,
Berkeley); Binbin Chen (ByteDance Inc); Donghao Ying (University of California, Berkeley);
Tieying Zhang (ByteDance); Jianjun Chen (Bytedance); Lei Zhang (ByteDance); Alberto Cerpa,
Wan Du (University of California, Merced)
HyperAlloc: Efficient VM Memory De/Inflation via Hypervisor-Shared Page-Frame
Allocators
Lars Wrenger, Kenny Albes, Marco Wurps (Leibniz Universität Hannover); Christian Dietrich
(Technische Universität Braunschweig); Daniel Lohmann (Leibniz Universität
Hannover)
Kirsch: It's 2025. Does your OS know what's on your SoC?
Roman Meier, Zikai Liu, Ben Fiedler, Timothy Roscoe (ETH Zurich)
Abstract
ConsenStress: A Framework to Torture Test Consensus Protocols
Pasindu Tennage, Shailesh Mishra (EPFL); Alberto Sonnino (Mysten Labs, UCL); Eleftherios
Kokoris Kogias (Mysten Labs); Philipp Jovanovic (UCL); Bryan Ford (EPFL)
Abstract
Distributed Graph Neural Network Inference With Just-In-Time Compilation For
Industry-Scale Graphs
Xiabao Wu, Yongchao Liu, Wei Qin, Chuntao Hong (Ant Group, China)
Abstract
Comprehensive Deadlock Prevention for GPU Collective Communication
Lichen Pan (School of Computer Science, Peking University); Juncheng Liu (OneFlow Research);
Yongquan Fu (Science and Technology Laboratory of Parallel and Distributed Processing;
College of Computer, National University of Defense Technology, Changsha, Hunan province,
Chin); Jinhui Yuan (OneFlow Research); Rongkai Zhang, Pengze Li, Zhen Xiao (School of
Computer Science, Peking University)
Understanding the Linux Kernel, Visually
Hanzhi Liu, Yanyan Jiang, Chang Xu (Nanjing University)
Achilles: Efficient TEE-Assisted BFT Consensus via Rollback Resilient
Recovery
Jianyu Niu (Southern University of Science and Technology); Xiaoqing Wen (University of
British Columbia); Guanlong Wu (Southern University of Science and Technology); Shengqi Liu
(Southern University of Science and Technology.); Jiangshan Yu (The University of Sydney);
Yinqian Zhang (Southern University of Science and Technology)
Socarrat: Building Cost-Effective Secure WORM Devices Following the Reverse File System
Approach
Gorka Guardiola Múzquiz, Enrique Soriano-Salvador (Universidad Rey Juan Carlos)
Abstract
Eva: Cost-Efficient Cloud-Based Cluster Scheduling
Tzu-Tao Chang, Shivaram Venkataraman (University of Wisconsin-Madison)
Dandelion: Small Clusters, Massive Throughput—The Future of Distributed
Transactions
Antonios Katsarakis, Vasilis Gavrielatos (Huawei Research); Chris Jensen (University of
Cambridge); Nikos Ntarmos (Huawei Research UK)
Abstract
Maestro: VM memory overcommit balancing platform
Adamos Ttofari, Lukas Humbel (Huawei)
Abstract
Towards Efficient Large Language Model Serving for Multi-turn Conversations
Jianxiong Liao, Zhi Zhou (Sun Yat-sen University)
Abstract
A Hardware-Software Co-Design for Efficient Secure Containers
Jiacheng Shi, Yang Yu, Jinyu Gu, Yubin Xia (Shanghai Jiao Tong University)
A House United Within Itself: SLO-Awareness for On-Premises Containerized ML Inference
Clusters via Faro
Beomyeol Jeon (University of Illinois Urbana-Champaign); Chen Wang, Diana Arroyo, Alaa
Youssef (IBM Research); Indranil Gupta (University of Illinois Urbana-Champaign)
Fast State Restoration in LLM Serving with HCache
Shiwei Gao, Youmin Chen, Jiwu Shu (Tsinghua University)
Optimizing Task Scheduling in Cloud VMs with Accurate vCPU Abstraction
Edward Guo (Hofstra University); Weiwei Jia (The University of Rhode Island); Xiaoning Ding
(New Jersey Institute of Technology); Jianchen Shan (Hofstra University)
RoCE BALBOA: Towards FPGA-enhanced RDMA
Maximilian Jakob Heer, Benjamin Ramhorst, Jonas Dann, Gustavo Alonso (ETH Zurich)
Abstract
Bounded Resource Reclamation
Viktor Reusch, Till Miemietz (Barkhausen Institut)
Abstract
Towards Energy-Efficient Split Computing: A Hardware-Software Co-Design
Perspective
Daniel May, Alessandro Tundo (Technische Universität Wien (TU Wien)); Shashikant Ilager
(University of Amsterdam (UvA)); Ivona Brandic (Technische Universität Wien (TU
Wien))
Abstract
Solid State Drive Targeted Memory-Efficient Indexing for Universal I/O Patterns and
Fragmentation Degrees
Junsu Im, Jeonggyun Kim, Seonggyun Oh, Jinhyung Koo, Juhyung Park, Hoon Sung Chwa (DGIST);
Sam H. Noh (Virginia Tech); Sungjin Lee (DGIST)
Ladon: High-Performance Multi-BFT Consensus via Dynamic Global Ordering
Hanzheng Lyu (The University of British Columbia); Shaokang Xie, Jianyu Niu (Southern
University of Science and Technology); Chen Feng (University of British Columbia); Yinqian
Zhang (Southern University of Science and Technology); Ivan Beschastnikh (University of
British Columbia)
HybridFlow: A Flexible and Efficient RLHF Framework
Guangming Sheng (The University of Hong Kong); Chi Zhang, Zilingfeng Ye (ByteDance); Xibin
Wu (Bytedance); Wang Zhang (ByteDance); Ru Zhang (Bytedance); Yanghua Peng, Haibin Lin
(ByteDance); Chuan Wu (The University of Hong Kong)
Serverless Cold Starts and Where to Find Them
Artjom Joosen (Huawei); Ahmed Hassan (Systems Infrastructure Lab, Huawei); Martin Asenov,
Rajkarn Singh (Huawei); Luke Darlow (Sakana AI); Jianfeng Jianfeng (Wang); qiwen deng
(Huawei Edinburgh Research Center); Adam Barker (Huawei)
Systems for LLMs Are Old News: Multimodality Is Redefining Everything We
Know
Konstantinos Papaioannou, Thaleia Dimitra Doudali (IMDEA Software Institute)
Abstract
TUNA: Tuning Unstable and Noisy Cloud Applications
Johannes Freischuetz (University of Wisconsin - Madison); Konstantinos Kanellis (University
of Wisconsin-Madison); Brian Kroth (Microsoft); Shivaram Venkataraman (University of
Wisconsin-Madison)
Maximizing Return On Investment for Sustainable Operations through Smart Workload
Migration
Georgia Christofidi (IMDEA Software Institute, Universidad Politécnica de Madrid); Francisco
Álvarez Terribas, Jesus Alberto Omaña Iglesias (Telefónica Research); Nicolas Kourtellis
(Keysight); Thaleia Dimitra Doudali (IMDEA Software Institute)
Abstract
An Architecture for Shrinking the TCB of TEEs on Heterogeneous Systems
Nils Asmussen, Carsten Weinhold (Barkhausen Institut)
Abstract
DuoSwap: adaptive concurrent swapping to compressed memory and NVMe SSD
Yuben Yang (University of Sydney); Baptiste Lepers (Inria); Kimberly Keeton, Khaled
Elmeleegy (Google); Willy Zwaenepoel (University of Sydney)
Abstract
Efficient Deep Learning Inference on IoT Devices
Zhizhuo Liu, Min Liu, Chaonong Xu (China University of Petroleum-Beijing)
Abstract
JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous
GPUs
Gyeongchan Yun, Junesoo Kang, Hyunjoon Jeong, Sanghyeon Eom (UNIST); Minsung Jang (Samsung
SDS); Young-ri Choi (UNIST (Ulsan National Institute of Science and Technology))
HuffmanEmbed: Using Huffman Coding for Embedding Table Compression in Deep Learning
Recommendation Models
Chaoyi Jiang, Abdulla Alshabanah, Hossein Entezari Zarch, Keshav Balasubramanian, Murali
Annavaram (University of Southern California)
Abstract
Lauberhorn: a Smart NIC that is part of the OS
Pengcheng Xu, Timothy Roscoe (ETH Zurich)
Abstract
SpotHedge: Serving AI Models on Spot Instances
Ziming Mao, Tian Xia, Zhanghao Wu, Wei-Lin Chiang, Tyler Griggs, Romil Bhardwaj, Zongheng
Yang (UC Berkeley); Scott Shenker (ICSI AND UC Berkeley); Ion Stoica (UC Berkeley)
RoboRebound: Multi-Robot System Defense with Bounded-Time Interaction
Neeraj Gandhi, Yifan Cai, Andreas Haeberlen, Linh Thi Xuan Phan (University of
Pennsylvania)
Automated Reasoning About Memory Accesses on Systems-on-Chip
Ben Fiedler, Samuel Gruetter, Timothy Roscoe (ETH Zurich)
Abstract
Dynamic Dispatcher Assignment With Flat-Combining
Gangmin Lee, Wonsup Yoon, Sue Moon (KAIST)
Abstract
Systematic Testing of Persistent Memory Applications
Henrique Fernandes, João Gonçalves, Miguel Matos (IST U. Lisboa & INESC-ID)
Abstract
Ad-hoc composable cache coherent systems - a fairy tale?
Jasmin Schult, Timothy Roscoe (ETH Zurich)
Abstract