Results

Submissions: 46 (46% of accepted papers)

Evaluation Results:

  • 45 Artifacts Available
  • 43 Artifacts Functional
  • 35 Results Reproduced
Paper title Avail. Funct. Repro.
ASTERINAS: A Linux ABI-Compatible, Rust-Based Framekernel OS with a Small and Sound TCB Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Burst Computing: Quick, Sudden, Massively Parallel Processing on Serverless Resources Artifacts Available Artifacts Evaluated - Functional
Chitu: Avoiding Unnecessary Fallback in Byzantine Consensus Artifacts Available Artifacts Evaluated - Functional
CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge Artifacts Available
Colocating ML Inference and Training with Fast GPU Memory Handover Artifacts Available Artifacts Evaluated - Functional Results Reproduced
CrossPipe: Towards Optimal Pipeline Schedules for Cross-Datacenter Training Artifacts Available Artifacts Evaluated - Functional Results Reproduced
DSA-2LM: A CPU-Free Tiered Memory Architecture with Intel DSA Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Fast Distributed Transactions for RDMA-based Disaggregated Memory Artifacts Available Artifacts Evaluated - Functional Results Reproduced
FlexPipe: Maximizing Training Efficiency for Transformer-based Models with Variable-Length Inputs Artifacts Available
GeneralSparse: Bridging the Gap in SpMM for Pruned Large Language Model Inference on GPUs Artifacts Available Artifacts Evaluated - Functional
GMI-DRL: Empowering Multi-GPU DRL with Adaptive-Grained Parallelism Artifacts Available Artifacts Evaluated - Functional
GPREEMPT: GPU Preemptive Scheduling Made General and Efficient Artifacts Available Artifacts Evaluated - Functional Results Reproduced
GREYHOUND: Hunting Fail-Slows in Hybrid-Parallel Training at Scale Artifacts Available Artifacts Evaluated - Functional Results Reproduced
HotRAP: Hot Record Retention and Promotion for LSM-trees with Tiered Storage Artifacts Available Artifacts Evaluated - Functional Results Reproduced
IRHash: Efficient Multi-Language Compiler Caching by IR-Level Hashing Artifacts Available Artifacts Evaluated - Functional Results Reproduced
JENGA: Enhancing LLM Long-Context Fine-tuning with Contextual Token Sparsity Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Katz: Efficient Workflow Serving for Diffusion Models with Many Adapters Artifacts Available Artifacts Evaluated - Functional Results Reproduced
LEOCraft: Towards Designing Performant LEO Networks Artifacts Available Artifacts Evaluated - Functional Results Reproduced
LITESHIELD: Secure Containers via Lightweight, Composable Userspace μKernel Services Artifacts Evaluated - Functional
Mitigating Resource Usage Dependency in Sorting-based KV Stores on Hybrid Storage Devices via Operation Decoupling Artifacts Available Artifacts Evaluated - Functional
mTuner: Accelerating Parameter-Efficient Fine-Tuning on Multi-GPU Servers with Elastic Tensor Artifacts Available Artifacts Evaluated - Functional Results Reproduced
On-Demand Container Partitioning for Distributed ML Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Para-ksm: Parallelized Memory Deduplication with Data Streaming Accelerator Artifacts Available Artifacts Evaluated - Functional Results Reproduced
PathWeaver: A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search Artifacts Available
Poby: SmartNIC-accelerated Image Provisioning for Coldstart in Clouds Artifacts Available Artifacts Evaluated - Functional Results Reproduced
PPipe: Efficient Video Analytics Serving on Heterogeneous GPU Clusters via Pool-Based Pipeline Parallelism Artifacts Available Artifacts Evaluated - Functional Results Reproduced
QFactory: Accelerating Quantized Large Language Model Serving with Qtile Graphs Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Resource Multiplexing in Tuning and Serving Large Language Models Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Revealing Floating-Point Accumulation Orders in Software/Hardware Implementations Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Rex: Closing the language-verifier gap with safe and usable kernel extensions Artifacts Available Artifacts Evaluated - Functional
SAVE: Software-Implemented Fault Tolerance for Model Inference against GPU Memory Bit Flips Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Separate but Together: Integrating Remote Attestation into TLS Artifacts Available Artifacts Evaluated - Functional Results Reproduced
ShieldReduce: Fine-Grained Shielded Data Reduction Artifacts Available Artifacts Evaluated - Functional Results Reproduced
SpaceExit: Enabling Efficient Adaptive Computing in Space with Early Exits Artifacts Available Artifacts Evaluated - Functional Results Reproduced
SwCC: Software-Programmable and Per-Packet Congestion Control in RDMA Engine Artifacts Available Artifacts Evaluated - Functional
The Koala Benchmarks for the Shell: Characterization and Implications Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Toppings: CPU-Assisted, Rank-Aware Adapter Serving for LLM Inference Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Torpor: GPU-Enabled Serverless Computing for Low-Latency, Resource-Efficient Inference Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Turbocharge ANNS on Real Processing-in-Memory by Enabling Fine-Grained Per-PIM-Core Scheduling Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Understanding and Detecting Fail-Slow Hardware Failure Bugs in Cloud Systems Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Universal Checkpointing: A Flexible and Efficient Distributed Checkpointing System for Large-Scale DNN Training with Reconfigurable Parallelism Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Unveiling Compiler Faults via Attribute-Guided Compilation Space Exploration Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Voltrix: Sparse Matrix-Matrix Multiplication on Tensor Cores with Asynchronous and Balanced Kernel Optimization Artifacts Available Artifacts Evaluated - Functional Results Reproduced
Weaver: Efficient Multi-LLM Serving with Attention Offloading Artifacts Available Artifacts Evaluated - Functional Results Reproduced
XRT: An Accelerator-Aware Runtime for Accelerated Chip Multiprocessors Artifacts Available Artifacts Evaluated - Functional Results Reproduced
μEFI: A Microkernel-Style UEFI with Isolation and Transparency Artifacts Available Artifacts Evaluated - Functional Results Reproduced