|
EMT: An OS Framework for New Memory Translation Architectures
|
|
|
|
|
Paralegal: Practical Static Analysis for Privacy Bugs
|
|
|
|
|
Low End-to-End Latency atop a Speculative Shared Log with Fix-Ante Ordering
|
|
|
|
|
Kamino: Efficient VM Allocation at Scale with Latency-Driven Cache-Aware Scheduling
|
|
|
|
|
Decentralized, Epoch-based F2FS Journaling with Fine-grained Crash Recovery
|
|
|
|
|
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
|
|
|
|
|
Stripeless Data Placement for Erasure-Coded In-Memory Storage
|
|
|
|
|
QOS: Quantum Operating System
|
|
|
|
|
Picsou: Enabling Replicated State Machines to Communicate Efficiently
|
|
|
|
|
KPerfIR: Towards a Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
|
|
|
|
|
Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search Algorithm with SSD
|
|
|
|
|
Neutrino: Fine-grained GPU Kernel Profiling via Programmable Probing
|
|
|
|
|
Tigon: A Distributed Database for a CXL Pod
|
|
|
|
|
Training with Confidence: Catching Silent Errors in Deep Learning Training with Automated Proactive Checks
|
|
|
|
|
Bayesian Code Diffusion for Efficient Automatic Deep Learning Program Optimization
|
|
|
|
|
PoWER Never Corrupts: Tool-Agnostic Verification of Crash Consistency and Corruption Detection
|
|
|
|
|
Tintin: A Unified Hardware Performance Profiling Infrastructure to Uncover and Manage Uncertainty
|
|
|
|
|
Understanding Stragglers in Large Model Training Using What-if Analysis
|
|
|
|
|
Deriving Semantic Checkers from Tests to Detect Silent Failures in Production Distributed Systems
|
|
|
|
|
Quake: Adaptive Indexing for Vector Search
|
|
|
|
|
MettEagle: Costs and Benefits of Implementing Containers on Microkernels
|
|
|
|
|
Fast and Synchronous Crash Consistency with Metadata Write-Once File System
|
|
|
|
|
Building Bridges: Safe Interactions with Foreign Languages through Omniglot
|
|
|
|
|
Enabling Efficient GPU Communication over Multiple NICs with FuseLink
|
|
|
|
|
XSched: Preemptive Scheduling for Diverse XPUs
|
|
|
|
|
Weave: Efficient and Expressive Oblivious Analytics at Scale
|
|
|
|
|
Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload
|
|
|
|
|
Quantum Virtual Machines
|
|
|
|
|
FineMem: Breaking the Allocation Overhead vs. Memory Waste Dilemma in Fine-Grained Disaggregated Memory Management
|
|
|
|
|
Mirage: A Multi-Level Superoptimizer for Tensor Programs
|
|
|
|
|
WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training
|
|
|
|
|
Decouple and Decompose: Scaling Resource Allocation with DeDe
|
|
|
|
|
BlitzScale: Fast and Live Large Model Autoscaling with O(1) Host Caching
|
|
|
|
|
KRR: Efficient and Scalable Kernel Record Replay
|
|
|
|
|
Basilisk: Using Provenance Invariants to Automate Proofs of Undecidable Protocols
|
|
|
|
|
Extending Applications Safely and Efficiently
|
|
|
|
|
Compass: Encrypted Semantic Search with High Accuracy
|
|
|
|
|
NanoFlow: Towards Optimal Large Language Model Serving Throughput
|
|
|
|