ATC '25 and OSDI '25 -Joint Keynote Address: Accelerating Software Development: The LLM (R)evolution
OSDI '25 - QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a...
OSDI '25 - WaferLLM: Large Language Model Inference at Wafer Scale
OSDI '25 - BlitzScale: Fast and Live Large Model Autoscaling with O(1) Host Caching
OSDI '25 - Bayesian Code Diffusion for Efficient Automatic Deep Learning Program Optimization
OSDI '25 - Training with Confidence: Catching Silent Errors in Deep Learning Training with...
OSDI '25 - Understanding Stragglers in Large Model Training Using What-if Analysis
OSDI '25 - NanoFlow: Towards Optimal Large Language Model Serving Throughput
OSDI '25 - PipeThreader: Software-Defined Pipelining for Efficient DNN Execution
OSDI '25 - WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training
OSDI '25 - DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization
OSDI '25 - ZEN: Empowering Distributed Training with Sparsity-driven Data Synchronization
OSDI '25 - Tigon: A Distributed Database for a CXL Pod
OSDI '25 - Mako: Speculative Distributed Transactions with Geo-Replication
OSDI '25 - Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload
OSDI '25 - Skybridge: Bounded Staleness for Distributed Caches
OSDI '25 - Low End-to-End Latency atop a Speculative Shared Log with Fix-Ante Ordering
OSDI '25 - Picsou: Enabling Replicated State Machines to Communicate Efficiently
OSDI '25 - Quake: Adaptive Indexing for Vector Search
OSDI '25 - Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search...
OSDI '25 - Compass: Encrypted Semantic Search with High Accuracy
OSDI '25 - Enabling Efficient GPU Communication over Multiple NICs with FuseLink
OSDI '25 - KPerfIR: Towards a Open and Compiler-centric Ecosystem for GPU Kernel
OSDI '25 - Neutrino: Fine-grained GPU Kernel Profiling via Programmable Probing
OSDI '25 - XSched: Preemptive Scheduling for Diverse XPUs
OSDI '25 - Fast and Synchronous Crash Consistency with Metadata Write-Once File System
OSDI '25 - Stripeless Data Placement for Erasure-Coded In-Memory Storage
OSDI '25 - Decentralized, Epoch-based F2FS Journaling with Fine-grained Crash Recovery
OSDI '25 - Okapi: Decoupling Data Striping and Redundancy Grouping in Cluster File Systems
OSDI '25 - PoWER Never Corrupts: Tool-Agnostic Verification of Crash Consistency and Corruption...
OSDI '25 - Disentangling the Dual Role of NIC Receive Rings
OSDI '25 - Söze: One Network Telemetry Is All You Need for Per-flow Weighted Bandwidth Allocation...
OSDI '25 - Basilisk: Using Provenance Invariants to Automate Proofs of Undecidable Protocols
OSDI '25 - Mirage: A Multi-Level Superoptimizer for Tensor Programs
OSDI '25 - Paralegal: Practical Static Analysis for Privacy Bugs
OSDI '25 - Building Bridges: Safe Interactions with Foreign Languages through Omniglot
OSDI '25 - Deriving Semantic Checkers from Tests to Detect Silent Failures in Production...
OSDI '25 - To PRI or Not To PRI, That's the question
OSDI '25 - Kamino: Efficient VM Allocation at Scale with Latency-Driven Cache-Aware Scheduling
OSDI '25 - Decouple and Decompose: Scaling Resource Allocation with DeDe
OSDI '25 - Quantum Virtual Machines
OSDI '25 - EMT: An OS Framework for New Memory Translation Architectures
OSDI '25 - KRR: Efficient and Scalable Kernel Record Replay
OSDI '25 - MettEagle: Costs and Benefits of Implementing Containers on Microkernels
OSDI '25 - QOS: Quantum Operating System
OSDI '25 - Tintin: A Unified Hardware Performance Profiling Infrastructure to Uncover and Manage Uncertainty
OSDI '25 - OS Rendering Service Made Parallel with Out-of-Order Execution and In-Order Commit
OSDI '25 - Extending Applications Safely and Efficiently
OSDI '25 - Fork in the Road: Reflections and Optimizations for Cold Start Latency in Production...
OSDI '25 - FineMem: Breaking the Allocation Overhead vs. Memory Waste Dilemma in Fine-Grained...
OSDI '25 - Tiered Memory Management Beyond HotnessOSDI '25 - Deterministic Client: Enforcing Determinism on Untrusted Machine Code
OSDI '25 - Principles and Methodologies for Serial Performance Optimization
OSDI '25 - Weave: Efficient and Expressive Oblivious Analytics at Scale