Skip to content

Instantly share code, notes, and snippets.

@RealNeGate
Last active January 24, 2026 09:28
Show Gist options
  • Select an option

  • Save RealNeGate/17dcb8a0e5b6c2e912a499c6ffff5752 to your computer and use it in GitHub Desktop.

Select an option

Save RealNeGate/17dcb8a0e5b6c2e912a499c6ffff5752 to your computer and use it in GitHub Desktop.

AI/ML

ATC '25 and OSDI '25 -Joint Keynote Address: Accelerating Software Development: The LLM (R)evolution

OSDI '25 - QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a...

OSDI '25 - WaferLLM: Large Language Model Inference at Wafer Scale

OSDI '25 - BlitzScale: Fast and Live Large Model Autoscaling with O(1) Host Caching

OSDI '25 - Bayesian Code Diffusion for Efficient Automatic Deep Learning Program Optimization

OSDI '25 - Training with Confidence: Catching Silent Errors in Deep Learning Training with...

OSDI '25 - Understanding Stragglers in Large Model Training Using What-if Analysis

OSDI '25 - NanoFlow: Towards Optimal Large Language Model Serving Throughput

OSDI '25 - PipeThreader: Software-Defined Pipelining for Efficient DNN Execution

OSDI '25 - WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training

OSDI '25 - DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization

OSDI '25 - ZEN: Empowering Distributed Training with Sparsity-driven Data Synchronization

Databases

OSDI '25 - Tigon: A Distributed Database for a CXL Pod

OSDI '25 - Mako: Speculative Distributed Transactions with Geo-Replication

OSDI '25 - Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload

OSDI '25 - Skybridge: Bounded Staleness for Distributed Caches

OSDI '25 - Low End-to-End Latency atop a Speculative Shared Log with Fix-Ante Ordering

OSDI '25 - Picsou: Enabling Replicated State Machines to Communicate Efficiently

Search

OSDI '25 - Quake: Adaptive Indexing for Vector Search

OSDI '25 - Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search...

OSDI '25 - Compass: Encrypted Semantic Search with High Accuracy

XPUs

OSDI '25 - Enabling Efficient GPU Communication over Multiple NICs with FuseLink

OSDI '25 - KPerfIR: Towards a Open and Compiler-centric Ecosystem for GPU Kernel

OSDI '25 - Neutrino: Fine-grained GPU Kernel Profiling via Programmable Probing

OSDI '25 - XSched: Preemptive Scheduling for Diverse XPUs

File Systems

OSDI '25 - Fast and Synchronous Crash Consistency with Metadata Write-Once File System

OSDI '25 - Stripeless Data Placement for Erasure-Coded In-Memory Storage

OSDI '25 - Decentralized, Epoch-based F2FS Journaling with Fine-grained Crash Recovery

OSDI '25 - Okapi: Decoupling Data Striping and Redundancy Grouping in Cluster File Systems

OSDI '25 - PoWER Never Corrupts: Tool-Agnostic Verification of Crash Consistency and Corruption...

Networks

OSDI '25 - Disentangling the Dual Role of NIC Receive Rings

OSDI '25 - Söze: One Network Telemetry Is All You Need for Per-flow Weighted Bandwidth Allocation...

Compilers/Static analysis

OSDI '25 - Basilisk: Using Provenance Invariants to Automate Proofs of Undecidable Protocols

OSDI '25 - Mirage: A Multi-Level Superoptimizer for Tensor Programs

OSDI '25 - Paralegal: Practical Static Analysis for Privacy Bugs

OSDI '25 - Building Bridges: Safe Interactions with Foreign Languages through Omniglot

OSDI '25 - Deriving Semantic Checkers from Tests to Detect Silent Failures in Production...

VMs

OSDI '25 - To PRI or Not To PRI, That's the question

OSDI '25 - Kamino: Efficient VM Allocation at Scale with Latency-Driven Cache-Aware Scheduling

OSDI '25 - Decouple and Decompose: Scaling Resource Allocation with DeDe

OSDI '25 - Quantum Virtual Machines

OSDI

OSDI '25 - EMT: An OS Framework for New Memory Translation Architectures

OSDI '25 - KRR: Efficient and Scalable Kernel Record Replay

OSDI '25 - MettEagle: Costs and Benefits of Implementing Containers on Microkernels

OSDI '25 - QOS: Quantum Operating System

OSDI '25 - Tintin: A Unified Hardware Performance Profiling Infrastructure to Uncover and Manage Uncertainty

OSDI '25 - OS Rendering Service Made Parallel with Out-of-Order Execution and In-Order Commit

OSDI '25 - Extending Applications Safely and Efficiently

Apps/Cloud

OSDI '25 - Fork in the Road: Reflections and Optimizations for Cold Start Latency in Production...

OSDI '25 - FineMem: Breaking the Allocation Overhead vs. Memory Waste Dilemma in Fine-Grained...

OSDI '25 - Tiered Memory Management Beyond HotnessOSDI '25 - Deterministic Client: Enforcing Determinism on Untrusted Machine Code

OSDI '25 - Principles and Methodologies for Serial Performance Optimization

OSDI '25 - Weave: Efficient and Expressive Oblivious Analytics at Scale

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment