@vanbasten23
Created November 14, 2025 18:16
{
    "name": "newjax_benchmark_server",
    "type": "debugpy",
    "request": "launch",
    "program": "/home/xiowei_google_com/miniconda3/envs/vllm_newjax/bin/vllm",
    "console": "integratedTerminal",
    "justMyCode": false,
    "env": {
        "MODEL_IMPL_TYPE": "vllm",
        "TPU_BACKEND_TYPE": "jax"
    },
    "args": [
        "serve",
        "--seed",
        "42",
        "--model",
        "Qwen/Qwen2.5-1.5B-Instruct",
        "--max_model_len",
        "2048",
        "--max-num-batched-tokens",
        "4096",
        "--max-num-seqs",
        "512",
        "--no-enable-prefix-caching",
        "--disable-log-requests",
        "--tensor-parallel-size",
        "1"
    ]
},
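To run the same server outside the debugger, the configuration above can be flattened into a single shell command. The following is a minimal sketch, not part of the original gist: the helper name `to_shell_command` is hypothetical, and it assumes `vllm` is on `PATH` rather than using the absolute conda path from the `program` field.

```python
import shlex

# Values copied from the "env" and "args" fields of the launch configuration.
ENV = {"MODEL_IMPL_TYPE": "vllm", "TPU_BACKEND_TYPE": "jax"}
ARGS = [
    "serve",
    "--seed", "42",
    "--model", "Qwen/Qwen2.5-1.5B-Instruct",
    "--max_model_len", "2048",
    "--max-num-batched-tokens", "4096",
    "--max-num-seqs", "512",
    "--no-enable-prefix-caching",
    "--disable-log-requests",
    "--tensor-parallel-size", "1",
]


def to_shell_command(program, args, env):
    """Hypothetical helper: render env vars plus argv as one POSIX shell line."""
    # Each variable becomes a NAME=value prefix, quoted for the shell if needed.
    prefix = [f"{name}={shlex.quote(value)}" for name, value in env.items()]
    # shlex.join quotes every argument so the line can be pasted into a shell.
    return " ".join(prefix + [shlex.join([program, *args])])


# Assumption: "vllm" resolves on PATH; substitute the "program" path otherwise.
command = to_shell_command("vllm", ARGS, ENV)
print(command)
```

Pasting the printed line into a terminal with the same conda environment active should start the server with the same settings as the debug launch.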