
vLLM
Subscription: Prime
Subscription: Prime
A high-throughput and memory-efficient inference and serving engine for LLMs.
| Version | App version | Registered | Digest | Size |
|---|---|---|---|---|

A high-throughput and memory-efficient inference and serving engine for LLMs.
| Version | App version | Registered | Digest | Size |
|---|---|---|---|---|