Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2024
#3861 opened Apr 4, 2024 by simon-mo · open, 30 comments

v0.5.0 Release Tracker
#5224 by simon-mo, closed Jun 12, 2024 · 9 comments
Issues list

[Bug]: Prefix Caching with Multi-Lora Support · bug
#5475 opened Jun 12, 2024 by curiositywan

[Bug]: Error when --tensor-parallel-size > 1 · bug
#5458 opened Jun 12, 2024 by javi111717

[Bug]: vllm v0.5.0 internal assert failed · bug
#5450 opened Jun 12, 2024 by changshivek

multilora_inference errors when calling qwen2-1.5b · documentation
#5445 opened Jun 12, 2024 by zigangzhao-ai

[Bug]: v0.4.3 AsyncEngineDeadError · bug
#5443 opened Jun 12, 2024 by changshivek

[Bug]: TypeError: a bytes-like object is required, not 'str' · bug
#5440 opened Jun 12, 2024 by yaoyasong

[Feature]: Support [RecurrentGemmaForCausalLM] · new model
#5431 opened Jun 12, 2024 by sung-ho-moon

[Bug]: The vllm service takes two hours to start because of NCCL · bug
#5405 opened Jun 11, 2024 by zhaotyer

[Bug]: topk=1 and temperature=0 cause different output in vllm (see the sketch after this list) · bug
#5404 opened Jun 11, 2024 by rangehow
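
Issue #5404 claims that two configurations which should both decode greedily, temperature=0 and top_k=1, can produce different text. A minimal sketch of that comparison using vLLM's public LLM/SamplingParams API; the model name and prompt are placeholders for illustration, not taken from the report:

```python
from vllm import LLM, SamplingParams

# Placeholder model, not the one used in the issue report.
llm = LLM(model="facebook/opt-125m")

# Two nominally greedy settings: temperature=0 forces argmax decoding,
# and top_k=1 restricts sampling to the single most likely token.
greedy_by_temperature = SamplingParams(temperature=0)
greedy_by_top_k = SamplingParams(temperature=1.0, top_k=1)

prompt = "The capital of France is"
out_a = llm.generate([prompt], greedy_by_temperature)[0].outputs[0].text
out_b = llm.generate([prompt], greedy_by_top_k)[0].outputs[0].text

# If greedy decoding were fully deterministic, these would be equal;
# the report says they can differ.
print(out_a == out_b)
```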