Skip to content

Questions About VLLM Parallel Inference #7

@tulvgengenr

Description

@tulvgengenr

In evaluate_vllm.sh and evaluate_72B_vllm.sh, I see max_workers=1. This configuration fails to leverage VLLM's parallel processing advantages. However, when I set max_workers > 1, an error occurs. Have you encountered this issue?

Metadata

Metadata

Assignees

Labels

help wantedExtra attention is needed

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions