
[Model] Add T5Gemma2 model plugin integration#8

Open
akh64bit wants to merge 10 commits into vllm-project:master from akh64bit:ak/t5gemma2

Conversation


akh64bit commented Mar 5, 2026

This PR implements the T5Gemma2 encoder-decoder model as an out-of-tree vLLM plugin, moving the implementation from the core vllm repository as suggested in vllm-project/vllm#32617.

Changes

  • Migrated the model implementation into vllm_bart_plugin/t5gemma2.py.
  • Fixed the q, k, and v tensor reshaping bugs in attention inputs (MMEncoderAttention and vLLM's Attention expect (num_tokens, num_heads, head_dim), not (1, num_tokens, hidden_size)).
  • Ensured residual connections are correctly placed in the T5Gemma2DecoderLayer block.
  • Ensured the RoPE (rotary_emb) is correctly applied during the forward pass.
  • Implemented SupportsMultiModal interface mapping.
  • Updated Attention imports to work with the latest vLLM versions.
  • Registered the model within vllm_bart_plugin/__init__.py.
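The q/k/v reshaping fix described above can be illustrated with a small, self-contained sketch. The shapes and sizes below are hypothetical and chosen only to show the layout change; this is not the PR's actual code:

```python
import numpy as np

# Hypothetical dimensions for illustration only.
num_tokens, num_heads, head_dim = 4, 8, 64
hidden_size = num_heads * head_dim

# Layout produced by the buggy path: (1, num_tokens, hidden_size).
q = np.zeros((1, num_tokens, hidden_size))

# Layout that MMEncoderAttention and vLLM's Attention expect:
# (num_tokens, num_heads, head_dim).
q_fixed = q.reshape(num_tokens, num_heads, head_dim)

print(q_fixed.shape)  # (4, 8, 64)
```

The same reshape applies to the k and v tensors before they are handed to the attention backend.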

How to test locally

To run this plugin alongside vLLM, first install vLLM and the required transformers fork, then install the plugin in editable mode:

```shell
# 1. Clone and install the custom transformers fork locally
git clone https://github.com/akh64bit/transformers.git -b t5gemma2
cd transformers
pip install -e .

# 2. Install the plugin in editable mode
cd ../bart-plugin
pip install -e .

# 3. Test loading the model using standard vLLM offline inference
python -c "
from vllm import LLM
# Will auto-load from the bart-plugin registry
llm = LLM(model='google/t5gemma-2-270m-270m', trust_remote_code=True)
print('Model loaded successfully!')
"
```
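The auto-loading in step 3 relies on vLLM's out-of-tree model registration. A minimal sketch of what the registration in vllm_bart_plugin/__init__.py might look like follows; the architecture string, module path, and class name here are assumptions for illustration, not necessarily the PR's actual identifiers:

```python
def register():
    """Register the T5Gemma2 model with vLLM's model registry.

    Called by vLLM's plugin machinery at startup. The import is kept
    inside the function so merely importing the package does not
    require vLLM to be installed.
    """
    from vllm import ModelRegistry

    # Hypothetical names: the HF architecture string and the
    # "module:class" path may differ in the actual plugin.
    ModelRegistry.register_model(
        "T5Gemma2ForConditionalGeneration",
        "vllm_bart_plugin.t5gemma2:T5Gemma2ForConditionalGeneration",
    )
```

For vLLM to discover the plugin automatically, the package would also expose this function through an entry point (e.g. in the `vllm.general_plugins` group) in its packaging metadata.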

Signed-off-by: Akhilesh Kumar <akhilbussiness@gmail.com>
Author

akh64bit commented Mar 5, 2026

I've added an example_t5gemma2_usage.py script to demonstrate how to use the model with the plugin. You can test it out by running python example_t5gemma2_usage.py after following the setup instructions in the PR description.

akh64bit added 8 commits March 5, 2026 03:49
@Bullish-Design

Awesome! I've been looking for a good way to experiment with T5gemma2. How do you find its performance in comparison with other models of similar size? Are there any issues/limitations/things to be aware of when using it with vLLM?

Collaborator

@NickLucche NickLucche left a comment


Thanks for contributing @akh64bit!
Will look to get this merged after v0.16 support lands, as some of the changes are related to moving imports around (to sync with upstream).
