Update transformers to v5.x, unsloth, and add MoE LoRA conversion #576
Open
Conversation
Update core dependencies for the transformers v5 ecosystem:
- transformers: >=4.55.2,<=4.57.3 → >=5.1.0
- unsloth: 2025.12.9 → 2026.2.1
- unsloth-zoo: 2025.12.7 → 2026.2.1 (+ updated VCS pin)
- trl: 0.20.0 → >=0.28.0
- peft: >=0.14.0 → >=0.18.0 (required by transformers v5)

Fix transformers v5 breaking changes:
- Replace the removed dummy_pt_objects import with a direct transformers import
- Update the masking_utils patch return type (now returns 5 values)
- Remove deprecated TrainerArgs fields (overwrite_output_dir, jit_mode_eval, mp_parameters, logging_dir, fp16_backend, push_to_hub_token/model_id/organization)

Add a MoE LoRA adapter conversion utility for vLLM compatibility:
- Unsloth + transformers v5 saves MoE LoRA as fused 2D tensors
- vLLM expects per-expert format
- Auto-detect and convert after checkpoint save

Closes #575

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
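The fused-to-per-expert conversion described above can be sketched as follows. This is illustrative only: the real utility lives in src/art/utils/convert_moe_lora.py, and the tensor layout assumed here (experts stacked along dim 0 of the fused LoRA matrix, addressed by a hypothetical `.experts.` key pattern) may differ from what unsloth actually writes.

```python
# Hedged sketch: split fused MoE LoRA tensors into the per-expert keys vLLM
# expects. Layout and key naming are assumptions for illustration.
import numpy as np


def split_fused_expert_lora(
    state: dict[str, np.ndarray], num_experts: int
) -> dict[str, np.ndarray]:
    """Turn each fused `...experts...` tensor into one tensor per expert."""
    out: dict[str, np.ndarray] = {}
    for key, tensor in state.items():
        if ".experts." in key and tensor.shape[0] % num_experts == 0:
            # Slice the stacked matrix into equal per-expert chunks along dim 0
            for i, chunk in enumerate(np.split(tensor, num_experts, axis=0)):
                out[key.replace(".experts.", f".experts.{i}.")] = chunk
        else:
            out[key] = tensor  # non-MoE tensors pass through untouched
    return out
```

Non-MoE checkpoints contain no matching keys, so the conversion is naturally a no-op for them, matching the auto-detect behavior described in the commit.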
Unsloth 2026.2.1 requires trl>0.18.2,!=0.19.0,<=0.24.0. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Unsloth 2026.2.1's pyproject.toml has overly strict constraints (transformers<=4.57.6, trl<=0.24.0) but the February-2026 release notes confirm v5.1.0 + trl 0.27.1 work well. Use uv override-dependencies to allow the upgrade. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
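uv supports overriding transitive constraints via `override-dependencies`. A minimal sketch of the pyproject.toml change described above (the exact specifiers in the PR may differ):

```toml
[tool.uv]
override-dependencies = [
    # Bypass unsloth's transformers<=4.57.6 pin; v5.1.0 is confirmed working
    # per the February-2026 unsloth release notes.
    "transformers>=5.1.0",
]
```

Unlike a plain version bump, an override replaces the constraint everywhere in the resolution, so unsloth's own pin no longer blocks the upgrade.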
Transformers v5 removed `warnings_issued` from PreTrainedModel, but Unsloth's GRPOTrainer still accesses it during initialization. Add it as an empty dict on the PEFT model before creating the trainer. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
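The shim described above can be sketched as a small helper (`ensure_warnings_issued` is a hypothetical name; the PR applies the fix inline before constructing the trainer):

```python
# Hedged sketch of the compat shim: GRPOTrainer reads `model.warnings_issued`
# during __init__, so make sure the attribute exists on the PEFT model.
def ensure_warnings_issued(model) -> None:
    """Add an empty `warnings_issued` dict if model patching stripped it."""
    if not hasattr(model, "warnings_issued"):
        model.warnings_issued = {}
```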
…ers v5 Transformers v5 changed apply_chat_template to return BatchEncoding by default when tokenize=True. Add return_dict=False to all calls that expect list[int] return type. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The bradhilton/unsloth-zoo fork is at version 2025.8.4 which is missing modules needed by unsloth 2026.2.1 (e.g. unsloth_zoo.device_type). Switch to the official PyPI release which matches unsloth 2026.2.1. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
These changes were not needed for the transformers v5 upgrade: - backend.vcs.txt: not used for installation (pyproject.toml handles deps) - model.py TrainerArgs: TypedDict fields don't cause runtime errors Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Remove fields that transformers v5 dropped from TrainingArguments: overwrite_output_dir, logging_dir, jit_mode_eval, half_precision_backend, tpu_num_cores, past_index, fp16_backend, push_to_hub_model_id, push_to_hub_organization, push_to_hub_token, mp_parameters, torchdynamo, ray_scope. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
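Illustrative only: one way to express the cleanup is a filter that drops the removed fields from a trainer-args dict before it reaches TrainingArguments. The field set is copied from the commit above; `strip_removed_fields` is a hypothetical helper, not code from the PR.

```python
# Fields dropped from transformers.TrainingArguments in v5 (per the commit).
REMOVED_IN_V5 = {
    "overwrite_output_dir", "logging_dir", "jit_mode_eval",
    "half_precision_backend", "tpu_num_cores", "past_index", "fp16_backend",
    "push_to_hub_model_id", "push_to_hub_organization", "push_to_hub_token",
    "mp_parameters", "torchdynamo", "ray_scope",
}


def strip_removed_fields(trainer_args: dict) -> dict:
    """Return a copy of trainer_args without fields removed in v5."""
    return {k: v for k, v in trainer_args.items() if k not in REMOVED_IN_V5}
```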
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
trl was originally pinned to 0.20.0. No reason to loosen it — 0.20.0 already satisfies unsloth's trl<=0.24.0 constraint. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Keep both the cast(list[int], ...) wrapper from main and the return_dict=False parameter needed for transformers v5. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Instead of adding return_dict=False to every call site, patch PreTrainedTokenizerBase.apply_chat_template once in patches.py to default return_dict=False. This restores transformers v4 behavior (returning list[int]) globally. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
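The single-patch approach can be sketched with a generic default-kwarg wrapper (`default_return_dict_false` is a hypothetical name; the real patch lives in patches.py and targets `PreTrainedTokenizerBase.apply_chat_template`):

```python
# Hedged sketch: wrap a method so `return_dict` defaults to False when the
# caller doesn't pass it, restoring the transformers v4 list[int] behavior.
import functools


def default_return_dict_false(fn):
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        kwargs.setdefault("return_dict", False)  # explicit callers still win
        return fn(*args, **kwargs)
    return wrapper

# Applied once at import time, e.g.:
# PreTrainedTokenizerBase.apply_chat_template = default_return_dict_false(
#     PreTrainedTokenizerBase.apply_chat_template
# )
```

Because `setdefault` only fills in missing keys, any call site that explicitly requests `return_dict=True` keeps the v5 BatchEncoding behavior.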
The attribute wasn't removed in transformers v5 — Unsloth's model patching can leave the PEFT model without it. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Summary
Upgrade transformers to v5 and unsloth to 2026.2.1, fixing the resulting breaking changes (`apply_chat_template` return types, removed `warnings_issued`, deprecated TrainerArgs fields).
Changes
Dependencies (pyproject.toml, requirements/backend.vcs.txt)
- transformers>=5.1.0 (was >=4.55.2,<=4.57.3)
- unsloth==2026.2.1 (was 2025.12.9)
- unsloth-zoo==2026.2.1 via PyPI (was a VCS pin to the bradhilton fork)
- peft>=0.18.0 (was >=0.14.0)
- override-dependencies for transformers and trl to bypass unsloth's overly strict PyPI constraints (confirmed working per the unsloth Feb 2026 release notes)
Code fixes
- src/art/unsloth/service.py: fix import path (GenerationMixin and PreTrainedModel now come directly from transformers instead of transformers.utils.dummy_pt_objects); add the warnings_issued compat shim; integrate convert_checkpoint_if_needed calls
- src/art/preprocessing/tokenize.py: add return_dict=False to apply_chat_template calls (the v5 default changed from list[int] to BatchEncoding)
- src/art/transformers/patches.py: update the return type for v5's _preprocess_mask_arguments (now returns 5 values instead of 4)
- src/art/dev/model.py: remove deprecated TrainerArgs fields
- src/art/tinker/server.py: add return_dict=False to apply_chat_template
New files
- src/art/utils/convert_moe_lora.py: converts fused MoE LoRA adapters (produced by unsloth + transformers v5) to per-expert format for vLLM compatibility; runs automatically after checkpoint save and is a no-op for non-MoE models
Test results
Tested on H200 GPU cluster with:
Test: 3-step yes-no-maybe RL training with Qwen2.5-7B-Instruct (LocalBackend)
Full pipeline verified: model loading → inference (vLLM) → rollouts → tokenization → training (unsloth) → checkpoint save → LoRA swap → resume inference.
Test plan
Closes #575
🤖 Generated with Claude Code