
Multi-LoRA SFT support FSDP2 #155

Draft
kevssim wants to merge 14 commits into modelscope:main from kevssim:multilora_fsdp

Conversation

Collaborator

@kevssim kevssim commented Apr 14, 2026

PR type

  • Bug Fix
  • New Feature
  • Document Updates
  • More Models or Datasets Support

PR information

Write the detailed information for this PR.

Experiment results

Paste your experiment results here (if needed).

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request implements FSDP2 support for MultiLoraTransformersModel by integrating it into the shared strategy and lazy-wrap lifecycle and introducing sharding-aware parameter access helpers. The review identifies critical bugs in the distributed tensor handling:

  • _write_param_tensor may double-shard data that is already local.
  • set_state_dict risks shape mismatches when applying a global state dict to local shards.
  • get_state_dict returns sharded tensors, which could lead to corrupt checkpoints.

The review also recommends refactoring the model's initialization to properly delegate to the parent class and moving internal imports to the module level.
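The sharded-vs-global distinction behind these bugs can be sketched as follows. This is a minimal illustration, not code from the PR: the helper names (`gather_full_state_dict`, `write_global_into_local`) are hypothetical, and it assumes the FSDP2 convention that parameters are DTensor shards split along dim 0, with `DTensor.full_tensor()` available to all-gather the global tensor.

```python
import torch


def gather_full_state_dict(state_dict):
    """Hypothetical get_state_dict fix: materialize full tensors from
    FSDP2 shards before checkpointing, so saved files are not sharded."""
    full = {}
    for name, t in state_dict.items():
        # FSDP2 wraps parameters as DTensor shards; full_tensor()
        # all-gathers the global tensor. Plain tensors (single-process
        # runs) pass through unchanged.
        if hasattr(t, "full_tensor"):
            t = t.full_tensor()
        full[name] = t.detach().cpu()
    return full


def write_global_into_local(local_shard, global_tensor, rank, world_size):
    """Hypothetical set_state_dict counterpart: copy only this rank's
    slice of a global tensor into the local shard, avoiding the shape
    mismatch of writing global-shaped data into a dim-0 shard."""
    chunk = torch.chunk(global_tensor, world_size, dim=0)[rank]
    local_shard.copy_(chunk)
```

The key invariant is that each boundary crossing (checkpoint save/load) converts between exactly one representation and the other; writing already-local data through the sharding path again is what the double-sharding bug in _write_param_tensor amounts to.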

Comment thread src/twinkle/model/multi_lora.py
Comment thread src/twinkle/model/multi_lora.py Outdated
Comment thread src/twinkle/model/multi_lora.py
Comment thread src/twinkle/model/transformers/multi_lora_transformers.py
Comment thread src/twinkle/model/multi_lora.py Outdated

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant