Load model with mismatched sizes #1107

qubvel · 2025-03-26T17:34:16Z

Allows to load pretrained model with different number of channels

For example:

import segmentation_models_pytorch as smp

model = smp.from_pretrained("smp-hub/segformer-b2-1024x1024-city-160k", classes=5, strict=False)

Reported in:

KeyError when using Segformer architecture in segmentation_models_pytorch #1019

codecov · 2025-03-26T17:36:03Z

Codecov Report

Attention: Patch coverage is 70.37037% with 8 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
segmentation_models_pytorch/base/model.py	70.37%	8 Missing ⚠️

Files with missing lines	Coverage Δ
segmentation_models_pytorch/base/model.py	`83.56% <70.37%> (+5.22%)`	⬆️

... and 2 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copilot

Pull Request Overview

This PR adds support for loading pretrained models with mismatched channel sizes by filtering out incompatible weights during state dict loading. Key changes include:

Adding a test in tests/test_base.py to verify mismatched keys are handled correctly.
Updating load_state_dict in segmentation_models_pytorch/base/model.py to filter mismatched weights and issue a warning.
Adjusting the inference notebook to use up-to-date installation instructions.

Reviewed Changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated 1 comment.

File	Description
tests/test_base.py	Adds a test to ensure models load with mismatched keys while only specific layers are adjusted.
segmentation_models_pytorch/base/model.py	Modifies load_state_dict to filter out mismatched weights and warn the user.
examples/segformer_inference_pretrained.ipynb	Updates installation instructions to ensure the latest library versions are used.

Files not reviewed (1)

docs/save_load.rst: Language not supported

Comments suppressed due to low confidence (2)

segmentation_models_pytorch/base/model.py:138

[nitpick] The warning message uses all caps and strong language, which may not be clear; consider rephrasing it to a more neutral and descriptive message (e.g., 'Mismatched key shapes detected; please retrain the model to update these layers:').

text = f"\n\n !!!!!! Mismatched keys !!!!!!\n\nYou should TRAIN the model to use it:\n{str_keys}\n"

segmentation_models_pytorch/base/model.py:139

[nitpick] Using 'stacklevel=-1' may not provide the expected context for the warning; consider using a more conventional stacklevel (such as 2) to point to the relevant caller.

warnings.warn(text, stacklevel=-1)

tests/test_base.py

qubvel added 4 commits March 26, 2025 17:29

Add a way to load model with mismatched sizes

22af917

Add test

93a832c

Update docs

28c6723

(unrelated) update packages in example

5fd265b

qubvel mentioned this pull request Mar 26, 2025

KeyError when using Segformer architecture in segmentation_models_pytorch #1019

Closed

qubvel requested review from adamjstewart and Copilot March 26, 2025 17:38

Copilot AI reviewed Mar 26, 2025

View reviewed changes

tests/test_base.py Outdated Show resolved Hide resolved

Fix typo

d43109d

qubvel merged commit d9a9c75 into main Mar 28, 2025
16 of 17 checks passed

qubvel mentioned this pull request Apr 16, 2025

Release v0.5.0 #1127

Merged

qubvel deleted the feature/load-mismatched-sizes branch April 18, 2025 12:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load model with mismatched sizes #1107

Load model with mismatched sizes #1107

qubvel commented Mar 26, 2025 •

edited

Loading

codecov bot commented Mar 26, 2025 •

edited

Loading

Copilot AI left a comment

Load model with mismatched sizes #1107

Load model with mismatched sizes #1107

Conversation

qubvel commented Mar 26, 2025 • edited Loading

codecov bot commented Mar 26, 2025 • edited Loading

Codecov Report

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

qubvel commented Mar 26, 2025 •

edited

Loading

codecov bot commented Mar 26, 2025 •

edited

Loading