Fix nvfp4 convert_and_update_tensor shape check by skydoorkai · Pull Request #2670 · NVIDIA/TransformerEngine

skydoorkai · 2026-02-10T09:06:57Z

Description

This is to fix #2607

For nvfp4's columnwise data , it is using enforced 2D shape. Thus, the original check would fail if rowwise_data shape is 3D shape.

To fix :
(1) expected_data should be enforced into 2D shape from rowwise_data's shape.
(2) use rowwise_data's shape as the “ground truth" shape.

Fixes # (issue)

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactoring

Changes

Please list the changes introduced in this PR:

Change A
Change B

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: 乙划 <zht108229@antgroup.com>

for more information, see https://pre-commit.ci

greptile-apps · 2026-02-10T09:09:53Z

Greptile Overview

Greptile Summary

Fixes shape mismatch error in NVFP4 quantization when using N-dimensional tensors with both rowwise and columnwise quantization modes.

Key changes:

Added compressShapeTo2D utility to flatten N-D shapes to 2D for comparison
Modified shape validation in convert_and_update_tensor to compare compressed 2D shapes instead of raw shapes
Uses rowwise shape as the ground truth since it preserves the original N-D structure
Added comprehensive test case for 3D tensor quantization with both modes

Technical context:
NVFP4 columnwise quantization enforces 2D storage by flattening all dimensions except the last, while rowwise preserves the original N-D shape. The previous direct shape comparison failed for N-D tensors (N>2) because it compared a 3D rowwise shape like [32, 4, 128] against a 2D columnwise shape like [128, 128]. The fix compresses both to equivalent 2D representations ([128, 128]) before comparison.

Confidence Score: 5/5

This PR is safe to merge with minimal risk
The fix correctly addresses the shape mismatch issue with a well-documented helper function, includes proper test coverage for the regression, and follows the existing code patterns. The logic is sound: compressing both shapes to 2D for comparison is the right approach given that columnwise storage is always 2D while rowwise preserves N-D structure.
No files require special attention

Important Files Changed

Filename	Overview
transformer_engine/pytorch/csrc/quantizer.cpp	Adds `compressShapeTo2D` helper to properly compare rowwise (N-D) and columnwise (2D) tensor shapes for NVFP4 quantization
tests/pytorch/nvfp4/test_nvfp4_quantize_exact.py	Adds regression test for 3D tensor quantization with both rowwise and columnwise modes, verifying shape consistency

Sequence Diagram

sequenceDiagram
    participant User
    participant NVFP4Quantizer
    participant convert_and_update_tensor
    participant compressShapeTo2D

    User->>NVFP4Quantizer: quantize 3D tensor [32,4,128]
    NVFP4Quantizer->>NVFP4Quantizer: Create rowwise data [32,4,64]
    NVFP4Quantizer->>NVFP4Quantizer: Create columnwise data [128,64]
    
    User->>NVFP4Quantizer: update_quantized()
    NVFP4Quantizer->>convert_and_update_tensor: Validate shapes
    
    convert_and_update_tensor->>convert_and_update_tensor: convert_shape_back_from_fp4(columnwise, true)<br/>Returns: [128, 128]
    convert_and_update_tensor->>convert_and_update_tensor: convert_shape_back_from_fp4(rowwise, false)<br/>Returns: [32, 4, 128]
    
    convert_and_update_tensor->>compressShapeTo2D: Compress rowwise [32,4,128]
    compressShapeTo2D-->>convert_and_update_tensor: [128, 128]
    
    convert_and_update_tensor->>compressShapeTo2D: Compress columnwise [128,128]
    compressShapeTo2D-->>convert_and_update_tensor: [128, 128]
    
    convert_and_update_tensor->>convert_and_update_tensor: Compare [128,128] == [128,128] ✓
    convert_and_update_tensor->>convert_and_update_tensor: Use rowwise shape [32,4,128] as ground truth
    
    convert_and_update_tensor-->>User: Success

_{Last reviewed commit: 075ebf6}

greptile-apps

_{1 file reviewed, 2 comments}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-02-10T09:10:05Z

Additional Comments (2)

transformer_engine/pytorch/csrc/quantizer.cpp
Missing include for std::accumulate

compressShapeTo2D uses std::accumulate and std::multiplies, but this file doesn’t include <numeric> (and possibly <functional>). This will fail to compile on toolchains that don’t get these transitively. Add the required headers at the top of quantizer.cpp.

transformer_engine/pytorch/csrc/quantizer.cpp
Shape check can reject valid tensors

compressShapeTo2D(expected_shape) only flattens the rowwise shape before comparing to shape derived from columnwise_data. If convert_shape_back_from_fp4(getTensorShape(*columnwise_data), true) ever returns a non-2D shape (or a different 2D flattening), this check will still fail even when both buffers represent the same logical tensor. Consider normalizing both sides to the same 2D convention (e.g., apply the same compression to shape as well) before comparing.

ptrendx · 2026-02-10T19:08:45Z

@skydoorkai I think the comments from Greptile make sense (even though they are not very high priority). Could you fix those? Other than that LGTM.

Signed-off-by: 乙划 <zht108229@antgroup.com>

…p4_shape_check

for more information, see https://pre-commit.ci

skydoorkai · 2026-02-11T05:51:58Z

@skydoorkai I think the comments from Greptile make sense (even though they are not very high priority). Could you fix those? Other than that LGTM.

Updated according to Greptile comments to add headers and compare 2D shapes.

greptile-apps

_{1 file reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

transformer_engine/pytorch/csrc/quantizer.cpp

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> Signed-off-by: Przemyslaw Tredak <ptrendx@gmail.com>

ptrendx · 2026-02-11T18:00:48Z

/te-ci pytorch

greptile-apps

_{1 file reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

…p4_shape_check

greptile-apps

_{1 file reviewed, 2 comments}

_{Edit Code Review Agent Settings | Greptile}

transformer_engine/pytorch/csrc/quantizer.cpp

Signed-off-by: 乙划 <zht108229@antgroup.com>

for more information, see https://pre-commit.ci

greptile-apps

_{2 files reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

fix nvfp4 convert_and_update_tensor shape check

a35de59

Signed-off-by: 乙划 <zht108229@antgroup.com>

skydoorkai mentioned this pull request Feb 10, 2026

NVFP4Quantizer::convert_and_update_tensor columnwise_data shape does not match 'expected_shape' from rowwise_data #2607

Open

[pre-commit.ci] auto fixes from pre-commit.com hooks

c7eaa05

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Feb 10, 2026

View reviewed changes

乙划 and others added 3 commits February 11, 2026 13:44

add headers and check 2D shapes

c0a8aa6

Signed-off-by: 乙划 <zht108229@antgroup.com>

Merge remote-tracking branch 'ten/fix_nvfp4_shape_check' into fix_nvf…

5b7f038

…p4_shape_check

[pre-commit.ci] auto fixes from pre-commit.com hooks

489f060

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Feb 11, 2026

View reviewed changes

transformer_engine/pytorch/csrc/quantizer.cpp Outdated Show resolved Hide resolved

Fix

11fc532

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> Signed-off-by: Przemyslaw Tredak <ptrendx@gmail.com>

greptile-apps bot reviewed Feb 11, 2026

View reviewed changes

乙划 added 2 commits February 13, 2026 15:12

Merge branch 'main' into fix_nvfp4_shape_check

fc4f6e2

Merge remote-tracking branch 'ten/fix_nvfp4_shape_check' into fix_nvf…

36d7c65

…p4_shape_check

greptile-apps bot reviewed Feb 13, 2026

View reviewed changes

transformer_engine/pytorch/csrc/quantizer.cpp Show resolved Hide resolved

transformer_engine/pytorch/csrc/quantizer.cpp Show resolved Hide resolved

乙划 and others added 2 commits February 13, 2026 16:51

add unittest and doctring

486da66

Signed-off-by: 乙划 <zht108229@antgroup.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

075ebf6

for more information, see https://pre-commit.ci

greptile-apps bot reviewed Feb 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix nvfp4 convert_and_update_tensor shape check#2670

Fix nvfp4 convert_and_update_tensor shape check#2670
skydoorkai wants to merge 10 commits intoNVIDIA:mainfrom
skydoorkai:fix_nvfp4_shape_check

skydoorkai commented Feb 10, 2026

Uh oh!

greptile-apps bot commented Feb 10, 2026 •

edited

Loading

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot commented Feb 10, 2026

Uh oh!

ptrendx commented Feb 10, 2026

Uh oh!

skydoorkai commented Feb 11, 2026

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

ptrendx commented Feb 11, 2026

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

skydoorkai commented Feb 10, 2026

Description

Type of change

Changes

Checklist:

Uh oh!

greptile-apps bot commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Overview

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Feb 10, 2026

Uh oh!

ptrendx commented Feb 10, 2026

Uh oh!

skydoorkai commented Feb 11, 2026

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ptrendx commented Feb 11, 2026

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

greptile-apps bot commented Feb 10, 2026 •

edited

Loading