Align rope init in lmdeploy by RangiLyu · Pull Request #4466 · InternLM/lmdeploy

RangiLyu · 2026-03-25T08:48:04Z

Align all rope initialization with

lmdeploy/lmdeploy/pytorch/backends/default/rotary_embedding.py

Line 89 in 90245a3

    
           inv_freq = 1.0 / (self.base**(torch.arange(0, self.dim, 2, dtype=torch.int64).float() / self.dim))

There is a slight difference in numerical precision between CPU and GPU, which leads to inconsistencies between training and inference in RL.

Copilot

Pull request overview

This PR standardizes RoPE inverse-frequency (inv_freq) initialization across several PyTorch model/backends to match the default rotary embedding implementation, aiming to reduce CPU vs GPU numerical differences that can cause RL training/inference inconsistencies.

Changes:

Update Qwen3.5 default RoPE parameter computation to compute inv_freq in float on CPU first, then move to the target device.
Update Qwen2-VL / Qwen2.5-VL vision rotary embeddings to follow the same arange(int64) -> float -> move device pattern.
Update dlinfer rotary embedding initialization to follow the same pattern before moving to CUDA.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File	Description
lmdeploy/pytorch/models/qwen3_5.py	Adjusts default RoPE `inv_freq` initialization to align numerically with the default backend implementation.
lmdeploy/pytorch/models/qwen2_vl.py	Aligns vision rotary embedding `inv_freq` init pattern (CPU float compute, then `.to(device)`).
lmdeploy/pytorch/models/qwen2_5_vl.py	Same alignment for Qwen2.5-VL vision rotary embedding `inv_freq`.
lmdeploy/pytorch/backends/dlinfer/rotary_embedding.py	Aligns dlinfer base rotary embedding `inv_freq` init pattern (then moves to CUDA).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lmdeploy/pytorch/models/qwen3_5.py

Align rope init in lmdeploy

2f8bbc2

Copilot AI review requested due to automatic review settings March 25, 2026 08:48

Copilot started reviewing on behalf of RangiLyu March 25, 2026 08:48 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

lmdeploy/pytorch/models/qwen3_5.py Show resolved Hide resolved

lvhan028 approved these changes Mar 25, 2026

View reviewed changes

lvhan028 added the improvement label Mar 25, 2026

CUHKSZzxy mentioned this pull request Mar 25, 2026

unify rope device #4467

Open

lvhan028 merged commit 9d1dda3 into InternLM:main Mar 26, 2026
5 of 9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Align rope init in lmdeploy#4466

Align rope init in lmdeploy#4466
lvhan028 merged 1 commit intoInternLM:mainfrom
RangiLyu:lcq/rope-init

RangiLyu commented Mar 25, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

RangiLyu commented Mar 25, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants