Draft model update params by CUHKSZzxy · Pull Request #4452 · InternLM/lmdeploy

CUHKSZzxy · 2026-03-24T04:18:11Z

Support update_params for the qwen3.5 draft model.
Make get_schedule_metrics async (sync -> async) to avoid a race condition in the ZMQ RPC client while a synchronous update_params call is in progress.

Copilot

Pull request overview

This PR extends runtime parameter/weight updates to also cover the speculative-decoding draft model, and changes schedule-metrics retrieval to be async-safe to avoid ZMQ RPC client race conditions during synchronous update_params calls.

Changes:

Make get_schedule_metrics async through AsyncEngine and the OpenAI API server metrics loop.
Switch MP engine schedule-metrics RPC to an async RPC path (_collective_rpc_async).
Update ModelAgent to apply update_params, sleep, and wakeup to the speculative draft model as well as the main model.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| `lmdeploy/serve/openai/api_server.py` | Await schedule-metrics fetch in the periodic metrics logging task. |
| `lmdeploy/serve/core/async_engine.py` | Convert `get_schedule_metrics` to async and support both sync/async engine implementations. |
| `lmdeploy/pytorch/engine/mp_engine/base.py` | Make schedule-metrics retrieval async via `_collective_rpc_async`. |
| `lmdeploy/pytorch/engine/model_agent/agent.py` | Add draft-model support for `update_params` and include draft model in sleep/wakeup flows. |
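One way to support both sync and async engine implementations behind a single async `get_schedule_metrics` caller is to call the getter and await the result only when it is awaitable. The following is a minimal sketch of that pattern, not the code from this PR: `SyncEngine`, `AsyncMPEngine`, `fetch_schedule_metrics`, and the `active_seqs` field are hypothetical names used only for illustration.

```python
import asyncio
import inspect


class SyncEngine:
    """Hypothetical engine whose metrics getter is a plain function."""

    def get_schedule_metrics(self):
        return {'active_seqs': 1}


class AsyncMPEngine:
    """Hypothetical engine whose metrics getter is a coroutine function."""

    async def get_schedule_metrics(self):
        await asyncio.sleep(0)  # stands in for an async RPC round trip
        return {'active_seqs': 2}


async def fetch_schedule_metrics(engine):
    """Call the getter and await the result only if it is awaitable."""
    result = engine.get_schedule_metrics()
    if inspect.isawaitable(result):
        result = await result
    return result


async def main():
    engines = (SyncEngine(), AsyncMPEngine())
    return [await fetch_schedule_metrics(engine) for engine in engines]


metrics = asyncio.run(main())
```

With this shape, a periodic metrics task can always `await` the fetch, regardless of which engine type is behind it.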


Copilot · 2026-03-24T09:55:51Z

lmdeploy/pytorch/engine/model_agent/agent.py

+            if not self.spec_agent.is_enabled():
+                return weights, []
+            main = [(name, weight) for name, weight in weights if not name.startswith('mtp.')]
+            draft = [(name, weight) for name, weight in weights if name.startswith('mtp.')]


Draft-model weight updates will likely fail because draft_weights retain the 'mtp.' prefix, but the spec draft model built by spec_agent is a standalone patched model whose parameter names typically do not include that outer prefix. This can lead to missing-key/KeyError inside load_weights when indexing params_dict[name]. Consider stripping the 'mtp.' prefix (and/or applying an explicit mapping) before passing weights through _rename_weights_iterator/load_weights for spec_model.

Suggested change

-            draft = [(name, weight) for name, weight in weights if name.startswith('mtp.')]
+            # For the draft (spec) model, strip the outer "mtp." prefix from parameter names
+            draft = [(name[len('mtp.'):], weight)
+                     for name, weight in weights
+                     if name.startswith('mtp.')]
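The split discussed in this comment can be sketched as a standalone function. This is an illustration under stated assumptions, not the PR's implementation: `split_main_and_draft`, its `spec_enabled` and `strip_prefix` parameters, and the example weight names are hypothetical, and whether the draft model's parameter names actually omit the `mtp.` prefix is not confirmed in this thread.

```python
MTP_PREFIX = 'mtp.'


def split_main_and_draft(weights, spec_enabled=True, strip_prefix=True):
    """Split (name, weight) pairs into main-model and draft-model lists.

    strip_prefix=True follows the reviewer's suggestion of removing the
    outer 'mtp.' prefix; set it to False to keep names unchanged.
    """
    weights = list(weights)  # allow iterators to be scanned twice
    if not spec_enabled:
        return weights, []
    main = [(name, w) for name, w in weights if not name.startswith(MTP_PREFIX)]
    draft = [
        (name[len(MTP_PREFIX):] if strip_prefix else name, w)
        for name, w in weights
        if name.startswith(MTP_PREFIX)
    ]
    return main, draft


# Hypothetical weight names; integers stand in for tensors.
example = [('model.layers.0.weight', 1), ('mtp.fc.weight', 2)]
main, draft = split_main_and_draft(example)
```

Materializing `weights` into a list first matters when the caller passes a generator, since the input is scanned once for each output list.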

RunningLeon · 2026-03-24T12:18:00Z

lmdeploy/pytorch/engine/model_agent/agent.py


        self.spec_agent.reset_graph_runner()

+    def _get_spec_model(self):


We could move this method into the spec_agent class.

RunningLeon · 2026-03-25T03:57:27Z

lmdeploy/pytorch/engine/model_agent/agent.py

+            return [(k, _construct(v)) for k, v in raw]
+
+        def _split_main_and_draft(weights):
+            if not self.spec_agent.is_enabled() or self.spec_agent.method != 'qwen3_5_mtp':


Consider adding a TODO or a warning message here.
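A guard of the kind suggested here could look like the sketch below. It is only an illustration: `should_update_draft`, `SUPPORTED_SPEC_METHODS`, and the logger name are hypothetical, and the only value taken from the diff is the `'qwen3_5_mtp'` method string.

```python
import logging

logger = logging.getLogger('spec_update_sketch')

# Only the method string comes from the diff; the set itself is hypothetical.
SUPPORTED_SPEC_METHODS = {'qwen3_5_mtp'}


def should_update_draft(spec_enabled, method):
    """Return True when draft-model weights should be updated too."""
    if not spec_enabled:
        return False
    if method not in SUPPORTED_SPEC_METHODS:
        # TODO: extend draft-model update support to other spec methods.
        logger.warning(
            'update_params skips the draft model for spec method %r; '
            'only %s is handled.', method, sorted(SUPPORTED_SPEC_METHODS))
        return False
    return True
```

Logging a warning rather than silently skipping makes it visible when a draft model is left with stale weights after an update.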

CUHKSZzxy added 2 commits March 24, 2026 12:12

support update params for draft model

6f553a8

fix zmq rpc race condition

e944997

CUHKSZzxy requested a review from RunningLeon March 24, 2026 04:28

CUHKSZzxy marked this pull request as ready for review March 24, 2026 09:51

Copilot AI review requested due to automatic review settings March 24, 2026 09:51

Copilot started reviewing on behalf of CUHKSZzxy March 24, 2026 09:52

Copilot AI reviewed Mar 24, 2026


RunningLeon reviewed Mar 24, 2026


CUHKSZzxy added 2 commits March 24, 2026 20:37

only for qwen35 mtp, move get_model function

7a48257

release state cache

b745d7f

RunningLeon reviewed Mar 25, 2026


CUHKSZzxy added 2 commits March 25, 2026 12:06

add TODO

36090b8

Merge branch 'main' into draft-model-update-params

5e9f548


CUHKSZzxy wants to merge 6 commits into InternLM:main from CUHKSZzxy:draft-model-update-params


3 participants

