Conversation
Code Review
This pull request introduces CPU offloading and lazy loading capabilities for the Flux2 model to optimize memory usage. Key changes include a new offload configuration, fallback mechanisms for pinned memory allocation in utils.py, and logic in flux2_runner.py to dynamically load and unload text encoders and VAE modules during inference. Feedback focuses on improving the robustness of device module retrieval, narrowing exception handling when allocating pinned memory, replacing assertions with explicit value errors for configuration validation, and ensuring consistent attribute deletion when unloading modules.
```python
from lightx2v.utils.registry_factory import RUNNER_REGISTER
from lightx2v_platform.base.global_var import AI_DEVICE


torch_device_module = getattr(torch, AI_DEVICE)
```
Using getattr(torch, AI_DEVICE) is risky. If AI_DEVICE is a device string like "cuda:0" or "cpu", this will raise an AttributeError. Typically, AI_DEVICE refers to the device identifier used with torch.device(), while getattr expects a module name like "cuda" or "mps". Additionally, torch does not have a cpu attribute that acts as a device module. Consider extracting the device type (e.g., AI_DEVICE.split(':')[0]) and handling the "cpu" case explicitly to avoid a crash.
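A minimal sketch of the safer pattern described above. The helper name `get_device_module` is illustrative, not from the PR, and the torch module is passed in as a parameter only to keep the sketch self-contained:

```python
def get_device_module(torch_module, ai_device: str):
    """Resolve a torch accelerator module (e.g. torch.cuda) from a device string.

    Extracts the device type so strings like "cuda:0" work, and returns None
    for "cpu", which has no accelerator module with the torch.cuda interface.
    """
    device_type = ai_device.split(":")[0]  # "cuda:0" -> "cuda"
    if device_type == "cpu":
        return None  # callers must skip accelerator-only calls (empty_cache, etc.)
    module = getattr(torch_module, device_type, None)
    if module is None:
        raise ValueError(f"No torch device module for device type {device_type!r}")
    return module
```

Returning `None` for CPU (rather than raising) lets call sites guard accelerator-only operations with a simple `if device_module is not None:` check.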
```diff
-pin_tensor = torch.empty(tensor.shape, pin_memory=True, dtype=dtype)
+try:
+    pin_tensor = torch.empty(tensor.shape, pin_memory=True, dtype=dtype)
+except Exception as e:
```
Catching the generic Exception class is too broad and can mask unrelated errors. For PyTorch memory allocation failures, it is better to catch RuntimeError specifically, as that is what torch.empty typically raises when pinned memory allocation fails.
Suggested change:

```diff
-except Exception as e:
+except RuntimeError as e:
```
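The narrowed exception handling, combined with the PR's fallback idea, can be sketched generically. Allocator callables stand in for the `torch.empty` calls, since the full call site isn't shown in the diff:

```python
def allocate_with_fallback(alloc_pinned, alloc_pageable):
    """Try pinned (page-locked) allocation first; on RuntimeError, which is
    what torch.empty typically raises when pinned memory is exhausted, fall
    back to ordinary pageable memory instead of crashing.
    """
    try:
        return alloc_pinned()
    except RuntimeError:
        return alloc_pageable()

# With PyTorch this would be used roughly as:
#   pin_tensor = allocate_with_fallback(
#       lambda: torch.empty(tensor.shape, pin_memory=True, dtype=dtype),
#       lambda: torch.empty(tensor.shape, dtype=dtype),
#   )
```

Catching only `RuntimeError` keeps genuine bugs (`TypeError`, `KeyboardInterrupt`, etc.) from being silently swallowed by the fallback path.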
```python
    self.load_model()
    self.model.set_scheduler(self.scheduler)
elif self.config.get("lazy_load", False):
    assert self.config.get("cpu_offload", False)
```
Using assert for runtime configuration validation is discouraged because assertions can be disabled in optimized Python execution (using the -O flag). It is better to raise a ValueError to ensure the check is always performed.
Suggested change:

```diff
-assert self.config.get("cpu_offload", False)
+if not self.config.get("cpu_offload", False):
+    raise ValueError("cpu_offload must be enabled when lazy_load is true")
```
```python
text_encoder_output = self.run_text_encoder(prompt, neg_prompt=self.input_info.negative_prompt)
torch.cuda.empty_cache()
if self.config.get("lazy_load", False) or self.config.get("unload_modules", False):
    del self.text_encoders[0]
```
Deleting only the first element of the list leaves self.text_encoders as an empty list []. It is cleaner and more consistent with how self.vae is handled (line 281) to delete the entire attribute, which also avoids potential IndexError if the list is accessed elsewhere while empty.
Suggested change:

```diff
-del self.text_encoders[0]
+del self.text_encoders
```
```python
self.text_encoders = self.load_text_encoder()
text_encoder_output = self.run_text_encoder(prompt, neg_prompt=self.input_info.negative_prompt)
if self.config.get("lazy_load", False) or self.config.get("unload_modules", False):
    del self.text_encoders[0]
```
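The load-run-unload cycle above, with the whole attribute deleted as suggested, can be sketched as follows. `RunnerSketch` and its placeholder loader are illustrative stand-ins for the real runner, not code from the PR:

```python
class RunnerSketch:
    """Minimal stand-in for the runner's lazy text-encoder lifecycle."""

    def load_text_encoder(self):
        # Placeholder for the real loader, which would read weights from disk.
        return ["t5_encoder"]

    def encode(self, lazy_load: bool):
        # Reload only when the attribute is absent, i.e. after a prior unload.
        if not hasattr(self, "text_encoders"):
            self.text_encoders = self.load_text_encoder()
        output = len(self.text_encoders)  # stand-in for running the encoders
        if lazy_load:
            # Delete the whole attribute (mirroring how self.vae is handled)
            # so the hasattr() check above triggers a clean reload next time,
            # instead of leaving an empty list behind.
            del self.text_encoders
        return output
```

Deleting the attribute rather than its first element makes the "is it loaded?" check unambiguous: `hasattr` either sees a populated list or nothing, never an empty shell.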