diff --git a/README.md b/README.md
index d982090..39ac31a 100644
--- a/README.md
+++ b/README.md
@@ -15,27 +15,27 @@ This is the official repository for LTX-Video.
---
-## ๐ **New: LTX-2 is Now Available!**
+## ้ฆๆฎ **New: LTX-2 is Now Available!**
**We're excited to announce [LTX-2](https://github.com/Lightricks/LTX-2) - the next generation of LTX with synchronized audio+video generation!**
LTX-2 is the first DiT-based audio-video foundation model that contains all core capabilities of modern video generation in one model. **LTX-2 is now the primary home for LTX development** and includes significant improvements:
-- ๐ต **Synchronized Audio+Video Generation** - Generate videos with perfectly synchronized audio
-- ๐ฌ **Latest Model** - LTX-2 with improved quality and capabilities
-- ๐ **ComfyUI Integration** - Built into ComfyUI core for seamless workflows
-- ๐ฏ **Advanced Features:**
+- ้ฆๅน **Synchronized Audio+Video Generation** - Generate videos with perfectly synchronized audio
+- ้ฆๅน **Latest Model** - LTX-2 with improved quality and capabilities
+- ้ฆๆฒ **ComfyUI Integration** - Built into ComfyUI core for seamless workflows
+- ้ฆๅน **Advanced Features:**
- Multiple keyframe support
- IC-LoRA control models for precise generation
- Standard LoRA support for style customization
- Latent upsampler for multiscale pipelines
-- ๐ ๏ธ **Training Tools** - LoRA training capabilities
-- ๐ **Comprehensive Documentation** - Full documentation at [https://docs.ltx.video](https://docs.ltx.video)
-- ๐ **Active Development** - Ongoing improvements and community support
+- ้ฆๆด้?**Training Tools** - LoRA training capabilities
+- ้ฆๆ **Comprehensive Documentation** - Full documentation at [https://docs.ltx.video](https://docs.ltx.video)
+- ้ฆๆง **Active Development** - Ongoing improvements and community support
-**[๐ Check out LTX-2 here](https://github.com/Lightricks/LTX-2)**
+**[้ฆๆ Check out LTX-2 here](https://github.com/Lightricks/LTX-2)**
-**[๐ View Documentation](https://docs.ltx.video)**
+**[้ฆๆ View Documentation](https://docs.ltx.video)**
---
@@ -85,7 +85,7 @@ The model supports image-to-video, multi-keyframe conditioning, keyframe-based a
## October 23, 2025: LTX-2 Announced
-Today we announced our newest foundation model, LTX-2. LTX-2 represents a major leap forward from our previous model, LTXV 0.9.8. Hereโs whatโs new:
+Today we announced our newest foundation model, LTX-2. LTX-2 represents a major leap forward from our previous model, LTXV 0.9.8. Here้ฅๆช what้ฅๆช new:
* **Audio + Video, Together**: Visuals and sound are generated in one coherent process, with motion, dialogue, ambience, and music flowing simultaneously.
* **4K Fidelity**: Professional-grade precision with native 4K and up to 50 fps, sharp textures, clean motion, and synchronized audio.
* **Longer Generations**: LTX-2 supports longer, continuous clips with synchronized audio up to 10 seconds.
@@ -143,7 +143,7 @@ For more details, please see our [blog post](https://website.ltx.video/blog/intr
* Does not require classifier-free guidance and spatio-temporal guidance.
* Supports sampling with 8 (recommended), or less diffusion steps.
- Improved prompt adherence, motion quality and fine details.
-- New default resolution and FPS: 1216 ร 704 pixels at 30 FPS
+- New default resolution and FPS: 1216 ่ณ 704 pixels at 30 FPS
* Still real time on H100 with the distilled model.
* Other resolutions and FPS are still supported.
- Support stochastic inference (can improve visual quality when using the distilled model)
@@ -195,7 +195,7 @@ ltxv-2b-0.9.8-distilled | Smaller model, slight quality reduction compare
| ltxv-13b-0.9.8-distilled-fp8 | Quantized version of ltxv-13b-distilled | [ltxv-13b-0.9.8-distilled-fp8.yaml](https://github.com/Lightricks/LTX-Video/blob/main/configs/ltxv-13b-0.9.8-distilled-fp8.yaml) | [ltxv-13b-dist-i2v-base-fp8.json](https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/13b-distilled/ltxv-13b-dist-i2v-base-fp8.json) |
| ltxv-2b-0.9.8-distilled-fp8 | Quantized version of ltxv-2b-distilled | [ltxv-2b-0.9.8-distilled-fp8.yaml](https://github.com/Lightricks/LTX-Video/blob/main/configs/ltxv-2b-0.9.8-distilled-fp8.yaml) | N/A |
| ltxv-2b-0.9.6 | Good quality, lower VRAM requirement than ltxv-13b | [ltxv-2b-0.9.6-dev.yaml](https://github.com/Lightricks/LTX-Video/blob/main/configs/ltxv-2b-0.9.6-dev.yaml) | [ltxvideo-i2v.json](https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/low_level/ltxvideo-i2v.json) |
-| ltxv-2b-0.9.6-distilled | 15ร faster, real-time capable, fewer steps needed, no STG/CFG required | [ltxv-2b-0.9.6-distilled.yaml](https://github.com/Lightricks/LTX-Video/blob/main/configs/ltxv-2b-0.9.6-distilled.yaml) | [ltxvideo-i2v-distilled.json](https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/low_level/ltxvideo-i2v-distilled.json) |
+| ltxv-2b-0.9.6-distilled | 15่ณ faster, real-time capable, fewer steps needed, no STG/CFG required | [ltxv-2b-0.9.6-distilled.yaml](https://github.com/Lightricks/LTX-Video/blob/main/configs/ltxv-2b-0.9.6-distilled.yaml) | [ltxvideo-i2v-distilled.json](https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/low_level/ltxvideo-i2v-distilled.json) |
# Quick Start Guide
@@ -224,13 +224,25 @@ source env/bin/activate
python -m pip install -e .\[inference\]
```
+#### Model access
+
+`inference.py` downloads model weights from the Hugging Face Hub on first run. Before running local inference:
+
+1. Visit the [LTX-Video model page](https://huggingface.co/Lightricks/LTX-Video) and accept the model terms if prompted.
+2. Authenticate locally with the Hugging Face CLI so `hf_hub_download()` can access the weights:
+
+```bash
+pip install -U "huggingface_hub[cli]"
+hf auth login
+```
+
#### FP8 Kernels (optional)
[FP8 kernels](https://github.com/Lightricks/LTXVideo-Q8-Kernels) developed for LTX-Video provide performance boost on supported graphics cards (Ada architecture and later). To install FP8 kernels, follow the instructions in that repository.
### Inference
-๐ **Note:** For best results, we recommend using our [ComfyUI](#comfyui-integration) workflow. We're working on updating the inference.py script to match the high quality and output fidelity of ComfyUI.
+้ฆๆ **Note:** For best results, we recommend using our [ComfyUI](#comfyui-integration) workflow. We're working on updating the inference.py script to match the high quality and output fidelity of ComfyUI.
To use our model, please follow the inference code in [inference.py](./inference.py):
@@ -242,7 +254,7 @@ python inference.py --prompt "PROMPT" --conditioning_media_paths IMAGE_PATH --co
#### Extending a video:
-๐ **Note:** Input video segments must contain a multiple of 8 frames plus 1 (e.g., 9, 17, 25, etc.), and the target frame number should be a multiple of 8.
+้ฆๆ **Note:** Input video segments must contain a multiple of 8 frames plus 1 (e.g., 9, 17, 25, etc.), and the target frame number should be a multiple of 8.
```bash
@@ -285,7 +297,7 @@ Diffusers also support an 8-bit version of LTX-Video, [see details below](#ltx-v
# Model User Guide
-## ๐ Prompt Engineering
+## ้ฆๆ Prompt Engineering
When writing prompts, focus on detailed, chronological descriptions of actions and scenes. Include specific movements, appearances, camera angles, and environmental details - all in a single flowing paragraph. Start directly with the action, and keep descriptions literal and precise. Think like a cinematographer describing a shot list. Keep within 200 words. For best results, build your prompts using this structure:
@@ -302,53 +314,52 @@ When writing prompts, focus on detailed, chronological descriptions of actions a
When using `LTXVideoPipeline` directly, you can enable prompt enhancement by setting `enhance_prompt=True`.
-## ๐ฎ Parameter Guide
+## ้ฆๅน Parameter Guide
* Resolution Preset: Higher resolutions for detailed scenes, lower for faster generation and simpler scenes. The model works on resolutions that are divisible by 32 and number of frames that are divisible by 8 + 1 (e.g. 257). In case the resolution or number of frames are not divisible by 32 or 8 + 1, the input will be padded with -1 and then cropped to the desired resolution and number of frames. The model works best on resolutions under 720 x 1280 and number of frames below 257
* Seed: Save seed values to recreate specific styles or compositions you like
* Guidance Scale: 3-3.5 are the recommended values
* Inference Steps: More steps (40+) for quality, fewer steps (20-30) for speed
-๐ For advanced parameters usage, please see `python inference.py --help`
+้ฆๆ For advanced parameters usage, please see `python inference.py --help`
## Community Contribution
-### ComfyUI-LTXTricks ๐ ๏ธ
-
+### ComfyUI-LTXTricks ้ฆๆด้?
A community project providing additional nodes for enhanced control over the LTX Video model. It includes implementations of advanced techniques like RF-Inversion, RF-Edit, FlowEdit, and more. These nodes enable workflows such as Image and Video to Video (I+V2V), enhanced sampling via Spatiotemporal Skip Guidance (STG), and interpolation with precise frame settings.
- **Repository:** [ComfyUI-LTXTricks](https://github.com/logtd/ComfyUI-LTXTricks)
- **Features:**
- - ๐ **RF-Inversion:** Implements [RF-Inversion](https://rf-inversion.github.io/) with an [example workflow here](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_inversion.json).
- - โ๏ธ **RF-Edit:** Implements [RF-Solver-Edit](https://github.com/wangjiangshan0725/RF-Solver-Edit) with an [example workflow here](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_rf_edit.json).
- - ๐ **FlowEdit:** Implements [FlowEdit](https://github.com/fallenshock/FlowEdit) with an [example workflow here](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_flow_edit.json).
- - ๐ฅ **I+V2V:** Enables Video to Video with a reference image. [Example workflow](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_iv2v.json).
- - โจ **Enhance:** Partial implementation of [STGuidance](https://junhahyung.github.io/STGuidance/). [Example workflow](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltxv_stg.json).
- - ๐ผ๏ธ **Interpolation and Frame Setting:** Nodes for precise control of latents per frame. [Example workflow](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_interpolation.json).
+ - ้ฆๆง **RF-Inversion:** Implements [RF-Inversion](https://rf-inversion.github.io/) with an [example workflow here](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_inversion.json).
+ - ้ๅฆ็ฌ **RF-Edit:** Implements [RF-Solver-Edit](https://github.com/wangjiangshan0725/RF-Solver-Edit) with an [example workflow here](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_rf_edit.json).
+ - ้ฆๅฏ **FlowEdit:** Implements [FlowEdit](https://github.com/fallenshock/FlowEdit) with an [example workflow here](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_flow_edit.json).
+ - ้ฆๅธด **I+V2V:** Enables Video to Video with a reference image. [Example workflow](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_iv2v.json).
+ - ้?**Enhance:** Partial implementation of [STGuidance](https://junhahyung.github.io/STGuidance/). [Example workflow](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltxv_stg.json).
+ - ้ฆๆค้?**Interpolation and Frame Setting:** Nodes for precise control of latents per frame. [Example workflow](https://github.com/logtd/ComfyUI-LTXTricks/blob/main/example_workflows/example_ltx_interpolation.json).
-### LTX-VideoQ8 ๐ฑ
+### LTX-VideoQ8 ้ฆๅน
**LTX-VideoQ8** is an 8-bit optimized version of [LTX-Video](https://github.com/Lightricks/LTX-Video), designed for faster performance on NVIDIA ADA GPUs.
- **Repository:** [LTX-VideoQ8](https://github.com/KONAKONA666/LTX-Video)
- **Features:**
- - ๐ Up to 3X speed-up with no accuracy loss
- - ๐ฅ Generate 720x480x121 videos in under a minute on RTX 4060 (8GB VRAM)
- - ๐ ๏ธ Fine-tune 2B transformer models with precalculated latents
+ - ้ฆๆฎ Up to 3X speed-up with no accuracy loss
+ - ้ฆๅธด Generate 720x480x121 videos in under a minute on RTX 4060 (8GB VRAM)
+ - ้ฆๆด้?Fine-tune 2B transformer models with precalculated latents
- **Community Discussion:** [Reddit Thread](https://www.reddit.com/r/StableDiffusion/comments/1h79ks2/fast_ltx_video_on_rtx_4060_and_other_ada_gpus/)
- **Diffusers integration:** A diffusers integration for the 8-bit model is already out! [Details here](https://github.com/sayakpaul/q8-ltx-video)
-### TeaCache for LTX-Video ๐ต
+### TeaCache for LTX-Video ้ฆๅต
**TeaCache** is a training-free caching approach that leverages timestep differences across model outputs to accelerate LTX-Video inference by up to 2x without significant visual quality degradation.
- **Repository:** [TeaCache4LTX-Video](https://github.com/ali-vilab/TeaCache/tree/main/TeaCache4LTX-Video)
- **Features:**
- - ๐ Speeds up LTX-Video inference.
- - ๐ Adjustable trade-offs between speed (up to 2x) and visual quality using configurable parameters.
- - ๐ ๏ธ No retraining required: Works directly with existing models.
+ - ้ฆๆฎ Speeds up LTX-Video inference.
+ - ้ฆๆณ Adjustable trade-offs between speed (up to 2x) and visual quality using configurable parameters.
+ - ้ฆๆด้?No retraining required: Works directly with existing models.
### Your Contribution
@@ -392,7 +403,7 @@ We are grateful for the following awesome projects when implementing LTX-Video:
## Citation
-๐ Our tech report is out! If you find our work helpful, please โญ๏ธ star the repository and cite our paper.
+้ฆๆซ Our tech report is out! If you find our work helpful, please ็ธๆถ็ฌ star the repository and cite our paper.
```
@article{HaCohen2024LTXVideo,