Wan2.2

什么是 Wan2.2？

Wan2.2 是相当于 Wan-2.1 的正统后继的视频生成模型家族。大体由 2 个模型构成。

14B：切换 high_noise / low_noise 2 个模型的二段构成
5B：在单一模型处理 text2video 和 image2video 的 TI2V 模型＋Wan2.2 VAE

Wan2.2 14B 模型

Wan2.2-A14B 是前半段采样由 high_noise 模型，后半段由 low_noise 模型担当的二段管线。

通过分为 2 个模型，如果不增加 VRAM 的使用量，将模型的大小加倍，谋求性能提升。

模型的下载

diffusion_models
text_encoders
- umt5_xxl_fp8_e4m3fn_scaled.safetensors（与 Wan2.1 通用）
vae
- wan_2.1_vae.safetensors（与 Wan2.1 通用）
gguf（任意）
- Wan2.2-T2V-A14B-GGUF
- Wan2.2-I2V-A14B-GGUF

📂ComfyUI/
└── 📂models/
    ├── 📂diffusion_models/
    │   ├── wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors
    │   ├── wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors
    │   ├── wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors
    │   └── wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors
    ├── 📂text_encoders/
    │   └── umt5_xxl_fp8_e4m3fn_scaled.safetensors
    ├── 📂unet/
    │   ├── wan2.2_t2v_high_noise_14B-XXXX.gguf    ← 仅在使用 gguf 时
    │   ├── wan2.2_t2v_low_noise_14B-XXXX.gguf     ← 仅在使用 gguf 时
    │   ├── wan2.2_i2v_high_noise_14B-XXXX.gguf    ← 仅在使用 gguf 时
    │   └── wan2.2_i2v_low_noise_14B-XXXX.gguf     ← 仅在使用 gguf 时
    └── 📂vae/
        └── wan_2.1_vae.safetensors

text2video（14B）

使用 KSampler Advanced，前半用 high_noise，后半用 low_noise 模型处理。

Wan2.2_text2video_14B.json

{
  "id": "d8034549-7e0a-40f1-8c2e-de3ffc6f1cae",
  "revision": 0,
  "last_node_id": 159,
  "last_link_id": 327,
  "nodes": [
    {
      "id": 38,
      "type": "CLIPLoader",
      "pos": [
        56.288665771484375,
        312.74468994140625
      ],
      "size": [
        301.3524169921875,
        106
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "CLIP",
          "type": "CLIP",
          "slot_index": 0,
          "links": [
            74,
            75
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPLoader"
      },
      "widgets_values": [
        "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
        "wan",
        "default"
      ],
      "color": "#432",
      "bgcolor": "#653"
    },
    {
      "id": 8,
      "type": "VAEDecode",
      "pos": [
        1613.413330078125,
        176.01361083984375
      ],
      "size": [
        157.56002807617188,
        46
      ],
      "flags": {},
      "order": 13,
      "mode": 0,
      "inputs": [
        {
          "name": "samples",
          "type": "LATENT",
          "link": 317
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 76
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "slot_index": 0,
          "links": [
            256
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAEDecode"
      },
      "widgets_values": []
    },
    {
      "id": 7,
      "type": "CLIPTextEncode",
      "pos": [
        417.8738708496094,
        389
      ],
      "size": [
        419.3189392089844,
        138.8924560546875
      ],
      "flags": {},
      "order": 8,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 75
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            319,
            321
          ]
        }
      ],
      "title": "CLIP Text Encode (Negative Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走 "
      ]
    },
    {
      "id": 48,
      "type": "ModelSamplingSD3",
      "pos": [
        627.1928100585938,
        49.284149169921875
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 9,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 134
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            322
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 157,
      "type": "ModelSamplingSD3",
      "pos": [
        1005.2630004882812,
        -109.75807189941406
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 10,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 323
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            324
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 73,
      "type": "UNETLoader",
      "pos": [
        329.4610595703125,
        46.62214660644531
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            134
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 156,
      "type": "UNETLoader",
      "pos": [
        705.2611694335938,
        -109.75807189941406
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            323
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 151,
      "type": "MarkdownNote",
      "pos": [
        -113.67849731445312,
        -40.50404357910156
      ],
      "size": [
        405.56439208984375,
        254.14065551757812
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [],
      "outputs": [],
      "properties": {},
      "widgets_values": [
        "## models\n- [wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors)\n- [wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors)\n\n- [umt5_xxl.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files/text_encoders)\n- [wan_2.1_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors)\n\n\n```\n📂ComfyUI/\n└──📂models/\n    ├── 📂diffusion_models/\n    │   ├── wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors\n    │   └── wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors\n    ├── 📂text_encoders/\n    │   └── umt5_xxl (fp16 or fp8).safetensors\n    └── 📂vae/\n         └── wan_2.1_vae.safetensors\n\n```"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 113,
      "type": "VHS_VideoCombine",
      "pos": [
        1805.07373046875,
        176.01361083984375
      ],
      "size": [
        372.2688903808594,
        334
      ],
      "flags": {},
      "order": 14,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 256
        },
        {
          "name": "audio",
          "shape": 7,
          "type": "AUDIO",
          "link": null
        },
        {
          "name": "meta_batch",
          "shape": 7,
          "type": "VHS_BatchManager",
          "link": null
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "Filenames",
          "type": "VHS_FILENAMES",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfyui-videohelpersuite",
        "ver": "a7ce59e381934733bfae03b1be029756d6ce936d",
        "Node name for S&R": "VHS_VideoCombine"
      },
      "widgets_values": {
        "frame_rate": 24,
        "loop_count": 0,
        "filename_prefix": "Wan2.2",
        "format": "video/h264-mp4",
        "pix_fmt": "yuv420p",
        "crf": 19,
        "save_metadata": true,
        "trim_to_audio": false,
        "pingpong": false,
        "save_output": true,
        "videopreview": {
          "hidden": false,
          "paused": false,
          "params": {
            "filename": "Wan2.2_00090.mp4",
            "subfolder": "",
            "type": "output",
            "format": "video/h264-mp4",
            "frame_rate": 24,
            "workflow": "Wan2.2_00090.png",
            "fullpath": "D:\\AI\\ComfyUI_windows_portable\\ComfyUI\\output\\Wan2.2_00090.mp4"
          }
        }
      }
    },
    {
      "id": 153,
      "type": "EmptyHunyuanLatentVideo",
      "pos": [
        567.0985107421875,
        585.5717163085938
      ],
      "size": [
        270.0943298339844,
        130
      ],
      "flags": {},
      "order": 4,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            327
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "EmptyHunyuanLatentVideo"
      },
      "widgets_values": [
        848,
        480,
        81,
        1
      ]
    },
    {
      "id": 158,
      "type": "PrimitiveNode",
      "pos": [
        627.1928100585938,
        772.84326171875
      ],
      "size": [
        210,
        82
      ],
      "flags": {},
      "order": 5,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "INT",
          "type": "INT",
          "widget": {
            "name": "start_at_step"
          },
          "links": [
            325,
            326
          ]
        }
      ],
      "title": "start_at_step",
      "properties": {
        "Run widget replace on values": false
      },
      "widgets_values": [
        4,
        "fixed"
      ],
      "color": "#232",
      "bgcolor": "#353"
    },
    {
      "id": 6,
      "type": "CLIPTextEncode",
      "pos": [
        417.9232177734375,
        186
      ],
      "size": [
        419.26959228515625,
        148.8194122314453
      ],
      "flags": {},
      "order": 7,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 74
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            318,
            320
          ]
        }
      ],
      "title": "CLIP Text Encode (Positive Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "A high-quality, ultra-detailed image of a kingfisher diving into a crystal-clear pool of water, captured at the precise moment its beak touches the surface. The bird’s vibrant blue and orange feathers are sharply defined, with water droplets suspended in the air around it. The background features a softly blurred natural riverside setting, with lush green foliage and gentle sunlight filtering through the trees, creating a serene and dynamic scene."
      ]
    },
    {
      "id": 154,
      "type": "KSamplerAdvanced",
      "pos": [
        925.6195068359375,
        171.68209838867188
      ],
      "size": [
        304.748046875,
        334
      ],
      "flags": {},
      "order": 11,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 322
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 318
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 319
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 327
        },
        {
          "name": "end_at_step",
          "type": "INT",
          "widget": {
            "name": "end_at_step"
          },
          "link": 326
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            316
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSamplerAdvanced"
      },
      "widgets_values": [
        "enable",
        12345,
        "fixed",
        20,
        4,
        "euler",
        "simple",
        0,
        4,
        "enable"
      ]
    },
    {
      "id": 155,
      "type": "KSamplerAdvanced",
      "pos": [
        1274.5648193359375,
        176.01361083984375
      ],
      "size": [
        304.748046875,
        334
      ],
      "flags": {},
      "order": 12,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 324
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 320
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 321
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 316
        },
        {
          "name": "start_at_step",
          "type": "INT",
          "widget": {
            "name": "start_at_step"
          },
          "link": 325
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            317
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSamplerAdvanced"
      },
      "widgets_values": [
        "disable",
        0,
        "fixed",
        20,
        3,
        "euler",
        "simple",
        4,
        9999,
        "disable"
      ]
    },
    {
      "id": 39,
      "type": "VAELoader",
      "pos": [
        1314.0928344726562,
        49.20225813953267
      ],
      "size": [
        265.22003173828125,
        58
      ],
      "flags": {},
      "order": 6,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "VAE",
          "type": "VAE",
          "slot_index": 0,
          "links": [
            76
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAELoader"
      },
      "widgets_values": [
        "wan_2.1_vae.safetensors"
      ],
      "color": "#322",
      "bgcolor": "#533"
    }
  ],
  "links": [
    [
      74,
      38,
      0,
      6,
      0,
      "CLIP"
    ],
    [
      75,
      38,
      0,
      7,
      0,
      "CLIP"
    ],
    [
      76,
      39,
      0,
      8,
      1,
      "VAE"
    ],
    [
      134,
      73,
      0,
      48,
      0,
      "MODEL"
    ],
    [
      256,
      8,
      0,
      113,
      0,
      "IMAGE"
    ],
    [
      316,
      154,
      0,
      155,
      3,
      "LATENT"
    ],
    [
      317,
      155,
      0,
      8,
      0,
      "LATENT"
    ],
    [
      318,
      6,
      0,
      154,
      1,
      "CONDITIONING"
    ],
    [
      319,
      7,
      0,
      154,
      2,
      "CONDITIONING"
    ],
    [
      320,
      6,
      0,
      155,
      1,
      "CONDITIONING"
    ],
    [
      321,
      7,
      0,
      155,
      2,
      "CONDITIONING"
    ],
    [
      322,
      48,
      0,
      154,
      0,
      "MODEL"
    ],
    [
      323,
      156,
      0,
      157,
      0,
      "MODEL"
    ],
    [
      324,
      157,
      0,
      155,
      0,
      "MODEL"
    ],
    [
      325,
      158,
      0,
      155,
      4,
      "INT"
    ],
    [
      326,
      158,
      0,
      154,
      4,
      "INT"
    ],
    [
      327,
      153,
      0,
      154,
      3,
      "LATENT"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.7513148009015777,
      "offset": [
        213.67849731445312,
        209.75807189941406
      ]
    },
    "frontendVersion": "1.35.0",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

🟩 指定全体 20 steps 中，在第几 step 切换 high_noise → low_noise
- 关于这个切换的时机，规定推荐 “非噪声部分”和“噪声部分”的比例 为 1 : 1 时。
- 虽然可以计算这个，但 Sampler / Scheduler / sigma_shift / step 数相互纠缠很难。
- 而且，也没理由说完全对上的就一定是最佳。
- 在这个工作流中，在 4 steps 切换，首先请以此为基准尝试各种设定。
🟨🟥 文本编码器和 VAE 与 Wan2.1 相同。

image2video（14B）

Wan2.2_image2video_14B.json

{
  "id": "d8034549-7e0a-40f1-8c2e-de3ffc6f1cae",
  "revision": 0,
  "last_node_id": 164,
  "last_link_id": 351,
  "nodes": [
    {
      "id": 38,
      "type": "CLIPLoader",
      "pos": [
        35.79126739501953,
        314.20880126953125
      ],
      "size": [
        301.3524169921875,
        106
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "CLIP",
          "type": "CLIP",
          "slot_index": 0,
          "links": [
            74,
            75
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPLoader"
      },
      "widgets_values": [
        "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
        "wan",
        "default"
      ],
      "color": "#432",
      "bgcolor": "#653"
    },
    {
      "id": 8,
      "type": "VAEDecode",
      "pos": [
        1846.9432373046875,
        185.6936798095703
      ],
      "size": [
        157.56002807617188,
        46
      ],
      "flags": {},
      "order": 16,
      "mode": 0,
      "inputs": [
        {
          "name": "samples",
          "type": "LATENT",
          "link": 317
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 76
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "slot_index": 0,
          "links": [
            256
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAEDecode"
      },
      "widgets_values": []
    },
    {
      "id": 160,
      "type": "LoadImage",
      "pos": [
        46.66357421875,
        723.9078369140625
      ],
      "size": [
        334.93658447265625,
        388.3801574707031
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            330
          ]
        },
        {
          "name": "MASK",
          "type": "MASK",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "LoadImage"
      },
      "widgets_values": [
        "Clipboard - 2025-05-13 21.27.11.png",
        "image"
      ]
    },
    {
      "id": 73,
      "type": "UNETLoader",
      "pos": [
        624.2608642578125,
        44.42214584350586
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            134
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 7,
      "type": "CLIPTextEncode",
      "pos": [
        397.4257507324219,
        390.464111328125
      ],
      "size": [
        419.3189392089844,
        138.8924560546875
      ],
      "flags": {},
      "order": 8,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 75
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            341
          ]
        }
      ],
      "title": "CLIP Text Encode (Negative Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走 "
      ]
    },
    {
      "id": 48,
      "type": "ModelSamplingSD3",
      "pos": [
        921.9925537109375,
        47.08414840698242
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 10,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 134
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            322
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 154,
      "type": "KSamplerAdvanced",
      "pos": [
        1170.0396728515625,
        186.20217895507812
      ],
      "size": [
        304.748046875,
        334
      ],
      "flags": {},
      "order": 14,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 322
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 342
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 343
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 346
        },
        {
          "name": "end_at_step",
          "type": "INT",
          "widget": {
            "name": "end_at_step"
          },
          "link": 326
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            316
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSamplerAdvanced"
      },
      "widgets_values": [
        "enable",
        1234,
        "fixed",
        20,
        4,
        "euler",
        "simple",
        0,
        4,
        "enable"
      ]
    },
    {
      "id": 162,
      "type": "GetImageSize",
      "pos": [
        676.7446899414062,
        723.2024536132812
      ],
      "size": [
        140,
        124
      ],
      "flags": {},
      "order": 12,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 348
        }
      ],
      "outputs": [
        {
          "name": "width",
          "type": "INT",
          "links": [
            350
          ]
        },
        {
          "name": "height",
          "type": "INT",
          "links": [
            351
          ]
        },
        {
          "name": "batch_size",
          "type": "INT",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "GetImageSize"
      },
      "widgets_values": [
        "width: 724, height: 724\n batch size: 1"
      ]
    },
    {
      "id": 6,
      "type": "CLIPTextEncode",
      "pos": [
        397.47509765625,
        187.46409606933594
      ],
      "size": [
        419.26959228515625,
        148.8194122314453
      ],
      "flags": {},
      "order": 7,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 74
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            340
          ]
        }
      ],
      "title": "CLIP Text Encode (Positive Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "Photorealistic style. A young girl with blonde braided hair, wearing a white dress, stands in a lush green garden filled with warm afternoon sunlight. She carefully holds a glass jar with both hands, watching a small goldfish swim gracefully inside with a gentle smile. The camera starts with a medium shot focused on the girl and the jar, then slowly and smoothly zooms in for a close-up of the goldfish. The overall atmosphere is soft, calm, and heartwarming."
      ]
    },
    {
      "id": 39,
      "type": "VAELoader",
      "pos": [
        551.524658203125,
        597.2347412109375
      ],
      "size": [
        265.22003173828125,
        58
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "VAE",
          "type": "VAE",
          "slot_index": 0,
          "links": [
            76,
            347
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAELoader"
      },
      "widgets_values": [
        "wan_2.1_vae.safetensors"
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 161,
      "type": "ImageScaleToTotalPixels",
      "pos": [
        405.4178161621094,
        723.9078369140625
      ],
      "size": [
        239.38699340820312,
        82
      ],
      "flags": {},
      "order": 9,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 330
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            348,
            349
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "ImageScaleToTotalPixels"
      },
      "widgets_values": [
        "nearest-exact",
        0.5
      ]
    },
    {
      "id": 151,
      "type": "MarkdownNote",
      "pos": [
        -113.67849731445312,
        -40.50404357910156
      ],
      "size": [
        405.56439208984375,
        254.14065551757812
      ],
      "flags": {},
      "order": 4,
      "mode": 0,
      "inputs": [],
      "outputs": [],
      "properties": {},
      "widgets_values": [
        "## models\n- [wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors)\n- [wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors)\n\n- [umt5_xxl.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files/text_encoders)\n- [wan_2.1_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors)\n\n\n```\n📂ComfyUI/\n└──📂models/\n    ├── 📂diffusion_models/\n    │   ├── wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors\n    │   └── wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors\n    ├── 📂text_encoders/\n    │   └── umt5_xxl (fp16 or fp8).safetensors\n    └── 📂vae/\n         └── wan_2.1_vae.safetensors\n\n```"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 157,
      "type": "ModelSamplingSD3",
      "pos": [
        1264.7877197265625,
        -104.88626098632812
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 11,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 323
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            324
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 156,
      "type": "UNETLoader",
      "pos": [
        959.7449340820312,
        -104.88626098632812
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 5,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            323
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 113,
      "type": "VHS_VideoCombine",
      "pos": [
        2038.6036376953125,
        185.6936798095703
      ],
      "size": [
        372.2688903808594,
        700.2689208984375
      ],
      "flags": {},
      "order": 17,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 256
        },
        {
          "name": "audio",
          "shape": 7,
          "type": "AUDIO",
          "link": null
        },
        {
          "name": "meta_batch",
          "shape": 7,
          "type": "VHS_BatchManager",
          "link": null
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "Filenames",
          "type": "VHS_FILENAMES",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfyui-videohelpersuite",
        "ver": "a7ce59e381934733bfae03b1be029756d6ce936d",
        "Node name for S&R": "VHS_VideoCombine"
      },
      "widgets_values": {
        "frame_rate": 24,
        "loop_count": 0,
        "filename_prefix": "Wan2.2",
        "format": "video/h264-mp4",
        "pix_fmt": "yuv420p",
        "crf": 19,
        "save_metadata": true,
        "trim_to_audio": false,
        "pingpong": false,
        "save_output": true,
        "videopreview": {
          "hidden": false,
          "paused": false,
          "params": {
            "filename": "Wan2.2_00091.mp4",
            "subfolder": "",
            "type": "output",
            "format": "video/h264-mp4",
            "frame_rate": 24,
            "workflow": "Wan2.2_00091.png",
            "fullpath": "D:\\AI\\ComfyUI_windows_portable\\ComfyUI\\output\\Wan2.2_00091.mp4"
          }
        }
      }
    },
    {
      "id": 158,
      "type": "PrimitiveNode",
      "pos": [
        920.8279418945312,
        588.53564453125
      ],
      "size": [
        210,
        82
      ],
      "flags": {},
      "order": 6,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "INT",
          "type": "INT",
          "widget": {
            "name": "start_at_step"
          },
          "links": [
            325,
            326
          ]
        }
      ],
      "title": "start_at_step",
      "properties": {
        "Run widget replace on values": false
      },
      "widgets_values": [
        4,
        "fixed"
      ],
      "color": "#232",
      "bgcolor": "#353"
    },
    {
      "id": 164,
      "type": "WanImageToVideo",
      "pos": [
        870.6785278320312,
        204.1481475830078
      ],
      "size": [
        270,
        210
      ],
      "flags": {},
      "order": 13,
      "mode": 0,
      "inputs": [
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 340
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 341
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 347
        },
        {
          "name": "clip_vision_output",
          "shape": 7,
          "type": "CLIP_VISION_OUTPUT",
          "link": null
        },
        {
          "name": "start_image",
          "shape": 7,
          "type": "IMAGE",
          "link": 349
        },
        {
          "name": "width",
          "type": "INT",
          "widget": {
            "name": "width"
          },
          "link": 350
        },
        {
          "name": "height",
          "type": "INT",
          "widget": {
            "name": "height"
          },
          "link": 351
        }
      ],
      "outputs": [
        {
          "name": "positive",
          "type": "CONDITIONING",
          "links": [
            342,
            344
          ]
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "links": [
            343,
            345
          ]
        },
        {
          "name": "latent",
          "type": "LATENT",
          "links": [
            346
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "WanImageToVideo"
      },
      "widgets_values": [
        832,
        480,
        81,
        1
      ],
      "color": "#2a363b",
      "bgcolor": "#3f5159"
    },
    {
      "id": 155,
      "type": "KSamplerAdvanced",
      "pos": [
        1510.51513671875,
        185.6936798095703
      ],
      "size": [
        304.748046875,
        334
      ],
      "flags": {},
      "order": 15,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 324
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 344
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 345
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 316
        },
        {
          "name": "start_at_step",
          "type": "INT",
          "widget": {
            "name": "start_at_step"
          },
          "link": 325
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            317
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSamplerAdvanced"
      },
      "widgets_values": [
        "disable",
        0,
        "fixed",
        20,
        3,
        "euler",
        "simple",
        4,
        9999,
        "disable"
      ]
    }
  ],
  "links": [
    [
      74,
      38,
      0,
      6,
      0,
      "CLIP"
    ],
    [
      75,
      38,
      0,
      7,
      0,
      "CLIP"
    ],
    [
      76,
      39,
      0,
      8,
      1,
      "VAE"
    ],
    [
      134,
      73,
      0,
      48,
      0,
      "MODEL"
    ],
    [
      256,
      8,
      0,
      113,
      0,
      "IMAGE"
    ],
    [
      316,
      154,
      0,
      155,
      3,
      "LATENT"
    ],
    [
      317,
      155,
      0,
      8,
      0,
      "LATENT"
    ],
    [
      322,
      48,
      0,
      154,
      0,
      "MODEL"
    ],
    [
      323,
      156,
      0,
      157,
      0,
      "MODEL"
    ],
    [
      324,
      157,
      0,
      155,
      0,
      "MODEL"
    ],
    [
      325,
      158,
      0,
      155,
      4,
      "INT"
    ],
    [
      326,
      158,
      0,
      154,
      4,
      "INT"
    ],
    [
      330,
      160,
      0,
      161,
      0,
      "IMAGE"
    ],
    [
      340,
      6,
      0,
      164,
      0,
      "CONDITIONING"
    ],
    [
      341,
      7,
      0,
      164,
      1,
      "CONDITIONING"
    ],
    [
      342,
      164,
      0,
      154,
      1,
      "CONDITIONING"
    ],
    [
      343,
      164,
      1,
      154,
      2,
      "CONDITIONING"
    ],
    [
      344,
      164,
      0,
      155,
      1,
      "CONDITIONING"
    ],
    [
      345,
      164,
      1,
      155,
      2,
      "CONDITIONING"
    ],
    [
      346,
      164,
      2,
      154,
      3,
      "LATENT"
    ],
    [
      347,
      39,
      0,
      164,
      2,
      "VAE"
    ],
    [
      348,
      161,
      0,
      162,
      0,
      "IMAGE"
    ],
    [
      349,
      161,
      0,
      164,
      4,
      "IMAGE"
    ],
    [
      350,
      162,
      0,
      164,
      5,
      "INT"
    ],
    [
      351,
      162,
      1,
      164,
      6,
      "INT"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.6209213230591554,
      "offset": [
        213.67849731445312,
        204.88626098632812
      ]
    },
    "frontendVersion": "1.28.0",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

🟦 初始图像输入到 WanImageToVideo 系节点。
Wan2.2 与 Wan2.1 时不同，不使用 clip_vision。

FLF2V（14B / First–Last Frame to Video）

Wan2.1 中 FLF2V 有专用模型，但 Wan2.2 的 image2video 模型也对应 FLF2V。

在 ComfyUI 中，只要向 WanFirstLastFrameToVideo 节点输入 Start / End 的 2 张图像，就能生成补充了 2 张之间的视频。

Wan2.2_image2video_FLF2V_14B.json

{
  "id": "d8034549-7e0a-40f1-8c2e-de3ffc6f1cae",
  "revision": 0,
  "last_node_id": 171,
  "last_link_id": 372,
  "nodes": [
    {
      "id": 38,
      "type": "CLIPLoader",
      "pos": [
        35.79126739501953,
        314.20880126953125
      ],
      "size": [
        301.3524169921875,
        106
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "CLIP",
          "type": "CLIP",
          "slot_index": 0,
          "links": [
            74,
            75
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPLoader"
      },
      "widgets_values": [
        "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
        "wan",
        "default"
      ],
      "color": "#432",
      "bgcolor": "#653"
    },
    {
      "id": 8,
      "type": "VAEDecode",
      "pos": [
        1846.9432373046875,
        185.6936798095703
      ],
      "size": [
        157.56002807617188,
        46
      ],
      "flags": {},
      "order": 20,
      "mode": 0,
      "inputs": [
        {
          "name": "samples",
          "type": "LATENT",
          "link": 317
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 76
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "slot_index": 0,
          "links": [
            256
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAEDecode"
      },
      "widgets_values": []
    },
    {
      "id": 73,
      "type": "UNETLoader",
      "pos": [
        624.2608642578125,
        44.42214584350586
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            134
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 48,
      "type": "ModelSamplingSD3",
      "pos": [
        921.9925537109375,
        47.08414840698242
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 10,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 134
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            322
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 151,
      "type": "MarkdownNote",
      "pos": [
        -113.67849731445312,
        -40.50404357910156
      ],
      "size": [
        405.56439208984375,
        254.14065551757812
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [],
      "outputs": [],
      "properties": {},
      "widgets_values": [
        "## models\n- [wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors)\n- [wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors)\n\n- [umt5_xxl.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files/text_encoders)\n- [wan_2.1_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors)\n\n\n```\n📂ComfyUI/\n└──📂models/\n    ├── 📂diffusion_models/\n    │   ├── wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors\n    │   └── wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors\n    ├── 📂text_encoders/\n    │   └── umt5_xxl (fp16 or fp8).safetensors\n    └── 📂vae/\n         └── wan_2.1_vae.safetensors\n\n```"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 157,
      "type": "ModelSamplingSD3",
      "pos": [
        1264.7877197265625,
        -104.88626098632812
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 11,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 323
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            324
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 156,
      "type": "UNETLoader",
      "pos": [
        959.7449340820312,
        -104.88626098632812
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            323
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 113,
      "type": "VHS_VideoCombine",
      "pos": [
        2038.6036376953125,
        185.6936798095703
      ],
      "size": [
        372.2688903808594,
        855.2672119140625
      ],
      "flags": {},
      "order": 21,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 256
        },
        {
          "name": "audio",
          "shape": 7,
          "type": "AUDIO",
          "link": null
        },
        {
          "name": "meta_batch",
          "shape": 7,
          "type": "VHS_BatchManager",
          "link": null
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "Filenames",
          "type": "VHS_FILENAMES",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfyui-videohelpersuite",
        "ver": "a7ce59e381934733bfae03b1be029756d6ce936d",
        "Node name for S&R": "VHS_VideoCombine"
      },
      "widgets_values": {
        "frame_rate": 24,
        "loop_count": 0,
        "filename_prefix": "Wan2.2",
        "format": "video/h264-mp4",
        "pix_fmt": "yuv420p",
        "crf": 19,
        "save_metadata": true,
        "trim_to_audio": false,
        "pingpong": false,
        "save_output": true,
        "videopreview": {
          "hidden": false,
          "paused": false,
          "params": {
            "filename": "Wan2.2_00093.mp4",
            "subfolder": "",
            "type": "output",
            "format": "video/h264-mp4",
            "frame_rate": 24,
            "workflow": "Wan2.2_00093.png",
            "fullpath": "D:\\AI\\ComfyUI_windows_portable\\ComfyUI\\output\\Wan2.2_00093.mp4"
          }
        }
      }
    },
    {
      "id": 7,
      "type": "CLIPTextEncode",
      "pos": [
        397.4257507324219,
        390.464111328125
      ],
      "size": [
        419.3189392089844,
        138.8924560546875
      ],
      "flags": {},
      "order": 9,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 75
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            353
          ]
        }
      ],
      "title": "CLIP Text Encode (Negative Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走 "
      ]
    },
    {
      "id": 39,
      "type": "VAELoader",
      "pos": [
        551.524658203125,
        597.2347412109375
      ],
      "size": [
        265.22003173828125,
        58
      ],
      "flags": {},
      "order": 4,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "VAE",
          "type": "VAE",
          "slot_index": 0,
          "links": [
            76,
            354
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAELoader"
      },
      "widgets_values": [
        "wan_2.1_vae.safetensors"
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 166,
      "type": "WanFirstLastFrameToVideo",
      "pos": [
        866.294921875,
        205.22122192382812
      ],
      "size": [
        275.09765625,
        250
      ],
      "flags": {},
      "order": 17,
      "mode": 0,
      "inputs": [
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 352
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 353
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 354
        },
        {
          "name": "clip_vision_start_image",
          "shape": 7,
          "type": "CLIP_VISION_OUTPUT",
          "link": null
        },
        {
          "name": "clip_vision_end_image",
          "shape": 7,
          "type": "CLIP_VISION_OUTPUT",
          "link": null
        },
        {
          "name": "start_image",
          "shape": 7,
          "type": "IMAGE",
          "link": 370
        },
        {
          "name": "end_image",
          "shape": 7,
          "type": "IMAGE",
          "link": 371
        },
        {
          "name": "width",
          "type": "INT",
          "widget": {
            "name": "width"
          },
          "link": 355
        },
        {
          "name": "height",
          "type": "INT",
          "widget": {
            "name": "height"
          },
          "link": 356
        }
      ],
      "outputs": [
        {
          "name": "positive",
          "type": "CONDITIONING",
          "links": [
            357,
            362
          ]
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "links": [
            358,
            363
          ]
        },
        {
          "name": "latent",
          "type": "LATENT",
          "links": [
            359
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.48",
        "Node name for S&R": "WanFirstLastFrameToVideo"
      },
      "widgets_values": [
        832,
        480,
        81,
        1
      ],
      "color": "#2a363b",
      "bgcolor": "#3f5159"
    },
    {
      "id": 168,
      "type": "LoadImage",
      "pos": [
        -248.64210510253906,
        963.0194091796875
      ],
      "size": [
        334.93658447265625,
        388.3801574707031
      ],
      "flags": {},
      "order": 5,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            366
          ]
        },
        {
          "name": "MASK",
          "type": "MASK",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "LoadImage"
      },
      "widgets_values": [
        "ComfyUI_00507_.png",
        "image"
      ]
    },
    {
      "id": 167,
      "type": "ImageBatch",
      "pos": [
        161.45394897460938,
        801.67626953125
      ],
      "size": [
        140,
        46
      ],
      "flags": {},
      "order": 12,
      "mode": 0,
      "inputs": [
        {
          "name": "image1",
          "type": "IMAGE",
          "link": 365
        },
        {
          "name": "image2",
          "type": "IMAGE",
          "link": 366
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            367
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.48",
        "Node name for S&R": "ImageBatch"
      },
      "widgets_values": [],
      "color": "#2a363b",
      "bgcolor": "#3f5159"
    },
    {
      "id": 170,
      "type": "ImageFromBatch",
      "pos": [
        604.3246459960938,
        867.176513671875
      ],
      "size": [
        210,
        82
      ],
      "flags": {},
      "order": 15,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 369
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            371
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.48",
        "Node name for S&R": "ImageFromBatch"
      },
      "widgets_values": [
        1,
        1
      ],
      "color": "#2a363b",
      "bgcolor": "#3f5159"
    },
    {
      "id": 169,
      "type": "ImageFromBatch",
      "pos": [
        604.3246459960938,
        729.67626953125
      ],
      "size": [
        210,
        82
      ],
      "flags": {},
      "order": 14,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 368
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            370,
            372
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.48",
        "Node name for S&R": "ImageFromBatch"
      },
      "widgets_values": [
        0,
        1
      ],
      "color": "#2a363b",
      "bgcolor": "#3f5159"
    },
    {
      "id": 160,
      "type": "LoadImage",
      "pos": [
        -249.5445556640625,
        511.5525817871094
      ],
      "size": [
        334.93658447265625,
        388.3801574707031
      ],
      "flags": {},
      "order": 6,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            365
          ]
        },
        {
          "name": "MASK",
          "type": "MASK",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "LoadImage"
      },
      "widgets_values": [
        "MCNK7B5FR2J9K.png",
        "image"
      ]
    },
    {
      "id": 162,
      "type": "GetImageSize",
      "pos": [
        604.3246459960938,
        1025.370361328125
      ],
      "size": [
        210,
        136
      ],
      "flags": {},
      "order": 16,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 372
        }
      ],
      "outputs": [
        {
          "name": "width",
          "type": "INT",
          "links": [
            355
          ]
        },
        {
          "name": "height",
          "type": "INT",
          "links": [
            356
          ]
        },
        {
          "name": "batch_size",
          "type": "INT",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "GetImageSize"
      },
      "widgets_values": [
        "width: 606, height: 865\n batch size: 1"
      ]
    },
    {
      "id": 161,
      "type": "ImageScaleToTotalPixels",
      "pos": [
        330.69580078125,
        801.67626953125
      ],
      "size": [
        239.38699340820312,
        82
      ],
      "flags": {},
      "order": 13,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 367
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            368,
            369
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "ImageScaleToTotalPixels"
      },
      "widgets_values": [
        "nearest-exact",
        0.5
      ],
      "color": "#2a363b",
      "bgcolor": "#3f5159"
    },
    {
      "id": 6,
      "type": "CLIPTextEncode",
      "pos": [
        397.47509765625,
        187.46409606933594
      ],
      "size": [
        419.26959228515625,
        148.8194122314453
      ],
      "flags": {},
      "order": 8,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 74
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            352
          ]
        }
      ],
      "title": "CLIP Text Encode (Positive Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "A girl is about to sit as the camera moves upward"
      ]
    },
    {
      "id": 158,
      "type": "PrimitiveNode",
      "pos": [
        920.8279418945312,
        588.53564453125
      ],
      "size": [
        210,
        82
      ],
      "flags": {},
      "order": 7,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "INT",
          "type": "INT",
          "widget": {
            "name": "start_at_step"
          },
          "links": [
            325,
            326
          ]
        }
      ],
      "title": "start_at_step",
      "properties": {
        "Run widget replace on values": false
      },
      "widgets_values": [
        4,
        "fixed"
      ],
      "color": "#232",
      "bgcolor": "#353"
    },
    {
      "id": 155,
      "type": "KSamplerAdvanced",
      "pos": [
        1510.51513671875,
        185.6936798095703
      ],
      "size": [
        304.748046875,
        334
      ],
      "flags": {},
      "order": 19,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 324
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 362
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 363
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 316
        },
        {
          "name": "start_at_step",
          "type": "INT",
          "widget": {
            "name": "start_at_step"
          },
          "link": 325
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            317
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSamplerAdvanced"
      },
      "widgets_values": [
        "disable",
        0,
        "fixed",
        20,
        3,
        "euler",
        "simple",
        4,
        9999,
        "disable"
      ]
    },
    {
      "id": 154,
      "type": "KSamplerAdvanced",
      "pos": [
        1170.0396728515625,
        186.20217895507812
      ],
      "size": [
        304.748046875,
        334
      ],
      "flags": {},
      "order": 18,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 322
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 357
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 358
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 359
        },
        {
          "name": "end_at_step",
          "type": "INT",
          "widget": {
            "name": "end_at_step"
          },
          "link": 326
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            316
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSamplerAdvanced"
      },
      "widgets_values": [
        "enable",
        123,
        "fixed",
        20,
        4,
        "euler",
        "simple",
        0,
        4,
        "enable"
      ]
    }
  ],
  "links": [
    [
      74,
      38,
      0,
      6,
      0,
      "CLIP"
    ],
    [
      75,
      38,
      0,
      7,
      0,
      "CLIP"
    ],
    [
      76,
      39,
      0,
      8,
      1,
      "VAE"
    ],
    [
      134,
      73,
      0,
      48,
      0,
      "MODEL"
    ],
    [
      256,
      8,
      0,
      113,
      0,
      "IMAGE"
    ],
    [
      316,
      154,
      0,
      155,
      3,
      "LATENT"
    ],
    [
      317,
      155,
      0,
      8,
      0,
      "LATENT"
    ],
    [
      322,
      48,
      0,
      154,
      0,
      "MODEL"
    ],
    [
      323,
      156,
      0,
      157,
      0,
      "MODEL"
    ],
    [
      324,
      157,
      0,
      155,
      0,
      "MODEL"
    ],
    [
      325,
      158,
      0,
      155,
      4,
      "INT"
    ],
    [
      326,
      158,
      0,
      154,
      4,
      "INT"
    ],
    [
      352,
      6,
      0,
      166,
      0,
      "CONDITIONING"
    ],
    [
      353,
      7,
      0,
      166,
      1,
      "CONDITIONING"
    ],
    [
      354,
      39,
      0,
      166,
      2,
      "VAE"
    ],
    [
      355,
      162,
      0,
      166,
      7,
      "INT"
    ],
    [
      356,
      162,
      1,
      166,
      8,
      "INT"
    ],
    [
      357,
      166,
      0,
      154,
      1,
      "CONDITIONING"
    ],
    [
      358,
      166,
      1,
      154,
      2,
      "CONDITIONING"
    ],
    [
      359,
      166,
      2,
      154,
      3,
      "LATENT"
    ],
    [
      362,
      166,
      0,
      155,
      1,
      "CONDITIONING"
    ],
    [
      363,
      166,
      1,
      155,
      2,
      "CONDITIONING"
    ],
    [
      365,
      160,
      0,
      167,
      0,
      "IMAGE"
    ],
    [
      366,
      168,
      0,
      167,
      1,
      "IMAGE"
    ],
    [
      367,
      167,
      0,
      161,
      0,
      "IMAGE"
    ],
    [
      368,
      161,
      0,
      169,
      0,
      "IMAGE"
    ],
    [
      369,
      161,
      0,
      170,
      0,
      "IMAGE"
    ],
    [
      370,
      169,
      0,
      166,
      5,
      "IMAGE"
    ],
    [
      371,
      170,
      0,
      166,
      6,
      "IMAGE"
    ],
    [
      372,
      169,
      0,
      162,
      0,
      "IMAGE"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.424097618372485,
      "offset": [
        349.5445556640625,
        204.88626098632812
      ]
    },
    "frontendVersion": "1.28.0",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

🟦 与 Wan2.1 同样，向 WanFirstLastFrameToVideo 节点输入 Start / End 图像。

Wan2.2 5B 模型（TI2V-5B）

Wan2.2-TI2V-5B 是在 1 个模型处理 text2video / image2video 双方的 TI2V 模型。通过组合压缩率更高的 VAE 和补丁化处理，可以用 比 14B 更轻的计算成本 生成 720p・24fps・5秒左右的视频。

与其说是 14B 的 1.3B 式的缩小版，不如认为是稍微从根本上设计不同的路线比较好吧。

虽然设计很有趣，但遗憾的是果然在性能上赢不了 14B，实际上几乎不被使用。

模型的下载

diffusion_models
- wan2.2_ti2v_5B_fp16.safetensors
vae
- wan2.2_vae.safetensors
gguf（任意）
- Wan2.2-TI2V-5B-GGUF

配置例。

📂ComfyUI/
└── 📂models/
    ├── 📂diffusion_models/
    │   └── wan2.2_ti2v_5B_fp16.safetensors
    ├── 📂unet/
    │   └── Wan2.2-TI2V-5B-XXXX.gguf     ← 仅在使用 gguf 时
    └── 📂vae/
        └── wan2.2_vae.safetensors

text2video（5B）

Wan2.2_text2video_5B.json

{
  "id": "d8034549-7e0a-40f1-8c2e-de3ffc6f1cae",
  "revision": 0,
  "last_node_id": 165,
  "last_link_id": 341,
  "nodes": [
    {
      "id": 38,
      "type": "CLIPLoader",
      "pos": [
        56.288665771484375,
        312.74468994140625
      ],
      "size": [
        301.3524169921875,
        106
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "CLIP",
          "type": "CLIP",
          "slot_index": 0,
          "links": [
            74,
            75
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPLoader"
      },
      "widgets_values": [
        "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
        "wan",
        "default"
      ],
      "color": "#432",
      "bgcolor": "#653"
    },
    {
      "id": 48,
      "type": "ModelSamplingSD3",
      "pos": [
        627.1928100585938,
        49.284149169921875
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 6,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 134
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            329
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 6,
      "type": "CLIPTextEncode",
      "pos": [
        417.9232177734375,
        186
      ],
      "size": [
        419.26959228515625,
        148.8194122314453
      ],
      "flags": {},
      "order": 4,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 74
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            330
          ]
        }
      ],
      "title": "CLIP Text Encode (Positive Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "A high-quality, ultra-detailed image of a kingfisher diving into a crystal-clear pool of water, captured at the precise moment its beak touches the surface. The bird’s vibrant blue and orange feathers are sharply defined, with water droplets suspended in the air around it. The background features a softly blurred natural riverside setting, with lush green foliage and gentle sunlight filtering through the trees, creating a serene and dynamic scene."
      ]
    },
    {
      "id": 7,
      "type": "CLIPTextEncode",
      "pos": [
        417.8738708496094,
        389
      ],
      "size": [
        419.3189392089844,
        138.8924560546875
      ],
      "flags": {},
      "order": 5,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 75
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            331
          ]
        }
      ],
      "title": "CLIP Text Encode (Negative Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走 "
      ]
    },
    {
      "id": 73,
      "type": "UNETLoader",
      "pos": [
        329.4610595703125,
        46.62214660644531
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            134
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_ti2v_5B_fp16.safetensors",
        "default"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 151,
      "type": "MarkdownNote",
      "pos": [
        -87.05850982666016,
        -15.094049453735352
      ],
      "size": [
        366.41766357421875,
        229.31654357910156
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [],
      "outputs": [],
      "properties": {},
      "widgets_values": [
        "## models\n- [wan2.2_ti2v_5B_fp16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_ti2v_5B_fp16.safetensors)\n- [umt5_xxl.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files/text_encoders)\n- [wan2.2_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/vae/wan2.2_vae.safetensors)\n\n\n```\n📂ComfyUI/\n└──📂models/\n    ├── 📂diffusion_models/\n    │   └── wan2.2_ti2v_5B_fp16.safetensors\n    ├── 📂text_encoders/\n    │   └── umt5_xxl (fp16 or fp8).safetensors\n    └── 📂vae/\n         └── wan2.2_vae.safetensors\n\n```"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 161,
      "type": "Wan22ImageToVideoLatent",
      "pos": [
        563.1520385742188,
        593.4163818359375
      ],
      "size": [
        271.9126892089844,
        150
      ],
      "flags": {},
      "order": 7,
      "mode": 0,
      "inputs": [
        {
          "name": "vae",
          "type": "VAE",
          "link": 336
        },
        {
          "name": "start_image",
          "shape": 7,
          "type": "IMAGE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            335
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "Wan22ImageToVideoLatent"
      },
      "widgets_values": [
        1280,
        704,
        53,
        1
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 160,
      "type": "KSampler",
      "pos": [
        877.5444946289062,
        186
      ],
      "size": [
        270,
        262
      ],
      "flags": {},
      "order": 8,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 329
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 330
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 331
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 335
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            332
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSampler"
      },
      "widgets_values": [
        123456,
        "fixed",
        20,
        4,
        "uni_pc",
        "simple",
        1
      ]
    },
    {
      "id": 39,
      "type": "VAELoader",
      "pos": [
        277.54449462890625,
        596.2767333984375
      ],
      "size": [
        265.22003173828125,
        58
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "VAE",
          "type": "VAE",
          "slot_index": 0,
          "links": [
            76,
            336
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAELoader"
      },
      "widgets_values": [
        "wan2.2_vae.safetensors"
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 8,
      "type": "VAEDecode",
      "pos": [
        1179.2911376953125,
        186
      ],
      "size": [
        157.56002807617188,
        46
      ],
      "flags": {},
      "order": 9,
      "mode": 0,
      "inputs": [
        {
          "name": "samples",
          "type": "LATENT",
          "link": 332
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 76
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "slot_index": 0,
          "links": [
            341
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAEDecode"
      },
      "widgets_values": []
    },
    {
      "id": 113,
      "type": "VHS_VideoCombine",
      "pos": [
        1368.597900390625,
        186
      ],
      "size": [
        372.2688903808594,
        541.7478637695312
      ],
      "flags": {},
      "order": 10,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 341
        },
        {
          "name": "audio",
          "shape": 7,
          "type": "AUDIO",
          "link": null
        },
        {
          "name": "meta_batch",
          "shape": 7,
          "type": "VHS_BatchManager",
          "link": null
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "Filenames",
          "type": "VHS_FILENAMES",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfyui-videohelpersuite",
        "ver": "a7ce59e381934733bfae03b1be029756d6ce936d",
        "Node name for S&R": "VHS_VideoCombine"
      },
      "widgets_values": {
        "frame_rate": 24,
        "loop_count": 0,
        "filename_prefix": "Wan2.2",
        "format": "video/h264-mp4",
        "pix_fmt": "yuv420p",
        "crf": 19,
        "save_metadata": true,
        "trim_to_audio": false,
        "pingpong": false,
        "save_output": true,
        "videopreview": {
          "hidden": false,
          "paused": false,
          "params": {
            "filename": "Wan2.2_00006.mp4",
            "subfolder": "",
            "type": "output",
            "format": "video/h264-mp4",
            "frame_rate": 24,
            "workflow": "Wan2.2_00006.png",
            "fullpath": "D:\\AI\\ComfyUI_windows_portable\\ComfyUI\\output\\Wan2.2_00006.mp4"
          }
        }
      }
    }
  ],
  "links": [
    [
      74,
      38,
      0,
      6,
      0,
      "CLIP"
    ],
    [
      75,
      38,
      0,
      7,
      0,
      "CLIP"
    ],
    [
      76,
      39,
      0,
      8,
      1,
      "VAE"
    ],
    [
      134,
      73,
      0,
      48,
      0,
      "MODEL"
    ],
    [
      329,
      48,
      0,
      160,
      0,
      "MODEL"
    ],
    [
      330,
      6,
      0,
      160,
      1,
      "CONDITIONING"
    ],
    [
      331,
      7,
      0,
      160,
      2,
      "CONDITIONING"
    ],
    [
      332,
      160,
      0,
      8,
      0,
      "LATENT"
    ],
    [
      335,
      161,
      0,
      160,
      3,
      "LATENT"
    ],
    [
      336,
      39,
      0,
      161,
      0,
      "VAE"
    ],
    [
      341,
      8,
      0,
      113,
      0,
      "IMAGE"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.7513148009015777,
      "offset": [
        187.05850982666016,
        115.09404945373535
      ]
    },
    "frontendVersion": "1.25.1",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

5B 的 text2video，在内部经由“最初的帧 latent”生成视频。

🟥 VAE 使用 wan2.2_vae 因为与 Wan2.1 的 VAE 压缩构造不同，所以忘记替换的话画质和动作会严重崩溃。
🟩 在 text2video 也夹入 TI2V 用的 latent 节点（例：Wan22ImageToVideoLatent） 5B 是以“1 帧 latent → 视频”的管线为前提设计的，所以没有预想跳过这里的构成。

如果理解为“虽然是 text2video，但里面是 image2video 的特别案例”，就容易与其他 TI2V 系模型合并整理。

image2video（5B）

Wan2.2_image2video_5B.json

{
  "id": "d8034549-7e0a-40f1-8c2e-de3ffc6f1cae",
  "revision": 0,
  "last_node_id": 168,
  "last_link_id": 347,
  "nodes": [
    {
      "id": 38,
      "type": "CLIPLoader",
      "pos": [
        56.288665771484375,
        312.74468994140625
      ],
      "size": [
        301.3524169921875,
        106
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "CLIP",
          "type": "CLIP",
          "slot_index": 0,
          "links": [
            74,
            75
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPLoader"
      },
      "widgets_values": [
        "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
        "wan",
        "default"
      ],
      "color": "#432",
      "bgcolor": "#653"
    },
    {
      "id": 48,
      "type": "ModelSamplingSD3",
      "pos": [
        627.1928100585938,
        49.284149169921875
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 7,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 134
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "slot_index": 0,
          "links": [
            329
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "ModelSamplingSD3"
      },
      "widgets_values": [
        8
      ]
    },
    {
      "id": 7,
      "type": "CLIPTextEncode",
      "pos": [
        417.8738708496094,
        389
      ],
      "size": [
        419.3189392089844,
        138.8924560546875
      ],
      "flags": {},
      "order": 6,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 75
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            331
          ]
        }
      ],
      "title": "CLIP Text Encode (Negative Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "色调艳丽，过曝，静态，细节模糊不清，字幕，风格，作品，画作，画面，静止，整体发灰，最差质量，低质量，JPEG压缩残留，丑陋的，残缺的，多余的手指，画得不好的手部，画得不好的脸部，畸形的，毁容的，形态畸形的肢体，手指融合，静止不动的画面，杂乱的背景，三条腿，背景人很多，倒着走 "
      ]
    },
    {
      "id": 73,
      "type": "UNETLoader",
      "pos": [
        329.4610595703125,
        46.62214660644531
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            134
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Wan2.2\\wan2.2_ti2v_5B_fp16.safetensors",
        "default"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 151,
      "type": "MarkdownNote",
      "pos": [
        -87.05850982666016,
        -15.094049453735352
      ],
      "size": [
        366.41766357421875,
        229.31654357910156
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [],
      "outputs": [],
      "properties": {},
      "widgets_values": [
        "## models\n- [wan2.2_ti2v_5B_fp16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/diffusion_models/wan2.2_ti2v_5B_fp16.safetensors)\n- [umt5_xxl.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files/text_encoders)\n- [wan2.2_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/blob/main/split_files/vae/wan2.2_vae.safetensors)\n\n\n```\n📂ComfyUI/\n└──📂models/\n    ├── 📂diffusion_models/\n    │   └── wan2.2_ti2v_5B_fp16.safetensors\n    ├── 📂text_encoders/\n    │   └── umt5_xxl (fp16 or fp8).safetensors\n    └── 📂vae/\n         └── wan2.2_vae.safetensors\n\n```"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 39,
      "type": "VAELoader",
      "pos": [
        277.54449462890625,
        596.2767333984375
      ],
      "size": [
        265.22003173828125,
        58
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "VAE",
          "type": "VAE",
          "slot_index": 0,
          "links": [
            76,
            336
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAELoader"
      },
      "widgets_values": [
        "wan2.2_vae.safetensors"
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 8,
      "type": "VAEDecode",
      "pos": [
        1179.2911376953125,
        186
      ],
      "size": [
        157.56002807617188,
        46
      ],
      "flags": {},
      "order": 12,
      "mode": 0,
      "inputs": [
        {
          "name": "samples",
          "type": "LATENT",
          "link": 332
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 76
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "slot_index": 0,
          "links": [
            341
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAEDecode"
      },
      "widgets_values": []
    },
    {
      "id": 113,
      "type": "VHS_VideoCombine",
      "pos": [
        1368.597900390625,
        186
      ],
      "size": [
        372.2688903808594,
        886.1885986328125
      ],
      "flags": {},
      "order": 13,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 341
        },
        {
          "name": "audio",
          "shape": 7,
          "type": "AUDIO",
          "link": null
        },
        {
          "name": "meta_batch",
          "shape": 7,
          "type": "VHS_BatchManager",
          "link": null
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "Filenames",
          "type": "VHS_FILENAMES",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfyui-videohelpersuite",
        "ver": "a7ce59e381934733bfae03b1be029756d6ce936d",
        "Node name for S&R": "VHS_VideoCombine"
      },
      "widgets_values": {
        "frame_rate": 24,
        "loop_count": 0,
        "filename_prefix": "Wan2.2",
        "format": "video/h264-mp4",
        "pix_fmt": "yuv420p",
        "crf": 19,
        "save_metadata": true,
        "trim_to_audio": false,
        "pingpong": false,
        "save_output": true,
        "videopreview": {
          "hidden": false,
          "paused": false,
          "params": {
            "filename": "Wan2.2_00012.mp4",
            "subfolder": "",
            "type": "output",
            "format": "video/h264-mp4",
            "frame_rate": 24,
            "workflow": "Wan2.2_00012.png",
            "fullpath": "D:\\AI\\ComfyUI_windows_portable\\ComfyUI\\output\\Wan2.2_00012.mp4"
          }
        }
      }
    },
    {
      "id": 167,
      "type": "ImageScaleToTotalPixels",
      "pos": [
        93.42740631103516,
        723.7991943359375
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 8,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 343
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            344,
            345
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "ImageScaleToTotalPixels"
      },
      "widgets_values": [
        "nearest-exact",
        0.5
      ]
    },
    {
      "id": 168,
      "type": "GetImageSize",
      "pos": [
        393.28973388671875,
        771.4497680664062
      ],
      "size": [
        140,
        124
      ],
      "flags": {},
      "order": 9,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 345
        }
      ],
      "outputs": [
        {
          "name": "width",
          "type": "INT",
          "links": [
            346
          ]
        },
        {
          "name": "height",
          "type": "INT",
          "links": [
            347
          ]
        },
        {
          "name": "batch_size",
          "type": "INT",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "GetImageSize"
      },
      "widgets_values": []
    },
    {
      "id": 161,
      "type": "Wan22ImageToVideoLatent",
      "pos": [
        563.1520385742188,
        593.4163818359375
      ],
      "size": [
        271.9126892089844,
        150
      ],
      "flags": {},
      "order": 10,
      "mode": 0,
      "inputs": [
        {
          "name": "vae",
          "type": "VAE",
          "link": 336
        },
        {
          "name": "start_image",
          "shape": 7,
          "type": "IMAGE",
          "link": 344
        },
        {
          "name": "width",
          "type": "INT",
          "widget": {
            "name": "width"
          },
          "link": 346
        },
        {
          "name": "height",
          "type": "INT",
          "widget": {
            "name": "height"
          },
          "link": 347
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            335
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "Wan22ImageToVideoLatent"
      },
      "widgets_values": [
        1280,
        704,
        65,
        1
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 166,
      "type": "LoadImage",
      "pos": [
        -210.51498413085938,
        721.4742431640625
      ],
      "size": [
        275.080078125,
        478
      ],
      "flags": {},
      "order": 4,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            343
          ]
        },
        {
          "name": "MASK",
          "type": "MASK",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "LoadImage"
      },
      "widgets_values": [
        "cloud.png",
        "image"
      ]
    },
    {
      "id": 6,
      "type": "CLIPTextEncode",
      "pos": [
        417.9232177734375,
        186
      ],
      "size": [
        419.26959228515625,
        148.8194122314453
      ],
      "flags": {},
      "order": 5,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 74
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "slot_index": 0,
          "links": [
            330
          ]
        }
      ],
      "title": "CLIP Text Encode (Positive Prompt)",
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "CLIPTextEncode"
      },
      "widgets_values": [
        "Anime girl with light blue hair gradually dissolving into white clouds at the tips. She is looking upwards against a bright blue sky with soft white clouds"
      ]
    },
    {
      "id": 160,
      "type": "KSampler",
      "pos": [
        877.5444946289062,
        186
      ],
      "size": [
        270,
        262
      ],
      "flags": {},
      "order": 11,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 329
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 330
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 331
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 335
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            332
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.46",
        "Node name for S&R": "KSampler"
      },
      "widgets_values": [
        1234,
        "fixed",
        20,
        4,
        "uni_pc",
        "simple",
        1
      ]
    }
  ],
  "links": [
    [
      74,
      38,
      0,
      6,
      0,
      "CLIP"
    ],
    [
      75,
      38,
      0,
      7,
      0,
      "CLIP"
    ],
    [
      76,
      39,
      0,
      8,
      1,
      "VAE"
    ],
    [
      134,
      73,
      0,
      48,
      0,
      "MODEL"
    ],
    [
      329,
      48,
      0,
      160,
      0,
      "MODEL"
    ],
    [
      330,
      6,
      0,
      160,
      1,
      "CONDITIONING"
    ],
    [
      331,
      7,
      0,
      160,
      2,
      "CONDITIONING"
    ],
    [
      332,
      160,
      0,
      8,
      0,
      "LATENT"
    ],
    [
      335,
      161,
      0,
      160,
      3,
      "LATENT"
    ],
    [
      336,
      39,
      0,
      161,
      0,
      "VAE"
    ],
    [
      341,
      8,
      0,
      113,
      0,
      "IMAGE"
    ],
    [
      343,
      166,
      0,
      167,
      0,
      "IMAGE"
    ],
    [
      344,
      167,
      0,
      161,
      1,
      "IMAGE"
    ],
    [
      345,
      167,
      0,
      168,
      0,
      "IMAGE"
    ],
    [
      346,
      168,
      0,
      161,
      2,
      "INT"
    ],
    [
      347,
      168,
      1,
      161,
      3,
      "INT"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.6830134553650707,
      "offset": [
        310.5149841308594,
        115.09404945373535
      ]
    },
    "frontendVersion": "1.25.1",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

image2video 也使用与 text2video 相同的 TI2V 模型。只是输入增加，后段的 KSampler 以后几乎通用。

🟦 start 图像输入到 TI2V 用的 latent 节点，制作压缩 latent。
🟩 text2video / image2video 双方都能用同一模型解决，所以首先用 5B 固化工作流，再根据需要加上 14B，这样的运用变得容易了。

Wan2.2

什么是 Wan2.2？

Wan2.2 14B 模型

推荐设定值

模型的下载

text2video（14B）

image2video（14B）

FLF2V（14B / First–Last Frame to Video）

Wan2.2 5B 模型（TI2V-5B）

推荐设定值

模型的下载

text2video（5B）

image2video（5B）

参考链接

什么是 JSON 复制按钮？

这个页面有问题！

请补充讲解！

感想 / 其他

感谢！