放大・图像修复

放大是将分辨率小的图像放大的任务，但如果只是单纯放大，PowerPoint 等也可以做到。

但是，如果只是将低画质且粗糙的图像单纯地放大 2 倍、4 倍，只会得到“巨大的粗糙图像”，信息量不会增加。

因此，这里说的放大是指 “放大图像”、“看起来自然地补充缺失的细节，修复画质”，这两者成套进行的技术。

此外，也有更专注于图像修复的技术。消除旧照片的划痕，或自动为黑白照片上色等处理，也可以视为“图像修复”的一种。

让我们只看看使用了什么手法・模型，以及代表性的东西。

GAN / 传统型放大

使用 GAN 或传统型超分辨率模型进行的放大。这是 Stable Diffusion 以前就有的系统，现在有时仍作为轻量处理被使用。

ESRGAN.json

{
  "id": "856e71ca-93c8-443a-a6c2-c2d179f2bd60",
  "revision": 0,
  "last_node_id": 6,
  "last_link_id": 4,
  "nodes": [
    {
      "id": 3,
      "type": "LoadImage",
      "pos": [
        557.2993863646276,
        332.50277383751643
      ],
      "size": [
        249.53462357954527,
        493.0909090909091
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            1
          ]
        },
        {
          "name": "MASK",
          "type": "MASK",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.75",
        "Node name for S&R": "LoadImage"
      },
      "widgets_values": [
        "5905871d320b72c5dd9db3ab44d81854-png.jpg",
        "image"
      ]
    },
    {
      "id": 2,
      "type": "ImageUpscaleWithModel",
      "pos": [
        843.6630227282637,
        313.41186474660685
      ],
      "size": [
        246.5274857954545,
        46
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [
        {
          "name": "upscale_model",
          "type": "UPSCALE_MODEL",
          "link": 2
        },
        {
          "name": "image",
          "type": "IMAGE",
          "link": 1
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            4
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.75",
        "Node name for S&R": "ImageUpscaleWithModel"
      },
      "color": "#232",
      "bgcolor": "#353"
    },
    {
      "id": 6,
      "type": "SaveImage",
      "pos": [
        1121.8448409100806,
        315.2300465647888
      ],
      "size": [
        304.5454545454545,
        506.3636363636364
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 4
        }
      ],
      "outputs": [],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.75"
      },
      "widgets_values": [
        "ComfyUI"
      ]
    },
    {
      "id": 4,
      "type": "UpscaleModelLoader",
      "pos": [
        596.8340099441729,
        213.4118647466074
      ],
      "size": [
        210,
        58
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "UPSCALE_MODEL",
          "type": "UPSCALE_MODEL",
          "links": [
            2
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.75",
        "Node name for S&R": "UpscaleModelLoader"
      },
      "widgets_values": [
        "ESRGAN\\ESRGAN_4x.pth"
      ],
      "color": "#232",
      "bgcolor": "#353"
    }
  ],
  "links": [
    [
      1,
      3,
      0,
      2,
      1,
      "IMAGE"
    ],
    [
      2,
      4,
      0,
      2,
      0,
      "UPSCALE_MODEL"
    ],
    [
      4,
      2,
      0,
      6,
      0,
      "IMAGE"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.9229599817706415,
      "offset": [
        -361.95397406280983,
        -53.820982057971335
      ]
    },
    "frontendVersion": "1.34.2",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

ESRGAN
Real-ESRGAN
SwinIR
HYPIR

面部修复模型（专注于面部周围）

专注于面部，用于恢复模糊、崩坏、低分辨率面部的模型。虽然有名为 ReActor 的 FaceSwap 技术，但因为它只能生成低分辨率，所以作为后处理被使用。

GFPGAN
CodeFormer

扩散模型系放大・修复

使用 Stable Diffusion 等扩散模型，一边重绘图像一边进行放大・修复的方法。

image2image
- 虽然是以图像为底稿生成图像的功能，但通过抑制 denoise 量，可以在不怎么改变构图和内容的情况下作为“修复”使用。
Ultimate SD upscale
- 如果只是单纯的 image2image，受限于该模型能处理的分辨率或 PC 规格，能生成的尺寸有限制。
- 因此，将图像分割成瓦片状，逐个进行 image2image 后再合并，从而能够处理更大图像的机制。
SUPIR
- 基于 SDXL，专注于放大・图像恢复的模型。目的是从低画质的输入中恢复自然的高分辨率图像。

扩散模型中的放大，在某种意义上是重绘。因此，超越单纯的修复，有 “做过头” 的倾向。当然这也是表现的一种，为了与尽量保持原图像的放大区别开来，有时也被称为 Enhance。

基于指令的图像编辑进行的修复

在最近的“基于指令的图像编辑”模型中，也有只需用文本指示，就能汇总进行接近放大・修复处理的模型。

即使不分别准备专用的模型，只要指示“把这张照片变漂亮”、“减少噪点”、“给黑白照片上色”等，它就会汇总进行这些处理。

Qwen-Image-Edit-2509.json

{
  "id": "d8034549-7e0a-40f1-8c2e-de3ffc6f1cae",
  "revision": 0,
  "last_node_id": 123,
  "last_link_id": 319,
  "nodes": [
    {
      "id": 54,
      "type": "ModelSamplingAuraFlow",
      "pos": [
        634.9767456054688,
        -1.8326886892318726
      ],
      "size": [
        230.33058166503906,
        58
      ],
      "flags": {},
      "order": 5,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 282
        }
      ],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            123
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.49",
        "Node name for S&R": "ModelSamplingAuraFlow"
      },
      "widgets_values": [
        3.1000000000000005
      ]
    },
    {
      "id": 63,
      "type": "VAEEncode",
      "pos": [
        714.6403198242188,
        673.7313842773438
      ],
      "size": [
        140,
        46
      ],
      "flags": {},
      "order": 8,
      "mode": 0,
      "inputs": [
        {
          "name": "pixels",
          "type": "IMAGE",
          "link": 239
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 115
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "links": [
            112
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.51",
        "Node name for S&R": "VAEEncode"
      },
      "widgets_values": []
    },
    {
      "id": 8,
      "type": "VAEDecode",
      "pos": [
        1293.939697265625,
        143.6978759765625
      ],
      "size": [
        157.56002807617188,
        46
      ],
      "flags": {},
      "order": 12,
      "mode": 0,
      "inputs": [
        {
          "name": "samples",
          "type": "LATENT",
          "link": 35
        },
        {
          "name": "vae",
          "type": "VAE",
          "link": 76
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "slot_index": 0,
          "links": [
            254
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAEDecode"
      },
      "widgets_values": []
    },
    {
      "id": 112,
      "type": "CLIPLoader",
      "pos": [
        75.53079223632812,
        277.016357421875
      ],
      "size": [
        270,
        106
      ],
      "flags": {},
      "order": 0,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "CLIP",
          "type": "CLIP",
          "links": [
            290,
            291
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.51",
        "Node name for S&R": "CLIPLoader"
      },
      "widgets_values": [
        "qwen_2.5_vl_7b_fp8_scaled.safetensors",
        "qwen_image",
        "default"
      ],
      "color": "#432",
      "bgcolor": "#653"
    },
    {
      "id": 39,
      "type": "VAELoader",
      "pos": [
        107.53079223632812,
        446.7167663574219
      ],
      "size": [
        238,
        58
      ],
      "flags": {},
      "order": 1,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "VAE",
          "type": "VAE",
          "slot_index": 0,
          "links": [
            76,
            115,
            292,
            293
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "VAELoader"
      },
      "widgets_values": [
        "qwen_image_vae.safetensors"
      ],
      "color": "#322",
      "bgcolor": "#533"
    },
    {
      "id": 114,
      "type": "TextEncodeQwenImageEditPlus",
      "pos": [
        454.6401672363281,
        419.63690185546875
      ],
      "size": [
        400,
        200
      ],
      "flags": {},
      "order": 10,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 291
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": 293
        },
        {
          "name": "image1",
          "shape": 7,
          "type": "IMAGE",
          "link": 295
        },
        {
          "name": "image2",
          "shape": 7,
          "type": "IMAGE",
          "link": null
        },
        {
          "name": "image3",
          "shape": 7,
          "type": "IMAGE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "links": [
            315
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.59",
        "Node name for S&R": "TextEncodeQwenImageEditPlus"
      },
      "widgets_values": [
        ""
      ],
      "color": "#232",
      "bgcolor": "#353"
    },
    {
      "id": 111,
      "type": "UNETLoader",
      "pos": [
        330.1968994140625,
        -1.8326886892318726
      ],
      "size": [
        276.62274169921875,
        82
      ],
      "flags": {},
      "order": 2,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "MODEL",
          "type": "MODEL",
          "links": [
            282
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.51",
        "Node name for S&R": "UNETLoader"
      },
      "widgets_values": [
        "Qwen-Image\\qwen_image_edit_2509_fp8_e4m3fn.safetensors",
        "fp8_e4m3fn"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 3,
      "type": "KSampler",
      "pos": [
        933.5941772460938,
        143.6978759765625
      ],
      "size": [
        315,
        262
      ],
      "flags": {},
      "order": 11,
      "mode": 0,
      "inputs": [
        {
          "name": "model",
          "type": "MODEL",
          "link": 123
        },
        {
          "name": "positive",
          "type": "CONDITIONING",
          "link": 314
        },
        {
          "name": "negative",
          "type": "CONDITIONING",
          "link": 315
        },
        {
          "name": "latent_image",
          "type": "LATENT",
          "link": 112
        }
      ],
      "outputs": [
        {
          "name": "LATENT",
          "type": "LATENT",
          "slot_index": 0,
          "links": [
            35
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.33",
        "Node name for S&R": "KSampler"
      },
      "widgets_values": [
        1234,
        "fixed",
        20,
        2.5,
        "res_multistep",
        "simple",
        1
      ]
    },
    {
      "id": 82,
      "type": "ImageScaleToTotalPixels",
      "pos": [
        -224.63221740722656,
        668.4074096679688
      ],
      "size": [
        270,
        82
      ],
      "flags": {},
      "order": 6,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 275
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            244
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.51",
        "Node name for S&R": "ImageScaleToTotalPixels"
      },
      "widgets_values": [
        "nearest-exact",
        1
      ]
    },
    {
      "id": 83,
      "type": "ImageResizeKJv2",
      "pos": [
        75.53079223632812,
        668.4074096679688
      ],
      "size": [
        270,
        336
      ],
      "flags": {},
      "order": 7,
      "mode": 0,
      "inputs": [
        {
          "name": "image",
          "type": "IMAGE",
          "link": 244
        },
        {
          "name": "mask",
          "shape": 7,
          "type": "MASK",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            239,
            294,
            295
          ]
        },
        {
          "name": "width",
          "type": "INT",
          "links": null
        },
        {
          "name": "height",
          "type": "INT",
          "links": null
        },
        {
          "name": "mask",
          "type": "MASK",
          "links": []
        }
      ],
      "properties": {
        "cnr_id": "comfyui-kjnodes",
        "ver": "e2ce0843d1183aea86ce6a1617426f492dcdc802",
        "Node name for S&R": "ImageResizeKJv2"
      },
      "widgets_values": [
        0,
        0,
        "nearest-exact",
        "crop",
        "0, 0, 0",
        "center",
        8,
        "cpu",
        "<tr><td>Output: </td><td><b>1</b> x <b>1024</b> x <b>1024 | 12.00MB</b></td></tr>"
      ]
    },
    {
      "id": 55,
      "type": "MarkdownNote",
      "pos": [
        -84.94583892822266,
        -171.1671905517578
      ],
      "size": [
        386.9856262207031,
        251.33447265625
      ],
      "flags": {},
      "order": 3,
      "mode": 0,
      "inputs": [],
      "outputs": [],
      "properties": {},
      "widgets_values": [
        "## models\n- [qwen_image_edit_2509_fp8_e4m3fn.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Edit_ComfyUI/blob/main/split_files/diffusion_models/qwen_image_edit_2509_fp8_e4m3fn.safetensors)\n- [qwen_2.5_vl_7b_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/blob/main/split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors)\n- [qwen_image_vae.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main/split_files/vae)\n\n\n```\n📂ComfyUI/\n└──📂models/\n    ├── 📂diffusion_models/\n    │   └── qwen_image_edit_2509_fp8_e4m3fn.safetensors\n    ├── 📂text_encoders/\n    │   └── qwen_2.5_vl_7b_fp8.safetensors\n    └── 📂vae/\n         └── wan_2.1_vae.safetensors\n\n```"
      ],
      "color": "#323",
      "bgcolor": "#535"
    },
    {
      "id": 113,
      "type": "TextEncodeQwenImageEditPlus",
      "pos": [
        454.6401672363281,
        163.63690185546875
      ],
      "size": [
        400,
        200
      ],
      "flags": {},
      "order": 9,
      "mode": 0,
      "inputs": [
        {
          "name": "clip",
          "type": "CLIP",
          "link": 290
        },
        {
          "name": "vae",
          "shape": 7,
          "type": "VAE",
          "link": 292
        },
        {
          "name": "image1",
          "shape": 7,
          "type": "IMAGE",
          "link": 294
        },
        {
          "name": "image2",
          "shape": 7,
          "type": "IMAGE",
          "link": null
        },
        {
          "name": "image3",
          "shape": 7,
          "type": "IMAGE",
          "link": null
        }
      ],
      "outputs": [
        {
          "name": "CONDITIONING",
          "type": "CONDITIONING",
          "links": [
            314
          ]
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.59",
        "Node name for S&R": "TextEncodeQwenImageEditPlus"
      },
      "widgets_values": [
        "Make this image look clear and in focus. Reduce blur, enhance edges and textures, and keep the original colors and overall look."
      ],
      "color": "#232",
      "bgcolor": "#353"
    },
    {
      "id": 99,
      "type": "LoadImage",
      "pos": [
        -787.9675541015623,
        668.4074096679688
      ],
      "size": [
        525.4008842507812,
        636.6210345562502
      ],
      "flags": {},
      "order": 4,
      "mode": 0,
      "inputs": [],
      "outputs": [
        {
          "name": "IMAGE",
          "type": "IMAGE",
          "links": [
            275
          ]
        },
        {
          "name": "MASK",
          "type": "MASK",
          "links": null
        }
      ],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.51",
        "Node name for S&R": "LoadImage"
      },
      "widgets_values": [
        "pasted/image (40).png",
        "image"
      ]
    },
    {
      "id": 97,
      "type": "SaveImage",
      "pos": [
        1495.48046875,
        143.6978759765625
      ],
      "size": [
        606.8076645485153,
        669.4791159073438
      ],
      "flags": {},
      "order": 13,
      "mode": 0,
      "inputs": [
        {
          "name": "images",
          "type": "IMAGE",
          "link": 254
        }
      ],
      "outputs": [],
      "properties": {
        "cnr_id": "comfy-core",
        "ver": "0.3.51"
      },
      "widgets_values": [
        "ComfyUI"
      ]
    }
  ],
  "links": [
    [
      35,
      3,
      0,
      8,
      0,
      "LATENT"
    ],
    [
      76,
      39,
      0,
      8,
      1,
      "VAE"
    ],
    [
      112,
      63,
      0,
      3,
      3,
      "LATENT"
    ],
    [
      115,
      39,
      0,
      63,
      1,
      "VAE"
    ],
    [
      123,
      54,
      0,
      3,
      0,
      "MODEL"
    ],
    [
      239,
      83,
      0,
      63,
      0,
      "IMAGE"
    ],
    [
      244,
      82,
      0,
      83,
      0,
      "IMAGE"
    ],
    [
      254,
      8,
      0,
      97,
      0,
      "IMAGE"
    ],
    [
      275,
      99,
      0,
      82,
      0,
      "IMAGE"
    ],
    [
      282,
      111,
      0,
      54,
      0,
      "MODEL"
    ],
    [
      290,
      112,
      0,
      113,
      0,
      "CLIP"
    ],
    [
      291,
      112,
      0,
      114,
      0,
      "CLIP"
    ],
    [
      292,
      39,
      0,
      113,
      1,
      "VAE"
    ],
    [
      293,
      39,
      0,
      114,
      1,
      "VAE"
    ],
    [
      294,
      83,
      0,
      113,
      2,
      "IMAGE"
    ],
    [
      295,
      83,
      0,
      114,
      2,
      "IMAGE"
    ],
    [
      314,
      113,
      0,
      3,
      1,
      "CONDITIONING"
    ],
    [
      315,
      114,
      0,
      3,
      2,
      "CONDITIONING"
    ]
  ],
  "groups": [],
  "config": {},
  "extra": {
    "ds": {
      "scale": 0.6830134553650712,
      "offset": [
        309.96283560156246,
        -29.313273468242187
      ]
    },
    "frontendVersion": "1.34.2",
    "VHS_latentpreview": false,
    "VHS_latentpreviewrate": 0,
    "VHS_MetadataImage": true,
    "VHS_KeepIntermediate": true
  },
  "version": 0.4
}

详情在“基于指令的图像编辑”页面介绍。

视频的放大・视频修复

如果逐帧应用图像放大，视频的放大姑且也是可能的。

但是，这种方法因为没有时间上的一致性，可能会残留闪烁或抖动。

视频专用的放大・修复模型，通过使用前后帧的信息，旨在抑制闪烁或抖动的同时提高画质。

代表性的系列有以下这些。

SeedVR2
FlashVSR

将它们作为静止画的放大使用也没问题。不如说在那个用途上人气也挺高。

放大・图像修复