JavaScript is unavailable, so some features (menu/search) are unavailable. Check browser or extension settings.

video2audio

What is video2audio?

Except for Sora 2 and Veo 3, current video generation models still only generate video. In other words, there is no sound.

That's where video2audio comes in handy - technology to generate audio from video.

It understands "what is happening" from the video and generates sound corresponding to that content so that it synchronizes with the video.

FoleyCrafter

FoleyCrafter is a Video2Audio framework that adds a "video adapter" on top of an existing Text2Audio model.

It is an image of adding information on "what sound is appropriate by looking at the video" and "when it should sound (timing)" to the original Text2Audio model.

HunyuanVideo-Foley

HunyuanVideo-Foley is a multimodal diffusion Transformer that assumes text + video -> audio from the beginning.

It does not add functions to a text2audio model like FoleyCrafter from the start, but learns text, video, and audio together.

Report a correction for this page

Frame Interpolation Lip Sync

What is the JSON copy button?

It exports every node, connection, and parameter so someone else can reproduce the exact workflow.

Select the Copy JSON button on any workflow article to place the graph on your clipboard, then press Ctrl+V inside ComfyUI to load it.

This page has an issue!

Include a short repro step or the correct information if you can. We attach the current URL automatically.

Double-check before sending. We'll create an anonymous GitHub issue.

We'll file a GitHub issue—please avoid personal data.

Please explain more!

Tell us what topic or model you need so we can plan the write-up.

Double-check before sending. We'll create an anonymous GitHub issue.

We'll file a GitHub issue—please avoid personal data.

Feedback / Other

Feel free to share ideas, impressions, or anything else.

Double-check before sending. We'll create an anonymous GitHub issue.

Allow site citation/use

Allow Do not allow

We'll file a GitHub issue—please avoid personal data.