Add `image-text-to-image` and `image-text-to-video` tasks #1866

apolinario · 2025-12-03T00:51:06Z

The goal of these new tasks is to support models that take both image and text input and output either an image or a video.

The goal of this PR is to make the tasks as analogous to image-to-image and image-to-video as possible. The only difference is that the image input is now optional, since an empty image with a valid prompt should still work for a model like FLUX.2 (which supports both text-to-image and image-to-image) or LTX Video (which supports both text-to-video and image-to-video).
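To illustrate the "optional image" idea, here is a minimal sketch. The names `ImageTextToImageInput`, `Mode`, and `resolveMode` are hypothetical and invented for this example; they are not the types or helpers this PR adds to the tasks package. The sketch only assumes that a prompt is always required and that a missing or empty image falls back to text-only generation.

```typescript
// Hypothetical input shape for illustration only; not the actual task spec from this PR.
interface ImageTextToImageInput {
	/** Text prompt guiding the generation (assumed to be required). */
	prompt: string;
	/** Optional conditioning image as raw bytes; omit it for text-only generation. */
	image?: Uint8Array;
}

type Mode = "text-only" | "image-and-text";

// Classify a request, treating a missing or empty image the same way (text-only).
function resolveMode(input: ImageTextToImageInput): Mode {
	if (input.prompt.trim().length === 0) {
		throw new Error("A non-empty prompt is required");
	}
	const hasImage = input.image !== undefined && input.image.length > 0;
	return hasImage ? "image-and-text" : "text-only";
}
```

Under these assumptions, `resolveMode({ prompt: "a cat" })` yields `"text-only"`, while passing a non-empty `image` alongside the prompt yields `"image-and-text"`.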

Once this is in, I'll also open a widget PR in Moon to support these tasks in the model cards and widgets, plus a follow-up PR adding them to the inference providers, so that we can then open PRs on model repos to change the task for compatible models.

apolinario · 2025-12-03T00:53:49Z

AI agent disclosure: I added the about.md files with Claude and haven't reviewed its slop yet. I will, but it should not be a blocker for the more structural stuff.

pcuenca · 2025-12-03T08:22:25Z

cc @merveenoyan

gary149

looks good - maybe @merveenoyan you'll want to do some edits to md files

merveenoyan

thanks a lot for seeing this gap and working on it!

merveenoyan · 2025-12-03T09:45:30Z

packages/tasks/src/tasks/image-text-to-video/data.ts

+	],
+	models: [
+		{
+			description: "A powerful model for image-text-to-video generation.",


would be nice to add more here and the Space too!

Wauplin · 2025-12-03T10:24:07Z

cc @julien-c as well on the (orthogonal) topic of generic any-to-any task + modality selection

apolinario · 2025-12-04T13:13:20Z

Thanks @merveenoyan, I modified the examples and opened this PR: https://huggingface.co/datasets/huggingfacejs/tasks/discussions/12

hanouticelina

Reviewed the inference part, all good! thanks

merveenoyan

thank you!

apolinario added 2 commits December 3, 2025 09:29

add image-text-to-image and image-text-to-video tasks

47d5221

add snippetGenerator

3aec3d2

apolinario requested review from SBrandeis, Wauplin, gary149, hanouticelina, julien-c, ngxson and pcuenca as code owners December 3, 2025 00:51

apolinario requested a review from merveenoyan December 3, 2025 00:51

gary149 approved these changes Dec 3, 2025

merveenoyan reviewed Dec 3, 2025

apolinario and others added 7 commits December 4, 2025 09:28

Update packages/tasks/src/tasks/image-text-to-video/about.md

d8b1a33

Co-authored-by: Merve Noyan <merveenoyan@gmail.com>

Update packages/tasks/src/tasks/image-text-to-video/about.md

4262e28

Co-authored-by: Merve Noyan <merveenoyan@gmail.com>

Update packages/tasks/src/tasks/image-text-to-video/about.md

6293843

Co-authored-by: Merve Noyan <merveenoyan@gmail.com>

Update packages/tasks/src/tasks/image-text-to-video/data.ts

0a2d889

Co-authored-by: Merve Noyan <merveenoyan@gmail.com>

change examples and data

9913ed9

modify the filenames

7dbfd4b

Merge branch 'new-image-text-tasks' of https://github.com/huggingface…

cd0934d

…/huggingface.js into new-image-text-tasks

hanouticelina approved these changes Dec 4, 2025

merveenoyan approved these changes Dec 4, 2025

merveenoyan merged commit 2c2de89 into main Dec 5, 2025
5 checks passed

merveenoyan deleted the new-image-text-tasks branch December 5, 2025 15:41

7 participants
