Conversation

@yiyixuxu yiyixuxu commented Aug 3, 2024

to-do

  • refactor rotary embedding
    • removed the Flux-specific rotary embedding helpers, i.e. EmbedND, apply_rope, and rope;
    • created a FluxPosEmbed that uses diffusers' existing get_1d_rotary_pos_embed method, which is already used by HunyuanDiT, Lumina, and Stable Audio
    • changed the flux transformer inputs img_ids and txt_ids from 3d tensors with a batch dimension to 2d tensors. These are just positional ids used to encode the rotary embedding, so the batch dimension is unnecessary, and get_1d_rotary_pos_embed does not accept batched positional ids, so removing it keeps things consistent across the library. Note that this is a breaking change, so I made sure to deprecate the old shape and added a test to make sure the previous inputs still work (see the sketch after this list)
  • refactor attention processor (combine into one, deprecating FluxSingleAttnProcessor2_0)
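To illustrate the img_ids / txt_ids shape change mentioned above, here is a minimal sketch (not the actual diffusers implementation; normalize_ids is a hypothetical helper) of how the deprecated 3d inputs map onto the new 2d ones:

```python
import torch

batch_size, seq_len = 2, 512

# old (deprecated) style: 3d positional ids with a batch dimension
txt_ids_3d = torch.zeros(batch_size, seq_len, 3)

# new style: 2d positional ids, shared across the batch
txt_ids_2d = torch.zeros(seq_len, 3)


def normalize_ids(ids: torch.Tensor) -> torch.Tensor:
    """Accept either shape and return 2d ids, mimicking the deprecation path."""
    if ids.ndim == 3:
        # batched ids are identical across the batch, so keep the first entry
        # (the real code emits a deprecation warning at this point)
        ids = ids[0]
    return ids


assert normalize_ids(txt_ids_3d).shape == normalize_ids(txt_ids_2d).shape == (seq_len, 3)
```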
```python
# flux unit test for rotary embedding refactor
import torch
from diffusers import FluxPipeline

model_path = "black-forest-labs/FLUX.1-dev"
pipe = FluxPipeline.from_pretrained(model_path, torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload()  # save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power

branch = ""  # set to "_main" when running on the main branch so the outputs can be compared

prompt = "A cat holding a sign that says hello world"
image = pipe(
    prompt,
    height=1024,
    width=1024,
    guidance_scale=3.5,
    num_inference_steps=50,
    max_sequence_length=512,
    generator=torch.Generator("cpu").manual_seed(0),
).images[0]
image.save(f"yiyi_test_4_out{branch}.png")
```
| main | this PR |
| --- | --- |
| yiyi_test_4_out_main | yiyi_test_4_out |
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.


wangqixun commented Aug 6, 2024

Could you please let us know when the related PRs for the Flux transformer block and pipeline will be completed and merged? We are currently working on adapting and training ControlNet and other related plugins on the old version. Once this PR is completed, we will reorganize the code to match the new coding style and then submit a new PR for ControlNet and IP-Adapter.

prompt = "A girl with green long hair. she is wearing a yellow suit. half body, background is sky and cloud. anime style"

[image: image_demo]


yiyixuxu commented Aug 6, 2024

@wangqixun let's not wait for this refactor to be done for the PR! we can refactor the ip-adapter and controlnet together once the PR is in

@yiyixuxu yiyixuxu requested review from DN6 and sayakpaul August 18, 2024 09:15

@sayakpaul sayakpaul left a comment

Superb!

So, IIUC, the batched txt_ids and img_ids are no longer a necessity because of how we're doing the RoPE in the refactored class (FluxPosEmbed)? Or are there any additional differences to be aware of?


```python
def __init__(self):
    deprecation_message = "`FluxSingleAttnProcessor2_0` is deprecated and will be removed in a future version. Please use `FluxAttnProcessor2_0` instead."
    deprecate("FluxSingleAttnProcessor2_0", "1.0.0", deprecation_message)
```

Let's maybe deprecate it earlier? Not a strong opinion, though.
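For context, explicitly opting into the combined processor would look roughly like the sketch below (a hedged example, assuming the transformer exposes the usual set_attn_processor API and `pipe` is the FluxPipeline from the test script above):

```python
from diffusers.models.attention_processor import FluxAttnProcessor2_0

# switch to the combined processor explicitly instead of relying on the
# deprecated FluxSingleAttnProcessor2_0 fallback
pipe.transformer.set_attn_processor(FluxAttnProcessor2_0())
```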

```python
    return x_out.type_as(x)


class FluxPosEmbed(nn.Module):
```

Maybe a reference to the original BFL inference code?

```python
    sin_out = []
    pos = ids.squeeze().float().cpu().numpy()
    is_mps = ids.device.type == "mps"
    freqs_dtype = torch.float32 if is_mps else torch.float64
```

@sayakpaul the results for flux are identical with this refactor.
The only other difference is here, where we downcast the dtype for mps; see #9133 for more details.
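
Putting the pieces together, here is a rough sketch of what the refactored FluxPosEmbed forward pass looks like, built around the snippet above and diffusers' get_1d_rotary_pos_embed (an illustrative sketch, not a verbatim copy of the PR; axes_dim and theta are assumed to come from the Flux transformer config):

```python
import torch
import torch.nn as nn
from diffusers.models.embeddings import get_1d_rotary_pos_embed


class FluxPosEmbedSketch(nn.Module):
    def __init__(self, theta: int, axes_dim: list):
        super().__init__()
        self.theta = theta
        self.axes_dim = axes_dim

    def forward(self, ids: torch.Tensor):
        # ids: 2d positional ids of shape (seq_len, n_axes), no batch dimension
        n_axes = ids.shape[-1]
        cos_out, sin_out = [], []
        pos = ids.squeeze().float().cpu().numpy()
        is_mps = ids.device.type == "mps"
        # float64 is not supported on mps, so downcast the frequencies there (see #9133)
        freqs_dtype = torch.float32 if is_mps else torch.float64
        for i in range(n_axes):
            cos, sin = get_1d_rotary_pos_embed(
                self.axes_dim[i],
                pos[:, i],
                theta=self.theta,
                repeat_interleave_real=True,
                use_real=True,
                freqs_dtype=freqs_dtype,
            )
            cos_out.append(cos)
            sin_out.append(sin)
        freqs_cos = torch.cat(cos_out, dim=-1).to(ids.device)
        freqs_sin = torch.cat(sin_out, dim=-1).to(ids.device)
        return freqs_cos, freqs_sin
```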


Aye, thanks!

```python
        unscale_lora_layers(self.text_encoder_2, lora_scale)

    dtype = self.text_encoder.dtype if self.text_encoder is not None else self.transformer.dtype
    text_ids = torch.zeros(batch_size, prompt_embeds.shape[1], 3).to(device=device, dtype=dtype)
```

Nice 👍🏽

@DN6 DN6 left a comment


LGTM 👍🏽

@yiyixuxu yiyixuxu merged commit c291617 into main Aug 21, 2024
@yiyixuxu yiyixuxu deleted the flux-followup branch August 21, 2024 18:45
@yiyixuxu yiyixuxu mentioned this pull request Aug 21, 2024
sayakpaul added a commit that referenced this pull request Dec 23, 2024
* refactor rotary embeds

* adding jsmidt as co-author of this PR for #9133

---------

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Joseph Smidt <josephsmidt@gmail.com>