You can use the online workflow below. If the results are good, you can then try setting it up and running it locally.
👉 Click Here For Online WorkFlow
As a fan benefit, you’ll receive 1,000 points upon registration, plus 100 points for daily logins. Enjoy a smooth experience with an RTX 4090 and 48GB of VRAM.
Model Description
This is the upgraded version of Kontext Reality Transform, trained on the Qwen Image Edit model, with better results than the original Kontext version! In this version, you can also change the scene of the original image by switching prompts. See the recommended prompts for details. And don't forget that Qwen Image supports Chinese characters.
Recommended Settings
Sampler: Euler
LoRA Weight: 1
CFG: 1.0 (using the Qwen-Image-Lightning-8steps-V1.1 acceleration LoRA at weight 1.0)
Steps: 8
A high-resolution upscale pass is recommended: 1.5× scale, denoising 0.4, Beta scheduler, 8 steps, CFG 1.0, using the Qwen-Image-Lightning-8steps-V1.1 acceleration LoRA at weight 0.5.
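The two passes above can be written down as plain data, which makes them easier to copy into whatever frontend you use. This is only a sketch: the dictionary keys, the `upscaled_size` helper, and the rounding to multiples of 8 are assumptions for illustration, not part of any ComfyUI API or of the original workflow.

```python
# Recommended settings from this page, expressed as plain data.
# Key names are hypothetical; map them onto your own workflow nodes.
BASE_PASS = {
    "sampler": "euler",
    "steps": 8,
    "cfg": 1.0,
    "lora_weight": 1.0,            # this LoRA
    "lightning_lora_weight": 1.0,  # Qwen-Image-Lightning-8steps-V1.1
}

HIRES_PASS = {
    "scheduler": "beta",
    "steps": 8,
    "cfg": 1.0,
    "denoise": 0.4,
    "scale": 1.5,
    "lightning_lora_weight": 0.5,  # Qwen-Image-Lightning-8steps-V1.1
}


def upscaled_size(width: int, height: int, scale: float = 1.5,
                  multiple: int = 8) -> tuple[int, int]:
    """Scale a resolution and round each side down to a multiple of `multiple`.

    Rounding to a multiple of 8 is an assumption (a common latent-size
    constraint), not something this page specifies.
    """
    def snap(value: int) -> int:
        return max(multiple, int(value * scale) // multiple * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    # Target size for the upscale pass, given a 1024x1024 first-pass image.
    print(upscaled_size(1024, 1024, HIRES_PASS["scale"]))
```

For a 1024×1024 first-pass image, the 1.5× pass targets 1536×1536. Sizes that do not scale to a multiple of 8 are rounded down.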
Recommended Prompt
Trigger Word: transform into realistic photography
Other Prompts:
transform into realistic photography, The background is set next to an everyday shopping street.
transform into realistic photography, change the background to a Japanese anime comiket convention, with a sparse crowd of Japanese otaku and various Japanese male and female cosplayers walking in various directions in the background. They are wearing various anime costumes from different manga and anime, they are unaware of the viewer, the lighting is natural indoor lighting without flashlight
(Use your imagination)
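The prompts above all follow one pattern: the trigger phrase first, then an optional clause describing the new scene. A minimal sketch of that pattern, where `build_prompt` is a hypothetical helper name and not part of any tool mentioned on this page:

```python
TRIGGER = "transform into realistic photography"


def build_prompt(scene: str = "") -> str:
    """Return the trigger alone, or the trigger followed by a scene clause."""
    scene = scene.strip()
    if not scene:
        # Trigger only: no scene change is requested.
        return TRIGGER
    return f"{TRIGGER}, {scene}"


if __name__ == "__main__":
    print(build_prompt())
    print(build_prompt("The background is set next to an everyday shopping street."))
```

Calling `build_prompt()` with no argument yields just the trigger phrase. Passing a scene description appends it after a comma, matching the example prompts.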
Comments (40)
Great result. I'm guessing it mostly changes the location to a convention because that's what the source photos had? I wonder how hard it would be to change that result with data descriptions.
Actually, the answer is no. The background can be changed to a comic convention scene only because of the way the prompt is written. For other types of background replacement, you can give it a try.
Just wow. Many thanks for this!
Glad you like it :D
What do you use to train it? I have so many flux kontext dev lora datasets I want to migrate over
AItoolkit already supports training with Qwen Edit.
Phenomenal results 👏
It manages cases of unrealistic perspectives, shapes, hairstyles, etc. surprisingly well where FluxKontext fails.
My only critique would be the LoRA's tendency to default to asian characters.
Thank you very much for creating this! 🙌
Thank you, I really appreciate your feedback!
I'm not sure if it's an illusion, but all the characters have similar facial features, and they all look like the same person.
It’s not an illusion. The faces used in the training dataset share similar facial traits, but are not exactly the same.
Hello, which trainer did you use to train the Qwen-Edit LoRA model?
aitoolkit. You can try others too; the training results are about the same.
How do I save an image with a "was-became" (before/after) transition animation?
My before-and-after effect was generated using another ComfyUI workflow, but this kind of effect can be easily achieved with any video editing software.
@aldniki217 I would really love to see that workflow!
Why does it generate random people in my background? How do I make sure that there is no one in the background? Is this configured in the "Text multiline" node?
Simply use “transform into realistic photography” as the entire prompt if you don’t want to change the background
@aldniki217 ok, thanks
Dunno if it's only me, but it seems like the latest Qwen-Image-Edit 2509 update has broken your amazing LoRA T____T
Happened to me too.
It's not broken, and 2509 made this LoRA even better. Check your workflow.
@Luxaria Could you share your workflow?
It doesn't work with 2509.
I've done several tests and it doesn't work with the new version.
I've gone back to the old one just for this Lora, which does a better job than Qwen Edit alone (any version), which is too tedious.
I'm just adding the 8-step Lora (from the old model, of course).
This is absolutely brilliant! Thank you so, so much. I tried it out and I'm incredibly impressed and happy with the results. Seriously, thank you! 🙏
This LoRA is AWESOME!
request update for 2509 pwease!
Same here
I came here today looking for the 2509 version. This is a great lora.
I just added a review comparing results. I used it in Qwen Edit Plus (2509) and liked how well it worked, though it had some issues. Impressive considering I didn't input anime, but American comics instead. Tip: Combine this with the Samsung lora at 1 or .9 strength:
https://civitai.com/models/1551668/samsungcam-ultrareal
Or the Lenovo lora:
https://civitai.com/models/1662740/lenovo-ultrareal
and the results are boosted even more. It still has trouble converting drawn background objects, but if I describe the whole scene, and emphasize a lot that it's a photo, this lora does a great job. Without the lora, it just recreates the scene I describe vaguely, but doesn't line up with the original image. Here's an example prompt I had to use, to give an idea of how descriptive I had to get, and even with this, it still treats many background objects as 2D art, but with photograph coloring/shading, so it doesn't read as much as 2D art:
Positive:
transform into realistic photography, and remove the word bubbles. It will be a real photo of a woman standing in a city. The woman is wearing a leather jacket over a tied shirt exposing her midriff. She has her hands in her jacket pockets. She is a little angry, with a closed mouth, and is giving a stern look while glancing to the right. Her hair is in a ponytail and is blowing in the wind. She has a short buckskin skirt and metal belt. She's wearing shiny snakeskin boots, which are reflecting the light from the diner around their edges.
She is in front of a 40-year-old man who has a toothpick in his mouth, leaning back against something off-screen, and is smirking with a closed mouth while leering at her. Behind him is an 18-wheeler truck with a partially covered sign that says "HAUL ASS", but only the letters "HA AS" are visible. The truck is blue, with realistic textures from having driven long distances.
Also behind them, some people are reaching their arms out to greet each other happily. They are in front of a diner called "Truk Stop Restaurant and Grille", which has a well-lit interior, and a realistic bright sign. Above the diner is a highway bridge, with a street lamp, and trucks/cars driving over the bridge at night. The photo will be in the style of a professional photograph, capturing the detailed and gritty realism of the city. All the characters will be real people, and not drawings. It will look like a real professional photo by a famous photographer.
Negative (not necessary, but just in case):
line art, drawings, 2D, comic art, illustration, blurry, low quality, low resolution, bad anatomy, deformed, distorted, extra limbs, missing limbs, extra fingers, fused fingers, text, watermark, logo, artifacts, overexposed, underexposed, grainy
Does this LoRA have an Asian bias? Or can it do other races too, like Caucasian, etc.? Asking because all of the examples show an Asian bias, which wouldn't be particularly useful for me. If so, do you have any particular prompt advice for achieving it? Either way, cool LoRA.
Ah, gotta love racists downvoting.
Hey, any kontext version for this?
Incredible work, but with a fatal flaw that needs fixing.
The eyes are always looking at the camera.
Even if you give it an already perfect 3D image to enhance, it still resets the eyes to the camera for no reason at all, even at a CFG value of 0.1.
There is a similar problem with the mouth, but at least that's not as important.
I second the request for a 2509 update. This is a great lora
I guess he is not interested in updating it for newer Qwen models, but the current version is awesome.
Omg this is the absolute BEST transformation-to-realism LoRA I have tested so far!! Better than any other ones currently out there, in my opinion. It took me a bit to realize that CFG = 4 works even with the Qwen Lightning LoRA on, and I found it works even a little bit better (shadows and details) when using the new “Unchained” NSFW LoRA with it as well. Keep CFG at 4 and 9 steps with basic euler and simple 👌 Great work on this LoRA!!
i know everyone loves their low fidelity fast gen time lightning but yo, make a non-lightning version!
Details
Files
aldniki_qwen_reality_transform_v01.safetensors
Mirrors
aldniki_qwen_reality_transform_v01.safetensors
QWEN_RealityTransform.safetensors
Qwen-transform into realistic photography.safetensors
qwen_reality_transform_v01.safetensors