CivArchive
    X-Ray Vision (see through anything) - v1.1 ZIT
    NSFW
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined
    Preview undefined

    Version 1 series are incremental. Major versions changes are their own entities in this sequence. For instance, v1 was an invisible device, a vortex of sorts and v2 is through a hand-held cell phone. V2 is unlikely to produce any of the specific content available in v1 series.

    The v2 download file includes a 3, which is the major training rendition, and the reason this doesn't match the version is because the training rendition 2 was useless, it produced nothing usable after 20k steps. I had to manually generate new images, again, that were not too disparate from each other in order for it to converge. These manually created images took me 3 days to compose and was a serious PITA, using Illustrious, Z-Image, Qwen edit and Flux 2 Klein in order to help me composite something useful.
    -
    - V1 -

    This LoRA is intended to produce images taken through a device, we're calling it X-Ray, where part of the clothing is missing in order to see undergarments and nudity.

    Some of the trained terms that were used, though they probably don't matter with current architecture..

    X-Ray boobs
    X-Ray bra
    X-Ray panties
    X-Ray pelvis
    X-Ray underwear
    X-Ray nude
    Clothed
    Nude

    I sent it through the trainer up to around 8k steps and for a couple of hours I tried to generate consistent images with some difficult scenarios but the prompts were just not magical enough. I sent it back through for almost 20k steps and, after some testing, I chose around 12k as the compromise between sledge hammer and getting something useful. I know I could have had an easier time, and wasted LESS time, had I just generated more training images but the process wasn't so easy, here's what I did...

    I created some nudes, half with Illustrious and half with Z-Image-Turbo.

    I then brought it over to Qwen Image Edit 2509 and prompted to add clothing, both outer and inner (underwear).

    I took that into an image editor, lately I've gone back to Gimp, getting away from PS, Affinity wars, and just put the dressed versions on top, masking out a this and a that, exposing different sections, saving images of those manipulations.

    I originally intended to make an X-Ray vision person where the eye beams would penetrate the clothing but I hung onto that idea for 2 days and didn't get anywhere, my brain just didn't want to get that complicated, so I just gave in and took a more simpler approach.

    I kept the clothed and nudes in the training data, captioned them all with simple terms and then let JOY-C fiddle with it to give the context some weight.

    Using OneTrainer: I guess the most important factors are defaults, rank 32, alpha 1, 2 repeats and batch size of 1.

    - V2 -

    I grabbed some stock photo items for the cell phone and arms, had Qwen edit extend the arm and add sleeves, I added 3 different sleeves. I used those with the other composited images. It took 3 days to amass 4 slightly different concepts to train with, only 3 of the concepts are repeatable without issues, the shower scene being the most prominent, followed by the hotel window, cruise ship (was kind of a throw away) and then the public restroom scene (which seems to be a complete disaster but can make some funny pictures).

    After adjusting the dataset and running it from 19k to 33k all I got was garbage so I released the less damaged version, which is still rather limited and I'll be trying for some other devices and situations in the future, frankly I'm running out of situational ideas.

    I had to "strong arm" the cell phone into working which prevents the LoRA from producing usable images in any situation beyond its known training set. The most reproducible scenarios is in prominence order...

    Shower Scene - triggers: shower, silhouette, exposed, nude, shower curtain
    Hotel Room - triggers: hotel room, window, on bed, panties, bra, nude
    Cruise Ship - triggers: (mentioning items inside the ship will usually show up in the cell phone)
    Diner - triggers: waitress, nude, breasts, waitress uniform
    Public Restroom - (trained with multiple phones) triggers: 3 men, stall, public restroom

    Conclusion:

    The initial attempt at this concept, applied in v1.0 and v1.1, allowed for the ability to prompt in scenarios that were not offered in the initial dataset. For instance, the pool scenario was completely fabricated after the fact, and attempting to do this with v2.0 fails, with anything I tried, and it attempts to combine trained in material together in order to achieve the goal.

    I can understand that the model is attempting to use the most available material to achieve the goal, which would suggest that I need less steps or a lower weight, but this obvious solution doesn't seem to work as expected with ZIT, though it won't stop me from trying.

    I like the series and I'll add another twist besides the vortex and cell phone, maybe glasses as was suggested in the comments, though I'm very welcome to ideas so please don't hesitate.

    Description

    I sat on v1.1 for awhile, though I did test it during the 1.0 release, and I'm starting to think that this is actually a lot more responsive.

    So, I give you Sledge Hammer, at step 19799.

    FAQ

    Comments (12)

    Aa6972Jan 4, 2026· 1 reaction
    CivitAI

    i like this idea!

    Tarterbox
    Author
    Jan 4, 2026

    \o/

    MasmalkarJan 5, 2026· 4 reactions
    CivitAI

    Looks quite good mang, maybe one of those x-ray phone/glasses? Is just a bit of fluff but it can make things more interesting

    Tarterbox
    Author
    Jan 5, 2026· 2 reactions

    Yea, definitely could use some augmentation and I originally started that way, using a subject to project beams from their eyes to the target, to see how that would turn out but it became a complex process so I chose something lighter since I didn't even think this would work.

    Now that I know it works I can test other ideas.

    Cheers \o

    HotRock_SiJan 20, 2026

    I've been trying for ages to try and make an image like this just using prompts in ZIT for someone holding up a pair of glasses and the area of the lens acts like this x-ray with very limited results. This is a great step forward, I hope you expand upon it

    Tarterbox
    Author
    Jan 20, 2026· 1 reaction

    @HotRock_Si Making the dataset took 3 days of my free time, I had to compose them manually those look the best actually. I used a combination of models, stock photo and edit models to produce a data set of 24 clean images.

    HotRock_SiJan 20, 2026· 1 reaction

    @Tarterbox thanks for taking the time to describe how you approached this, not something I see very often, so its great to get an insight into this process.

    Tarterbox
    Author
    Jan 20, 2026

    @HotRock_Si You're welcome, I appreciate your feedback.

    wyxzddsjj919Jan 21, 2026

    I'm a complete novice, so could a magnifying glass meant for older people also work? 😁

    Tarterbox
    Author
    Jan 21, 2026

    @wyxzddsjj919 
    I hadn't thought of that, that's definitely worth trying. I'll give it a shot on the next run. I imagine it will be a bit more complex because the model will have to learn a zoom effect along with visual isolation so that's going to be interesting.

    Right now I'm setting up images for eye glasses, actually it's more like goggles, and I've run into a hiccup where I'm not sure what the visual reference aught to be but I'm making material while I consider the idea.

    alcaitiffJan 6, 2026· 2 reactions
    CivitAI

    Insanely good! You are the best!

    Tarterbox
    Author
    Jan 6, 2026

    Kind of you, thanks \o

    LORA
    ZImageTurbo

    Details

    Downloads
    401
    Platform
    CivitAI
    Platform Status
    Available
    Created
    1/4/2026
    Updated
    5/16/2026
    Deleted
    -