Did this for a bounty. Not sure why the bottom is cut off but it mostly does what it's supposed to do. The image ratio you use matters a lot. 512x568 and 576x640 seem to work pretty well for getting most of the text(the input images were 512x551) but there might be a better ratio that I didn't find.(512x551 didn't work as well). You can prompt for the element of the card and the supported elements are fire, neutral, grass, water, elect.
Example Prompt: mmbn, <lora:MMBN_chipsV3-000006:0.99>fire, (no_humans:1.2)
Description
Initial Version



