RT-DETR-v2 r50vd model fine-tuned on about 11k manga, webtoon, manhua and Western comic style images for text and speech bubble detection.
Training image size: 640. Training images were resized to this size, not cropped.
Tall webtoons were split vertically into shorter images before resizing.
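
Since tall webtoons were split vertically for training, it may help to do something similar at inference time so that a very long strip is not squashed into a 640 input. The card does not say how the split was done, so the following is only a minimal sketch: the function name `split_tall_page`, the tile height and the overlap are all assumptions, not the author's method.

```python
def split_tall_page(height, tile_height, overlap_px=0):
    """Return (top, bottom) row ranges that cover a page of `height` pixels.

    Hypothetical helper: tile_height and overlap_px are illustrative
    choices, not values taken from the model's training setup.
    """
    if tile_height <= 0:
        raise ValueError("tile_height must be positive")
    if not 0 <= overlap_px < tile_height:
        raise ValueError("overlap_px must be in [0, tile_height)")

    # Short pages are returned as a single tile.
    if height <= tile_height:
        return [(0, height)]

    step = tile_height - overlap_px
    tiles = []
    top = 0
    while top + tile_height < height:
        tiles.append((top, top + tile_height))
        top += step
    # Last tile is aligned to the bottom edge so every tile has full height.
    tiles.append((height - tile_height, height))
    return tiles


def shift_box_to_page(box, tile_top):
    """Map an (x1, y1, x2, y2) box from tile coordinates to page coordinates."""
    x1, y1, x2, y2 = box
    return (x1, y1 + tile_top, x2, y2 + tile_top)
```

Each tile can then be cropped, run through the detector, and its boxes shifted back with `shift_box_to_page`. Overlapping tiles can produce duplicate detections near the seams, so some de-duplication step would likely be needed afterwards.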
Classes are:
0: bubble
1: text_bubble (text inside bubbles)
2: text_free (text outside bubbles)
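
As an illustration of how the class ids above might be used downstream, here is a small sketch that groups detections by class name. The detection format `(label_id, score, box)`, the helper name `group_by_class` and the 0.5 score threshold are assumptions for this example; only the id-to-name mapping comes from the list above.

```python
# Class ids as listed in this model card.
ID2LABEL = {0: "bubble", 1: "text_bubble", 2: "text_free"}


def group_by_class(detections, min_score=0.5):
    """Group boxes by class name, dropping low-confidence detections.

    `detections` is assumed to be an iterable of (label_id, score, box)
    tuples; this layout and the default threshold are hypothetical.
    """
    grouped = {name: [] for name in ID2LABEL.values()}
    for label_id, score, box in detections:
        if score >= min_score:
            grouped[ID2LABEL[label_id]].append(box)
    return grouped


if __name__ == "__main__":
    sample = [
        (0, 0.92, (10, 10, 200, 150)),   # speech bubble
        (1, 0.88, (30, 30, 180, 120)),   # text inside that bubble
        (2, 0.31, (300, 400, 420, 440)), # weak free-text hit, filtered out
    ]
    print(group_by_class(sample))
```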

Model size: 42.9M params (Safetensors, tensor type F32)