Update README.md
README.md
CHANGED
@@ -19,6 +19,10 @@ tags:
 
 > **Formula-Text-Detection** is a vision-language encoder model fine-tuned from **google/siglip2-base-patch16-224** for **binary image classification**. It is built using the **SiglipForImageClassification** architecture to distinguish between **mathematical formulas** and **natural text** in document or image regions.
 
+> [!Note]
+> Note: This model works best with plain text or formulas using the same font style
+
+
 ```py
 Classification Report:
 precision recall f1-score support
@@ -100,6 +104,22 @@ iface = gr.Interface(
 if __name__ == "__main__":
     iface.launch()
 ```
+## **Demo Inference**
+
+> [!Important]
+> Text
+
+
+![Screenshot 2025-05-19 at 13-09-04 Formula Text Detection.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/9AJ95r-jsoksYYA_Rsw8O.png)
+![Screenshot 2025-05-19 at 13-10-30 Formula Text Detection.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/mHtb1ubePKTaZ1njq_X7E.png)
+![Screenshot 2025-05-19 at 13-12-45 Formula Text Detection.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/LeJmHtWJiZfxxOLU7n8ny.png)
+
+> [!Important]
+> Formula
+
+![Screenshot 2025-05-19 at 13-07-41 Formula Text Detection.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/GYcEyzXoSGaYTUR8DuLrE.png)
+![Screenshot 2025-05-19 at 13-07-53 Formula Text Detection.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/eUgFoQNwsJ3OuwC1R_z2T.png)
+![Screenshot 2025-05-19 at 13-08-20 Formula Text Detection.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/gHlrm8FMCKOpHzSDgbJxZ.png)
 
 ---
 