facebook
/

nougat-base

vision-encoder-decoder

image-text-to-text

Model card Files Files and versions

nielsr HF Staff commited on Sep 21, 2023

Commit

68066c9

·

1 Parent(s): b214ed7

Update README.md (#3)

- Update README.md (a02a49f522ddc8737f812c3523cffbb601d63e9a)

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -2,6 +2,7 @@
 license: apache-2.0
 tags:
 - vision
 pipeline_tag: image-to-text
 ---
@@ -11,6 +12,8 @@ Nougat model trained on PDF-to-markdown. It was introduced in the paper [Nougat:
 Disclaimer: The team releasing Nougat did not write a model card for this model so this model card has been written by the Hugging Face team.
 ## Model description
 Nougat is a [Donut](https://huggingface.co/docs/transformers/model_doc/donut) model trained to transcribe scientific PDFs into an easy-to-use markdown format. The model consists of a Swin Transformer as vision encoder, and an mBART model as text decoder.

 license: apache-2.0
 tags:
 - vision
+- nougat
 pipeline_tag: image-to-text
 ---
 Disclaimer: The team releasing Nougat did not write a model card for this model so this model card has been written by the Hugging Face team.
+Note: this model corresponds to the "0.1.0-base" version of the original repository.
 ## Model description
 Nougat is a [Donut](https://huggingface.co/docs/transformers/model_doc/donut) model trained to transcribe scientific PDFs into an easy-to-use markdown format. The model consists of a Swin Transformer as vision encoder, and an mBART model as text decoder.