ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-I-SMAT


AdamF92 committed 19 days ago

Commit ded4e5b · 1 Parent(s): faf699f

Epoch 0 - Val loss 1.1532

Files changed (2)

README.md +12 -0
config.json +21 -0

README.md ADDED

	@@ -0,0 +1,12 @@

+---
+license: apache-2.0
+pipeline_tag: text-generation
+tags:
+- model_hub_mixin
+- pytorch_model_hub_mixin
+---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Code: [More Information Needed]
+- Paper: [More Information Needed]
+- Docs: [More Information Needed]

config.json ADDED

	@@ -0,0 +1,21 @@

+{
+  "att_groups": 4,
+  "att_heads": 8,
+  "att_query_groups": 4,
+  "cross_att_type": "sqa",
+  "embed_dim": 128,
+  "ff_dim": 256,
+  "ff_dropout": 0.1,
+  "init_identity_norm": true,
+  "moe_top_k": 4,
+  "num_experts": 20,
+  "num_layers": 10,
+  "self_att_type": "sqa",
+  "seq_len": 256,
+  "stm_size": 256,
+  "use_flash_attention": false,
+  "use_gated": true,
+  "use_head_norm": true,
+  "use_moe": true,
+  "vocab_size": 7500
+}
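
The config above can be sanity-checked with a few derived quantities. The following is a minimal sketch using only the Python standard library; the values are copied from the config.json in this commit, but the helper name `summarize` and the interpretation of the fields are assumptions, not taken from the model's code. In particular, it assumes `att_heads` splits `embed_dim` evenly into per-head slices, that `att_groups` partitions the heads into key/value groups, and that `moe_top_k` of `num_experts` experts are routed per token.

```python
import json

# Values copied verbatim from the config.json added in this commit.
CONFIG_JSON = """
{
  "att_groups": 4,
  "att_heads": 8,
  "att_query_groups": 4,
  "cross_att_type": "sqa",
  "embed_dim": 128,
  "ff_dim": 256,
  "ff_dropout": 0.1,
  "init_identity_norm": true,
  "moe_top_k": 4,
  "num_experts": 20,
  "num_layers": 10,
  "self_att_type": "sqa",
  "seq_len": 256,
  "stm_size": 256,
  "use_flash_attention": false,
  "use_gated": true,
  "use_head_norm": true,
  "use_moe": true,
  "vocab_size": 7500
}
"""


def summarize(config: dict) -> dict:
    """Derive a few quantities from the config (hypothetical helper).

    The field semantics are assumed from the field names only.
    """
    embed_dim = config["embed_dim"]
    heads = config["att_heads"]
    groups = config["att_groups"]

    # Assumption: the embedding is split evenly across attention heads.
    if embed_dim % heads != 0:
        raise ValueError("embed_dim must be divisible by att_heads")
    # Assumption: heads are partitioned evenly into key/value groups.
    if heads % groups != 0:
        raise ValueError("att_heads must be divisible by att_groups")

    summary = {
        "head_dim": embed_dim // heads,
        "heads_per_group": heads // groups,
        "ff_expansion": config["ff_dim"] / embed_dim,
    }
    if config.get("use_moe"):
        # Assumption: moe_top_k experts out of num_experts run per token.
        summary["active_expert_fraction"] = (
            config["moe_top_k"] / config["num_experts"]
        )
    return summary


config = json.loads(CONFIG_JSON)
summary = summarize(config)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Under these assumptions each head works on a 16-dimensional slice, the feed-forward layer expands the embedding by a factor of 2, and one fifth of the experts are active per token.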