DurstewitzLab
/

dynamix-3d

Time Series Forecasting

Model card Files Files and versions

Dschobby commited on about 22 hours ago

Commit

a81d305

·

verified ·

1 Parent(s): 9f962e0

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -29,7 +29,7 @@ DynaMix is based on a sparse mixture of experts (MoE) architecture operating in
 By aggregating the expert weighting with the expert prediction the next state is predicted. The current model has the following specifics:
-- **M (Latent state dimension):** 10
 - **N (Observation space dimension):** 3
 - **Experts:** 10 expert networks in the mixture
 - **Expert type:** `"almost_linear_rnn"` — a compact recurrent model combining linear and nonlinear components (`P=2` ReLU units)
@@ -50,7 +50,7 @@ model = DynaMix(M=M, N=N, Experts=EXPERTS, expert_type=EXPERT_TYPE, P=P)
 # Load model weights
 model_path = hf_hub_download(
     repo_id="DurstewitzLab/dynamix-3d",
-    filename="dynamix-3d-small-v1.0.safetensors"
 )
 model_state_dict = load_file(model_path)
 model.load_state_dict(model_state_dict)

 By aggregating the expert weighting with the expert prediction the next state is predicted. The current model has the following specifics:
+- **M (Latent state dimension):** 30
 - **N (Observation space dimension):** 3
 - **Experts:** 10 expert networks in the mixture
 - **Expert type:** `"almost_linear_rnn"` — a compact recurrent model combining linear and nonlinear components (`P=2` ReLU units)
 # Load model weights
 model_path = hf_hub_download(
     repo_id="DurstewitzLab/dynamix-3d",
+    filename="dynamix-3d-base-v1.0.safetensors"
 )
 model_state_dict = load_file(model_path)
 model.load_state_dict(model_state_dict)