Besides updates to our 14B and 70B, we have a new LFM2-based 1.2B, a Llama 3.2-based 3B, and a Qwen 3-based 8B, all with class-leading Japanese language capabilities.
Per usual, lots of details in the Model Cards for those interested.
I wanted to call attention to ArliAI's success in applying my recent modifications to refusal ablation to a MoE model. Nice work, @OwenArli!
ArliAI/GLM-4.5-Air-Derestricted
Ablation on a MoE model is no small thing; I expect that preserving norms/magnitudes during the intervention respects routing better than naive refusal ablation does.
(I would have tagged their org earlier, but tagging via "@" seemed to be broken.)
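For intuition on the norms/magnitudes point, here's a minimal sketch of one way norm preservation can be folded into refusal ablation on a single weight matrix. The function name, the precomputed `refusal_dir`, and the per-row rescaling are my illustrative assumptions here, not necessarily the exact released implementation:

```python
import torch

def ablate_preserving_norms(W: torch.Tensor, refusal_dir: torch.Tensor) -> torch.Tensor:
    # W: (d_out, d_in) weight matrix; refusal_dir: (d_out,) direction,
    # assumed precomputed from activation differences (illustrative only).
    r = refusal_dir / refusal_dir.norm()              # unit refusal direction
    orig = W.norm(dim=1, keepdim=True)                # per-row L2 norms before intervention
    W_abl = W - torch.outer(r, r @ W)                 # project r out of the output space
    new = W_abl.norm(dim=1, keepdim=True).clamp_min(1e-12)
    return W_abl * (orig / new)                       # restore per-row magnitudes
```

The intuition: a MoE router makes decisions based on hidden-state geometry, so an intervention that shrinks row magnitudes (as naive projection does) can shift routing in ways unrelated to refusal; restoring the original norms keeps that geometry closer to intact.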
Implemented a proof-of-concept sampler in pure PyTorch and transformers.
Max P is a dynamic token filter that applies Winsorization to cap the probabilities of top tokens. Specifically, a base probability in the range [0,1] caps each individual token's probability; the sampler then redistributes the excess mass proportionally across the remaining tokens.
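For the curious, a minimal sketch of the idea as described above (illustrative only; the single-pass redistribution can, in edge cases, nudge an absorbing token slightly past the cap, which an iterative version would fix):

```python
import torch

def max_p_filter(logits: torch.Tensor, max_p: float = 0.2) -> torch.Tensor:
    """Winsorize the token distribution: cap each probability at max_p,
    then redistribute the removed mass proportionally among uncapped tokens."""
    probs = torch.softmax(logits, dim=-1)
    capped = probs.clamp(max=max_p)                          # cap the top tokens
    excess = (probs - capped).sum(dim=-1, keepdim=True)      # mass shaved off the top
    absorbers = (capped < max_p).float() * capped            # tokens that can take the excess
    denom = absorbers.sum(dim=-1, keepdim=True).clamp_min(1e-12)
    return capped + excess * absorbers / denom               # proportional redistribution

# Usage: filter the distribution, then sample from it.
logits = torch.randn(1, 32000)
token = torch.multinomial(max_p_filter(logits, max_p=0.3), num_samples=1)
```

The effect is to flatten overconfident peaks without touching the tail, which is roughly the opposite lever from min-p-style tail truncation.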