taufeeque/mbpp-hardcode
Viewer
• Updated
• 974 • 1.39k
Obfuscated Policy, Obfuscated Activations, Blatant Deception, and Honest models trained in the Obfuscation Atlas paper
Note Dataset used for probe evaluation and RL training.
Totally Free + Zero Barriers + No Login Required