arxiv:2405.20267
Ruochen Zhao
ruochenzhao
AI & ML interests
NLP interpretability
Recent Activity
upvoted a paper about 2 months ago
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Totally Free + Zero Barriers + No Login Required