Ruochen Zhao
ruochenzhao
AI & ML interests
NLP interpretability
Recent Activity
upvoted
a
paper
3 days ago
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits