Gonçalo Paulo

MrGonao

AI & ML interests

Interpretability

Recent Activity

updated a collection about 2 months ago
Replicating emergent misalignment
updated a model about 2 months ago
MrGonao/edu_incorrect_subtle_reformatted_2
published a model about 2 months ago
MrGonao/edu_incorrect_subtle_reformatted_2

Organizations

EleutherAI, Sapienza University of Rome, delphi