Gonçalo Paulo
MrGonao
AI & ML interests
Interpretability
Recent Activity
updated a collection about 2 months ago
Replicating emergent misalignment
updated a model about 2 months ago
MrGonao/edu_incorrect_subtle_reformatted_2
published a model about 2 months ago
MrGonao/edu_incorrect_subtle_reformatted_2