Joseph Tang's picture

Joseph Tang

lilvjosephtang

·

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

Layer6/RankJudge

published a dataset 3 days ago

Layer6/RankJudge

authored a paper 11 days ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

View all activity

Organizations

Papers 6

arxiv:2604.18519

arxiv:2605.02913

arxiv:2604.01591

arxiv:2510.23948

models 0

None public yet

datasets 1

lilvjosephtang/SEAM-Benchmark

Viewer • Updated Sep 2, 2025 • 3.2k • 340 • 8