·
AI & ML interests
None yet
Organizations
view article Phare LLM benchmark V2: Reasoning models don't guarantee better security
davidberenstein1957
• • 10
view article Giskard Bot: Identifying robustness, performance and ethical vulnerabilities in the Top 10 Most Popular Hugging Face Models
view article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs
davidberenstein1957
• • 16
upvoted an article about 1 year ago view article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs
davidberenstein1957
• • 42
upvoted a paper about 1 year ago