5

JM

JMJM

AI & ML interests

None yet

upvoted an article 5 months ago

Article

Phare LLM benchmark V2: Reasoning models don't guarantee better security

davidberenstein1957 • Dec 16, 2025 • 10

upvoted an article 8 months ago

Article

Giskard Bot: Identifying robustness, performance and ethical vulnerabilities in the Top 10 Most Popular Hugging Face Models

JMJM • Mar 21, 2024 • 2

upvoted an article 10 months ago

Article

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

davidberenstein1957 • Jul 2, 2025 • 16

upvoted an article about 1 year ago

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

davidberenstein1957 • May 7, 2025 • 42

upvoted a paper about 1 year ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14, 2025 • 10