2 257 96

Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

Recent Activity

upvoted a paper about 3 hours ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

upvoted a paper about 7 hours ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

upvoted a paper about 7 hours ago

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

View all activity

Organizations

upvoted a paper about 3 hours ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 4 days ago • 169

upvoted 3 papers about 7 hours ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 61

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published Jan 11 • 82

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 38

upvoted an article about 8 hours ago

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

1 day ago

• 35

upvoted a collection 8 days ago

WebWorld

Collection

4 items • Updated 15 days ago • 8

upvoted a paper 13 days ago

NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized Research Automation

Paper • 2605.10813 • Published 15 days ago • 16

liked a model about 1 month ago

openai/privacy-filter

Token Classification • 1B • Updated Apr 22 • 306k • 1.5k

liked a dataset about 1 month ago

NJU-LINK/DR3-Eval

Viewer • Updated Apr 20 • 100 • 2.45k • 2

upvoted a collection about 1 month ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 18 items • Updated 7 days ago • 296

upvoted 3 papers about 1 month ago

upvoted an article about 1 month ago

Article

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

nvidia

•

Mar 13

• 40

liked 2 datasets about 1 month ago

perplexity-ai/draco

Viewer • Updated Feb 20 • 100 • 372 • 91

mercor/apex-agents

Benchmark • Updated Mar 3 • 480 • 21.7k • 125

upvoted 4 papers about 2 months ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 355

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 426

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 203

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351