Delete gen_outputs/Qwen2.5-3B-Instruct/value_pretrain eeaa615 verified DatPySci committed 9 days ago
Upload gen_outputs/Qwen2.5-3B-Instruct/value_pretrain/polaris_t1.0_p1.0_n16-MNT3072.jsonl with huggingface_hub e08c95b verified DatPySci committed 9 days ago
Upload gen_outputs/Qwen2.5-3B-Instruct/value_pretrain/polaris_t1.0_p1.0_n32-MNT3072.jsonl with huggingface_hub fbcd1b2 verified DatPySci committed 9 days ago
Upload gen_outputs/Qwen2.5-3B-Instruct-GRPO-AdamW/math_t1.0_p1.0_n32-MNT3072.jsonl with huggingface_hub 4c66a8c verified DatPySci committed 22 days ago
Upload gen_outputs/Qwen2.5-3B-Instruct/math_t1.0_p1.0_n16-MNT3072.jsonl with huggingface_hub 3286f53 verified DatPySci committed 23 days ago