zephyr 7B-sft ---> PPO-7B

Safetensors · Model size: 7B params · Tensor type: F32

Model tree for ewqr2130/7B_ppo_phiRM_2GPU_3e-7step_4000

Quantizations: 1 model