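StableVicuna, released by StabilityAI, the company behind Stable Diffusion, is the first large-scale open-source chatbot trained with reinforcement learning from human feedback (RLHF). It is a further instruction-fine-tuned and RLHF-trained version of Vicuna v0 13b, which is itself an instruction-fine-tuned LLaMA model with 13 billion parameters.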