hai
zhuhai123
AI & ML interests
None yet
Organizations
None yet
zhuhai123's activity
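Seems pretty underwhelming: a 3090 doesn't have enough VRAM to load even a 7B model, and without the acceleration package it recommends, it runs painfully slowly.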
15
#12 opened over 1 year ago by boxter007
Question: to use Multi-Query Attention at inference time, does the model need to have been trained with Multi-Query Attention?
1
#21 opened over 1 year ago by zhuhai123