I think the IFEval score of 38.24 is extremely low for chat capabilities. I didn't expect it to be that low.
That's because this is the base model, not an instruction-tuned model.