Bug Report
#10
by
kxzxvbk
- opened
The loss implementation of condition generation model is wrong. It takes me 3 days to debug!!!
The calculation for loss should be modified as: loss = loss_fct(lm_logits[:, :-1, :].reshape(-1, lm_logits.size(-1)), labels[:, 1:].reshape(-1))