Spaces:

Ahmadzei
/

RAG

Runtime error

File size: 257 Bytes

5fa1a76

RoBERTa improved upon this by introducing a new pretraining recipe that includes training for longer and on larger batches, randomly masking tokens at each epoch instead of just once during preprocessing, and removing the next-sentence prediction objective.