Can RLHF with Preference Optimization Techniques Help LLMs Surpass GPT4-Quality Models? 8 days ago
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31