elichen3051
's Collections
LLM Fundamental papers
updated
Attention Is All You Need
Paper
•
1706.03762
•
Published
•
47
Note
Transformer
Language Models are Few-Shot Learners
Paper
•
2005.14165
•
Published
•
11
Note
GPT-3
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head
Checkpoints
Paper
•
2305.13245
•
Published
•
5
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper
•
2307.09288
•
Published
•
242
Textbooks Are All You Need II: phi-1.5 technical report
Paper
•
2309.05463
•
Published
•
87
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
Phone
Paper
•
2404.14219
•
Published
•
253
Paper
•
2303.08774
•
Published
•
5
Training language models to follow instructions with human feedback
Paper
•
2203.02155
•
Published
•
16
Note
RLHF
Direct Preference Optimization: Your Language Model is Secretly a Reward
Model
Paper
•
2305.18290
•
Published
•
49
Note
DPO
Statistical Rejection Sampling Improves Preference Optimization
Paper
•
2309.06657
•
Published
•
13
Note
Rejection Sampling
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper
•
2104.09864
•
Published
•
10
Note
ROPE