arxiv:2412.05270
Zhenyu Zhang
Kyriection
AI & ML interests
Large Language Models, Efficient Machine Learning, Quantum Computing
Recent Activity
upvoted
a
paper
26 days ago
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and
Post-LN
authored
a paper
about 1 month ago
APOLLO: SGD-like Memory, AdamW-level Performance
upvoted
a
paper
about 1 month ago
APOLLO: SGD-like Memory, AdamW-level Performance
Organizations
None yet
Papers
12
models
None public yet
datasets
None public yet