Papers from the NICS-EFFALG Team - a nics-efc Collection

updated Jun 28

Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 36

DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12, 2024 • 22

ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2406.02540 • Published Jun 4, 2024 • 2

Can LLMs Learn by Teaching? A Preliminary Study

Paper • 2406.14629 • Published Jun 20, 2024 • 17

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Paper • 2406.14909 • Published Jun 21, 2024 • 13