arxiv:2406.04692
Jue Wang
juewang
AI & ML interests
None yet
Organizations
models
12
juewang/deepseek-coder-6.7b-base-trt-int4-g64-hf
Text Generation
•
Updated
•
4
juewang/deepseek-coder-1.3b-base-trt-int4-g64-hf
Text Generation
•
Updated
•
7
juewang/deepseek-coder-1.3b-instruct-trt-int4-g64-hf
Text Generation
•
Updated
•
3
juewang/deepseek-coder-6.7b-instruct-trt-int4-g64-hf
Text Generation
•
Updated
•
4
juewang/deepseek-coder-6.7b-instruct-trt-int8-g64-hf
Text Generation
•
Updated
•
3
juewang/deepseek-coder-6.7b-instruct-trt-int8-g32-hf
Text Generation
•
Updated
•
3
juewang/deepseek-coder-6.7b-instruct-trt-int8-g128-hf
Text Generation
•
Updated
•
6
juewang/Meta-Llama-3-2B-mlp-layer-pruned
Text Generation
•
Updated
•
45
juewang/Meta-Llama-3-4B-mlp-pruned
Text Generation
•
Updated
•
58
juewang/Meta-Llama-3-8B-wo-gqa
Text Generation
•
Updated
•
14