Shangming Cai
Cheshire94
AI & ML interests
None yet
Organizations
None yet
Cheshire94's activity
[READ IF YOU DO NOT HAVE ACCESS] Getting access to the model
36
#130 opened 6 months ago
by
osanseviero
update README.md
1
#12 opened 11 months ago
by
Cheshire94
Update README of branch dev_triton.
2
#11 opened 11 months ago
by
Cheshire94
Add ApplyRoPE and RMSNorm kernels written in OpenAI Triton
1
#10 opened 12 months ago
by
Cheshire94
Does Qwen support 16k context, what is the best config for max_new_tokens?
2
#22 opened over 1 year ago
by
Cheshire94
Error with dtype=torch.float16.
2
#10 opened over 1 year ago
by
Cheshire94
Prompt template
18
#1 opened over 1 year ago
by
monuminu
Prompt template
18
#1 opened over 1 year ago
by
monuminu