Collection of some GPT-4 generated datasets. It may be useful for those looking for the best-quality datasets to train competitive LLMs.
Leon Lee
Leon-Leee
AI & ML interests
LLMs, code generation, chatbot, workflows
Recent Activity
New activity
about 19 hours ago
allenai/Llama-3.1-Tulu-3-70B:Why do you use pass@10 to test coding perfmance...
New activity
1 day ago
OpenCoder-LLM/RefineCode-code-corpus-meta:Further release and details
liked
a dataset
3 days ago
bigcode/the-stack-v2-train-smol-ids
Organizations
Collections
5
models
None public yet