K - a diwank Collection

chat-ui

jondurbin/py-dpo-v0.1

Viewer • Updated Jan 11 • 9.47k • 155 • 46

jondurbin/gutenberg-dpo-v0.1

Viewer • Updated Jan 12 • 918 • 1.51k • 126

jondurbin/cinematika-v0.1

Viewer • Updated Apr 11 • 47.1k • 376 • 53

ParisNeo/lollms_aware_dataset

Viewer • Updated Oct 27, 2023 • 464 • 113 • 5

grimulkan/LimaRP-augmented

Viewer • Updated Jan 24 • 804 • 100 • 29

TIGER-Lab/MathInstruct

Viewer • Updated May 15 • 262k • 2.22k • 249

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 240 • 31

b-mc2/sql-create-context

Viewer • Updated Jan 25 • 78.6k • 3.09k • 417

migtissera/Synthia-v1.3

Viewer • Updated Nov 16, 2023 • 119k • 369 • 99

tinyBenchmarks/tinyMMLU

Viewer • Updated Jul 8 • 385 • 3.5k • 16

tinyBenchmarks/tinyWinogrande

Preview • Updated May 25 • 1.73k • 3

tinyBenchmarks/tinyAI2_arc

Preview • Updated May 25 • 1.33k • 3

tinyBenchmarks/tinyHellaswag

Viewer • Updated May 25 • 50k • 1.75k • 4

tinyBenchmarks/tinyTruthfulQA

Preview • Updated May 25 • 1.26k • 3

tinyBenchmarks/tinyAlpacaEval

Viewer • Updated Apr 19 • 100 • 131 • 4

tinyBenchmarks/tinyGSM8k

Preview • Updated May 25 • 1.4k • 5

cognitivecomputations/samantha-data

Updated Mar 29 • 1.03k • 123

roborovski/synthetic-tool-calls

Viewer • Updated Mar 5 • 6.01k • 38 • 1

roborovski/glaive-tool-usage-dpo

Viewer • Updated Feb 29 • 42k • 37 • 2

kalomaze/StackMix-v0.1

Viewer • Updated Feb 28 • 30 • 44 • 2

roborovski/glaive-function-calling-v2-conversation

Viewer • Updated Feb 19 • 113k • 37 • 2

mlabonne/truthy-dpo-v0.1

Viewer • Updated Feb 18 • 1.02k • 41 • 1

ai4bharat/indic-align

Viewer • Updated Jul 25 • 97.4M • 1.09k • 10

coseal/CodeUltraFeedback_binarized

Viewer • Updated Mar 18 • 9.5k • 48 • 15

coseal/CodeUltraFeedback

Viewer • Updated Mar 15 • 10k • 97 • 25

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2 • 15

ai4bharat/sangraha

Viewer • Updated Oct 21 • 268M • 17k • 31

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Paper • 2311.04205 • Published Nov 7, 2023 • 5

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

Paper • 2401.01854 • Published Jan 3 • 10

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2 • 64

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 184

Self-Instruct: Aligning Language Model with Self Generated Instructions

Paper • 2212.10560 • Published Dec 20, 2022 • 8

HuggingFaceH4/self-instruct-seed

Viewer • Updated Jan 31, 2023 • 175 • 332 • 25

ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 7

Dynamic Planning with a LLM

Paper • 2308.06391 • Published Aug 11, 2023 • 2

FreedomIntelligence/SocraticChat

Viewer • Updated Oct 12, 2023 • 50.7k • 39 • 6

Large Language Model as a User Simulator

Paper • 2308.11534 • Published Aug 21, 2023 • 2

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Paper • 2309.10814 • Published Sep 19, 2023 • 3

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 22

mlabonne/alpagasus

Viewer • Updated Aug 3, 2023 • 9.23k • 50 • 8

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

THUDM/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 330 • 198

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Paper • 2310.01557 • Published Oct 2, 2023 • 12

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 33

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Paper • 2304.11477 • Published Apr 22, 2023 • 3

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 74

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Paper • 2308.00436 • Published Aug 1, 2023 • 21

📢 UGI Leaderboard

Space • Running • 461

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Paper • 2310.16049 • Published Oct 24, 2023 • 4

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 19

allenai/UNcommonsense

Viewer • Updated Jan 19 • 18.3k • 53 • 8

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 10

Flows: Building Blocks of Reasoning and Collaborating AI

Paper • 2308.01285 • Published Aug 2, 2023 • 2

aiflows/CCFlows

Updated Dec 10, 2023 • 2

Learning to Reason and Memorize with Self-Notes

Paper • 2305.00833 • Published May 1, 2023 • 4

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

Paper • 2305.03268 • Published May 5, 2023 • 2

Making Large Language Models Better Reasoners with Alignment

Paper • 2309.02144 • Published Sep 5, 2023 • 2

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Paper • 2309.17382 • Published Sep 29, 2023 • 4

ALERT: Adapting Language Models to Reasoning Tasks

Paper • 2212.08286 • Published Dec 16, 2022 • 2

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Paper • 2402.04858 • Published Feb 7 • 14

Vivacem/MMIQC

Viewer • Updated Jan 20 • 2.29M • 93 • 14

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Paper • 2403.04746 • Published Mar 7 • 22

Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6 • 18

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Paper • 2402.10466 • Published Feb 16 • 17

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Paper • 2402.02285 • Published Feb 3 • 1

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27 • 16

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27 • 18

Aman279/Locomo

Viewer • Updated Mar 7 • 35 • 4 • 1

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15 • 53

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20 • 47

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22 • 82

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21 • 47

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Paper • 2402.16288 • Published Feb 26 • 1

pandalla/Machine_Mindset_MBTI_dataset

Viewer • Updated Jun 4 • 161k • 409 • 53

berkeley-nest/Nectar

Viewer • Updated Mar 20 • 183k • 464 • 277

totally-not-an-llm/sharegpt-hyperfiltered-3k

Viewer • Updated Jul 13, 2023 • 3.24k • 100 • 14

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12 • 31.1M • 5.03k • 567

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 8.72k • 125

dmayhem93/self-critiquing-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 44 • 1

dmayhem93/self-critiquing-critique-and-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 33 • 1

morzecrew/RefinedPersonaChat

Viewer • Updated Aug 7, 2023 • 207k • 50 • 2

beratcmn/rephrased-instruction-turkish-poems

Viewer • Updated Dec 16, 2023 • 4.96k • 41 • 4

Birchlabs/openai-prm800k-stepwise-critic

Viewer • Updated Jun 3, 2023 • 1.09M • 346 • 43

theblackcat102/evol-codealpaca-v1

Viewer • Updated Mar 10 • 111k • 1.21k • 154

meta-math/GSM8K_Backward

Viewer • Updated Nov 10, 2023 • 1.27k • 83 • 15

meta-math/MetaMathQA-40K

Viewer • Updated Nov 10, 2023 • 40k • 249 • 21

glaiveai/glaive-code-assistant-v2

Viewer • Updated Apr 4 • 215k • 61 • 43

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5 • 5

PROC2PDDL: Open-Domain Planning Representations from Texts

Paper • 2403.00092 • Published Feb 29 • 1

btan2/cappy-large

Text Classification • Updated Dec 7, 2023 • 21 • 19

VMware/open-instruct

Viewer • Updated Jul 12, 2023 • 143k • 98 • 44

QizhiPei/BioT5_finetune_dataset

Viewer • Updated Sep 2 • 33 • 608 • 6

Tensoic/gooftagoo

Viewer • Updated Mar 16 • 16.2k • 49 • 9

GenVRadmin/Aryabhatta-Orca-Maths-Hindi

Viewer • Updated Mar 18 • 200k • 39 • 3

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Paper • 2310.00280 • Published Sep 30, 2023 • 3

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Paper • 2311.05997 • Published Nov 10, 2023 • 36

wangwilliamyang/wikihow

Updated Jan 18 • 9

argilla/distilabel-capybara-kto-15k-binarized

Viewer • Updated Mar 19 • 15.1k • 57 • 5

argilla/ultrafeedback-binarized-preferences-cleaned-kto

Viewer • Updated Mar 19 • 231k • 156 • 9

argilla/distilabel-intel-orca-kto

Viewer • Updated Mar 19 • 23.1k • 36 • 5

argilla/kto-mix-15k

Viewer • Updated Apr 19 • 15.3k • 110 • 13

KnutJaegersberg/dolphin_orca_clustered

Updated Sep 14, 2023 • 37 • 1

GAIR/autoj-scenario-classifier

Text Generation • Updated Oct 9, 2023 • 12 • 5

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 70

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

Ask Optimal Questions: Aligning Large Language Models with Retriever's Preference in Conversational Search

Paper • 2402.11827 • Published Feb 19 • 1

Grounding Language Model with Chunking-Free In-Context Retrieval

Paper • 2402.09760 • Published Feb 15

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19 • 16

BAAI/OPI

Preview • Updated 25 days ago • 547 • 8

internlm/Agent-FLAN

Preview • Updated Mar 20 • 118 • 66

kaist-ai/selfee-train

Viewer • Updated May 31, 2023 • 178k • 57 • 9

fabiochiu/medium-articles

Preview • Updated Jul 17, 2022 • 152 • 23

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20 • 13

voidful/MuSiQue

Preview • Updated May 20, 2023 • 38 • 4

BAAI/bge-reranker-v2-m3

Text Classification • Updated Jun 24 • 736k • 405

allenai/reward-bench

Viewer • Updated Sep 9 • 8.11k • 7.14k • 77

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 22

In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 42

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 406 • 74

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 65

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 62

princeton-nlp/QuRatedPajama-260B

Viewer • Updated Apr 16 • 254M • 513 • 6

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20 • 20

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 32

Locutusque/OpenCerebrum-dpo

Viewer • Updated Mar 26 • 21.1k • 42 • 6

Doctor-Shotgun/theory-of-mind-dpo

Viewer • Updated Mar 14 • 539 • 47 • 16

Locutusque/arc-cot-dpo

Viewer • Updated Mar 26 • 957 • 37 • 6

fblgit/simple-math-DPO

Viewer • Updated Aug 1 • 800k • 205 • 16

KrisPi/PythonTutor-Evol-1k-DPO-GPT4_vs_35

Viewer • Updated Nov 18, 2023 • 943 • 47 • 13

zerolink/zsql-postgres-dpo

Viewer • Updated Feb 2 • 259k • 66 • 6

Lakera/gandalf_ignore_instructions

Viewer • Updated Oct 2, 2023 • 1k • 434 • 27

mrm8488/unnatural-instructions-full

Viewer • Updated Dec 21, 2022 • 66k • 68 • 16

NilanE/SmallParallelDocs-Ja_En-6k

Viewer • Updated Mar 5 • 6.32k • 103 • 2

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27 • 24

NousResearch/OLMo-Bitnet-1B

Text Generation • Updated Apr 11 • 368 • 118

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21 • 52 • 206

CarperAI/openai_summarize_comparisons

Viewer • Updated Feb 27, 2023 • 260k • 1.43k • 40

PygmalionAI/PIPPA

Updated Sep 7, 2023 • 229 • 203

ivanleomk/gpt4-chain-of-density

Preview • Updated Nov 12, 2023 • 70 • 6

AIRI-NLP/cnli_memory_extracted

Viewer • Updated Mar 22 • 8.23k • 58 • 1

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 27

openbmb/UltraInteract_sft

Viewer • Updated Apr 5 • 289k • 1.29k • 118

openbmb/UltraInteract_pair

Viewer • Updated Apr 5 • 220k • 634 • 104

openbmb/Eurus-70b-nca

Text Generation • Updated Apr 12 • 461 • 11

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8 • 1

ai2lumos/lumos_multimodal_ground_iterative

Viewer • Updated Mar 19 • 15.9k • 47 • 1

ai2lumos/lumos_multimodal_plan_iterative

Viewer • Updated Mar 19 • 15.9k • 54 • 2

ai2lumos/lumos_complex_qa_plan_onetime

Viewer • Updated Mar 19 • 19.4k • 61 • 3

ai2lumos/lumos_complex_qa_ground_onetime

Viewer • Updated Mar 19 • 19.2k • 74 • 3

ai2lumos/lumos_complex_qa_ground_iterative

Viewer • Updated Mar 19 • 19.1k • 65 • 2

ai2lumos/lumos_unified_plan_iterative

Viewer • Updated Mar 19 • 55.4k • 68 • 2

ai2lumos/lumos_complex_qa_plan_iterative

Viewer • Updated Mar 18 • 19k • 64 • 6

ai2lumos/lumos_unified_ground_iterative

Viewer • Updated Mar 19 • 55.5k • 57 • 2

ai2lumos/lumos_web_agent_ground_iterative

Viewer • Updated Mar 18 • 1.01k • 49 • 2

ai2lumos/lumos_web_agent_plan_iterative

Viewer • Updated Mar 18 • 1.01k • 49 • 4

ai2lumos/lumos_maths_ground_iterative

Viewer • Updated Mar 18 • 19.5k • 63 • 3

ai2lumos/lumos_maths_ground_onetime

Viewer • Updated Mar 18 • 19.8k • 54 • 1

ai2lumos/lumos_maths_plan_onetime

Viewer • Updated Mar 18 • 19.8k • 45 • 2

Symbol-LLM/Symbol-LLM-7B-Instruct

Text Generation • Updated Jun 23 • 64 • 13

MoritzLaurer/deberta-v3-large-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 11 • 101k • 82

MoritzLaurer/bge-m3-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 22 • 159k • 42

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

Pavithree/eli5

Viewer • Updated Apr 23, 2022 • 229k • 271 • 2

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23 • 1.95k • 214 • 19

paraloq/json_data_extraction

Viewer • Updated Mar 25 • 484 • 78 • 18

livecodebench/execution

Viewer • Updated Mar 12 • 479 • 59 • 4

iamtarun/python_code_instructions_18k_alpaca

Viewer • Updated Jul 27, 2023 • 18.6k • 1.99k • 240

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22 • 25

manishiitg/CogStack-QA

Viewer • Updated Feb 9 • 24.7k • 37 • 1

manishiitg/CogStack-Tasks

Viewer • Updated Feb 9 • 4.69k • 31 • 1

manishiitg/CogStack-Conv

Viewer • Updated Feb 9 • 2.35k • 34 • 1

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19 • 16

abacusai/SystemChat-1.1

Viewer • Updated Apr 11 • 20.2k • 81 • 29

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 103

Anthropic/persuasion

Viewer • Updated Apr 9 • 3.94k • 364 • 175

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 84

M4-ai/prm_dpo_pairs

Viewer • Updated Jul 1 • 93.9k • 60 • 7

OpenLLM-France/Claire-Dialogue-French-0.1

Viewer • Updated Dec 5, 2023 • 37k • 189 • 41

amaydle/npc-dialogue

Viewer • Updated Mar 25, 2023 • 1.92k • 49 • 15

facebook/empathetic_dialogues

Updated Jan 18 • 1.29k • 93

Salesforce/dialogstudio

Updated Jul 21 • 610 • 215

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 60

microsoft/Taskbench

Viewer • Updated Aug 21 • 17.3k • 541 • 21

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 82

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 24

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17 • 44.2k • 1.28k • 254

allenai/persona-bias

Updated Feb 5 • 53 • 11

PleIAs/YouTube-Commons

Updated Jun 26 • 1.08k • 320

FreedomIntelligence/evol-instruct-hindi

Viewer • Updated Aug 6, 2023 • 59k • 15 • 2

FreedomIntelligence/OVM-process

Viewer • Updated Apr 1 • 7.47k • 39 • 1

nuprl/CanItEdit

Viewer • Updated Mar 19 • 105 • 385 • 11

totally-not-an-llm/EverythingLM-data-V3

Viewer • Updated Sep 11, 2023 • 1.07k • 71 • 31

RUCAIBox/Story-Generation

Updated Mar 3, 2023 • 61 • 12

fabraz/writingPromptAug

Viewer • Updated Oct 14, 2023 • 24.1k • 119 • 2

jerryjalapeno/nart-100k-synthetic

Viewer • Updated Jul 16, 2023 • 99.1k • 129 • 39

jat-project/jat-dataset

Viewer • Updated Feb 16 • 258M • 249k • 33

euclaise/ReMask-3B

Text Generation • Updated Aug 10 • 79 • 15

google/Synthetic-Persona-Chat

Viewer • Updated Mar 1 • 10.9k • 1.23k • 76

google/cvss

Updated Feb 10 • 144 • 12

neural-bridge/rag-dataset-12000

Viewer • Updated Feb 5 • 12k • 1.6k • 111

HannahRoseKirk/prism-alignment

Viewer • Updated Apr 25 • 77.9k • 982 • 64

Gigax/NPC-LLM-3_8B

Text Generation • Updated May 14 • 605 • 24

nuprl/MultiPL-T

Viewer • Updated Aug 20 • 215k • 376 • 7

cognitivecomputations/SystemChat-1.2

Viewer • Updated Apr 30 • 52 • 52 • 6

mlabonne/arena-preferences

Viewer • Updated Apr 27 • 2.69k • 62 • 9

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12 • 10

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

Paper • 2401.07793 • Published Jan 15 • 3

yutaozhu94/INTERS

Preview • Updated Feb 19 • 923 • 12

THUDM/CogAgent

Updated Dec 18, 2023 • 16

urchade/gliner_large-v2.1

Token Classification • Updated Apr 10 • 10.2k • 28

shachardon/ShareLM

Viewer • Updated Aug 6 • 331k • 337 • 29

nvidia/ChatQA-Training-Data

Viewer • Updated Jun 4 • 442k • 1.3k • 160

lightblue/tagengo-gpt4

Viewer • Updated Jun 2 • 78.1k • 122 • 61

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16 • 34k • 29

bigcode/commitpackft

Viewer • Updated Aug 20, 2023 • 702k • 5.66k • 61

glaiveai/glaive-code-assistant-v3

Viewer • Updated May 20 • 950k • 216 • 44

davanstrien/cosmochat

Viewer • Updated May 10 • 199 • 54 • 11

davanstrien/cosmopedia_chat

Viewer • Updated Mar 8 • 1.19k • 59 • 7

MemGPT/MSC-Self-Instruct

Viewer • Updated Nov 2, 2023 • 500 • 202 • 11

MemGPT/qa_data

Viewer • Updated Feb 6 • 18.6k • 21 • 1

google/imageinwords

Updated May 25 • 204 • 115

grammarly/coedit

Viewer • Updated Oct 21, 2023 • 70.8k • 1.14k • 63

bea2019st/wi_locness

Updated Jan 18 • 116 • 14

GEM/FairytaleQA

Viewer • Updated Oct 25, 2022 • 10.6k • 143 • 8

grammarly/medit

Viewer • Updated Oct 1 • 113k • 100 • 13

MemGPT/MemGPT-DPO-Dataset

Viewer • Updated Apr 18 • 42.3k • 89 • 8

lmsys/lmsys-arena-human-preference-55k

Viewer • Updated May 17 • 57.5k • 1.14k • 136

princeton-nlp/QuRating-GPT3.5-Judgments

Viewer • Updated Mar 29 • 250k • 41 • 5

princeton-nlp/AutoCompressor-Llama-2-7b-6k

Updated Nov 22, 2023 • 1.86k • 2

H-D-T/Select-Stack

Viewer • Updated Sep 2 • 1.46M • 50 • 16

EleutherAI/lichess-puzzles

Viewer • Updated May 9 • 1.48M • 61 • 20

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 131 • 67

community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24 • 1.46M • 1.44k • 54

TIGER-Lab/MMLU-Pro

Viewer • Updated 3 days ago • 12.1k • 32.1k • 288

ylacombe/expresso

Viewer • Updated Apr 30 • 11.6k • 295 • 32

microsoft/MeetingBank-QA-Summary

Viewer • Updated May 16 • 862 • 191 • 12

microsoft/MeetingBank-LLMCompressed

Viewer • Updated May 16 • 5.17k • 89 • 15

nvidia/ChatRAG-Bench

Viewer • Updated May 24 • 34.6k • 2.13k • 100

xingyaoww/code-act

Viewer • Updated Feb 5 • 78.4k • 333 • 49

kaist-ai/Multifaceted-Collection-ORPO

Viewer • Updated Jul 1 • 64.6k • 58 • 9

Alibaba-NLP/gte-Qwen2-7B-instruct

hwjiang/Real3D

Image-to-3D • Updated Jun 14 • 15 • 13

nvidia/Aegis-AI-Content-Safety-Dataset-1.0

Viewer • Updated Jun 28 • 12k • 666 • 46

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

Updated Oct 30 • 3.85k • 119

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12 • 28

facebook/multi-token-prediction

Updated Jun 18 • 350

TIGER-Lab/M-BEIR

Viewer • Updated Aug 7 • 2.86M • 1.18k • 13

tomg-group-umd/pixelprose

Viewer • Updated Jun 23 • 15.6M • 1.2k • 130

mit-han-lab/ShareGPT4V

Preview • Updated Feb 22 • 41 • 3

mit-han-lab/litepose

Updated Jun 5 • 1

mit-han-lab/Llama-3-8B-Instruct-QServe-g128

Text Generation • Updated May 6 • 10 • 1

internlm/internlm-xcomposer2-vl-7b

Visual Question Answering • Updated Apr 12 • 3.89k • 79

OpenGVLab/InternViT-6B-448px-V1-5

Image Feature Extraction • Updated Aug 23 • 6.57k • 74

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated 10 days ago • 3.69k • 402

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated 10 days ago • 3.6k • 58

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Sep 25 • 32.5k • 1.38k

microsoft/Florence-2-large

Image-Text-to-Text • Updated 16 days ago • 812k • 1.26k

llava-hf/LLaVA-NeXT-Video-7B-DPO-hf

Video-Text-to-Text • Updated 9 days ago • 2.41k • 8

arcee-ai/BAAI-Infinity-Instruct-System

Viewer • Updated Jun 24 • 2.36M • 213 • 15

hpcai-tech/OpenSora-VAE-v1.2

Updated Jun 17 • 595k • 54

hpcai-tech/OpenSora-STDiT-v3

Updated Jun 17 • 169k • 42

liuqi6777/RankGPT-msmarco-100k-clean

Viewer • Updated Feb 6 • 87.3k • 50 • 1

failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5

Text Generation • Updated May 30 • 5.02k • 37

ResplendentAI/NSFW_RP_Format_DPO

Viewer • Updated Mar 17 • 400 • 53 • 59

microsoft/msr_text_compression

Updated Jan 18 • 72 • 8

microsoft/msr_sqa

Updated Jan 18 • 135 • 4

microsoft/crd3

Updated Jan 18 • 160 • 23

nvidia/domain-classifier

Updated Jun 24 • 53.8k • 59

jhu-clsp/FollowIR-train

Viewer • Updated Mar 25 • 1.78k • 48 • 5

vicgalle/Phudge-3

Text Classification • Updated May 30 • 7 • 3

TWO/sutra-mlt256-v2

Updated May 24 • 8

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Paper • 2406.19251 • Published Jun 27 • 8

aiana94/xMINDlarge

Viewer • Updated Oct 25 • 4.12M • 177 • 4

OpenCo7/UpVoteWeb

Viewer • Updated Jul 17 • 557M • 613 • 93

davanstrien/magpie-preference

Viewer • Updated 2 days ago • 494 • 911 • 12

FunAudioLLM/SenseVoiceSmall

Updated Jul 31 • 2.44k • 177

euclaise/gsm8k_multiturn

Viewer • Updated Jul 6 • 8.79k • 80 • 13

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22 • 9.44k • 183

dell-research-harvard/newswire

Viewer • Updated Jul 2 • 1.44M • 422 • 68

alexshengzhili/SciGraphQA-295K-train

Viewer • Updated Aug 8, 2023 • 296k • 135 • 11

xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30 • 79.7k • 1.13k

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27 • 8

laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29 • 331k • 10 • 8

laion/datacomp-hq

Viewer • Updated Mar 13 • 20.7M • 196 • 10

laion/Subjects-for-curricular

Viewer • Updated Dec 20, 2023 • 3.99M • 102 • 5

laion/strategic_game_maze

Viewer • Updated Oct 20, 2023 • 345M • 19.5k • 10

mlabonne/llmtwin

Viewer • Updated Aug 27 • 3.34k • 108 • 7

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9 • 41

dunzhang/stella_en_400M_v5

dunzhang/stella_en_1.5B_v5

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20 • 397 • 44

agentsea/wave-ui-25k

Viewer • Updated Jul 3 • 25k • 659 • 16

TencentARC/StoryStream

Preview • Updated Jul 17 • 374 • 23

apple/DCLM-7B

Updated Jul 26 • 3.03k • 825

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6 • 237M • 37.4k • 250

HuggingFaceTB/bisac-topics

Viewer • Updated Apr 3 • 5.5k • 7 • 2

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Paper • 2407.11239 • Published Jul 15 • 7

mistralai/Mistral-Nemo-Base-2407

Text Generation • Updated 24 days ago • 59.3k • 260

TencentARC/SEED-Story

Text-to-Image • Updated Aug 26 • 27 • 24

xlangai/BRIGHT

Viewer • Updated 13 days ago • 1.35M • 2.48k • 18

glaiveai/RAG-v1

Viewer • Updated Jun 25 • 51.4k • 247 • 64

QuietImpostor/Claude-3-Opus-Claude-3.5-Sonnnet-9k

Viewer • Updated Jun 30 • 9.94k • 86 • 17

PawanKrd/gpt-4o-200k

Viewer • Updated Jun 29 • 200k • 49 • 23

kalomaze/Opus_Instruct_3k

Viewer • Updated Jul 19 • 2.95k • 90 • 24

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Paper • 2206.07643 • Published Jun 15, 2022 • 1

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Paper • 2303.15256 • Published Mar 27, 2023 • 1

fireworks-ai/llama-3-firefunction-v2

Text Generation • Updated Jun 18 • 167 • 136

Stateful Memory-Augmented Transformers for Dialogue Modeling

Paper • 2209.07634 • Published Sep 15, 2022 • 1

cognitivecomputations/SystemChat-2.0

Preview • Updated May 31 • 72 • 54

CollectiveCognition/chats-data-2023-10-16

Viewer • Updated Oct 16, 2023 • 200 • 50 • 21

Izazk/Sequence-of-action-prediction-mind2web

Viewer • Updated Feb 22 • 68.9k • 78 • 3

BigAction/mind2web_clean

Viewer • Updated Apr 25 • 199 • 64 • 4

osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 504 • 90

magicgh/MT-Mind2Web

Viewer • Updated Feb 23 • 259 • 87 • 2

TencentARC/PhotoMaker-V2

Text-to-Image • Updated Jul 22 • 20.2k • 123

KevSun/Personality_LM

Text Classification • Updated Jul 29 • 254 • 15

♾️📚 Infinite Dataset Hub

Space • Running • 241

Search and save datasets generated with a LLM in real time

chargoddard/SlimOrcaDedupCleaned-Sonnet3.5-DPO

Viewer • Updated Jul 23 • 168k • 51 • 7

nvidia/Minitron-8B-Base

Updated Aug 20 • 55 • 63

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21 • 623M • 90.9k • 76

mlfoundations/MINT-1T-ArXiv

Viewer • Updated Sep 19 • 5.6M • 3.2k • 48

mlfoundations/MINT-1T-PDF-CC-2024-18

Updated Sep 19 • 20k • 19

AI-MO/NuminaMath-TIR

Viewer • Updated 6 days ago • 72.5k • 574 • 68

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Paper • 2406.00856 • Published Jun 2 • 10

mlabonne/FineTome-100k

Viewer • Updated Jul 29 • 100k • 7.68k • 130

LiruiZhao/Diffree

Image-to-Image • Updated Jul 29 • 65 • 17

BAAI/bge-multilingual-gemma2

Feature Extraction • Updated Jul 31 • 133k • 139

BAAI/bge-reranker-v2.5-gemma2-lightweight

Text Classification • Updated Sep 6 • 11.5k • 42

BAAI/IndustryCorpus

Viewer • Updated Jul 23 • 595M • 1.7k • 47

jspringer/echo-mistral-7b-instruct-lasttoken

Feature Extraction • Updated Feb 26 • 417 • 5

BAAI/bge-en-icl

Feature Extraction • Updated Sep 25 • 31.2k • 100

AlekseyKorshuk/full_user_edit_responses-clean

Viewer • Updated Mar 30, 2023 • 364k • 42 • 1

m-a-p/MMRA

Viewer • Updated Jul 31 • 1.02k • 119 • 13

m-a-p/II-Bench

Viewer • Updated Jun 29 • 1.43k • 617 • 8

BEE-spoke-data/fineweb-1000_64k

Viewer • Updated Jun 23 • 2k • 59 • 3

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated Sep 18 • 15.8k • 184

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16 • 1.36M • 6.82k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16 • 1.91M • 2.95k

numind/NuExtract

Text Generation • Updated Oct 17 • 2.12k • 212

numind/NuSentiment-multilingual

Feature Extraction • Updated Jan 26 • 157 • 10

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Sep 18 • 15.9k • 246

aipicasso/megalith-10m-florence2

Viewer • Updated Jul 31 • 9.14M • 67 • 22

ZhengPeng7/BiRefNet

Image Segmentation • Updated 23 days ago • 848k • 262

nvidia/quality-classifier-deberta

Updated Aug 6 • 2.53k • 49

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7 • 4

tiiuae/falcon-mamba-7b-4bit

Text Generation • Updated Oct 10 • 127 • 11

nisten/all-human-diseases

Viewer • Updated Aug 19 • 2.2k • 99 • 101

THUDM/LongWriter-6k

Viewer • Updated Aug 14 • 6k • 286 • 169

anthracite-org/Stheno-Data-Filtered

Viewer • Updated Aug 18 • 31.1k • 18 • 14

anthracite-org/kalo-opus-instruct-22k-no-refusal

Viewer • Updated Aug 13 • 22.3k • 225 • 19

anthracite-org/nopm_claude_writing_fixed

Viewer • Updated Aug 18 • 6.35k • 136 • 8

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26 • 973k • 590

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated Oct 24 • 52.4k • 525

fal/AuraFace-v1

Updated Aug 26 • 71

NexaAIDev/Squid

Updated Sep 3 • 50 • 32

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28 • 42

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated Aug 17 • 2.38k • 628 • 79

NousResearch/hermes-function-calling-v1

Viewer • Updated Aug 30 • 11.6k • 697 • 220

multimodalart/product-design

Text-to-Image • Updated Sep 22 • 2.87k • 30

novateur/WavTokenizer

Text-to-Speech • Updated Sep 27 • 45

facebook/sapiens

Updated Sep 20 • 411 • 221

Shakker-Labs/AWPortrait-FL

Text-to-Image • Updated Sep 5 • 31.9k • 408

sequelbox/Supernova

Viewer • Updated Sep 27 • 178k • 241 • 8

🖼💬 Vision Arena (Testing VLMs side-by-side)

Space • Running • 521

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24 • 2.04k • 1.71k

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Oct 8 • 8.83k • 641

deepseek-ai/ESFT-vanilla-lite

Text Generation • Updated Jul 23 • 117 • 8

yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • Updated Sep 6 • 1.86k • 124

gabrielmbmb/distilabel-reflection-tuning

Viewer • Updated Sep 6 • 5 • 87 • 55

TencentARC/Open-MAGVIT2

Image Feature Extraction • Updated Sep 9 • 10

openbmb/MiniCPM3-4B

Text Generation • Updated about 20 hours ago • 29.8k • 384

THUDM/LongCite-glm4-9b

Text Generation • Updated Sep 13 • 429 • 27

jinaai/reader-lm-1.5b

Text Generation • Updated Sep 20 • 2.24k • 489

Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Sep 15 • 45 • 35

tencent/DepthCrafter

Depth Estimation • Updated Sep 24 • 344k • 70

mistralai/Pixtral-12B-2409

Updated 5 days ago • 524

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Sep 18 • 813k • 1.23k

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19 • 15

THUdyh/Oryx-7B

Text Generation • Updated Sep 25 • 245 • 11

THUdyh/Oryx-7B-Image

Text Generation • Updated Sep 23 • 13 • 3

THUdyh/Oryx-ViT

Image Classification • Updated Sep 23 • 5

BAAI/SegGPT

Updated Apr 21, 2023 • 17

Salesforce/fineweb_deduplicated

Viewer • Updated Sep 14 • 6.43B • 1.84k • 27

KbsdJames/Omni-MATH

Viewer • Updated Oct 12 • 4.43k • 872 • 58

BAAI/Emu3-Gen

Any-to-Any • Updated Oct 23 • 5.39k • 190

CultriX/elitebabes-flux

Text-to-Image • Updated Sep 20 • 3.03k • 14

RED-AIGC/StoryMaker

Text-to-Image • Updated 22 days ago • 746 • 72

google/frames-benchmark

Viewer • Updated Oct 15 • 824 • 1.98k • 170

Anthropic/discrim-eval

Viewer • Updated Jan 5 • 18.9k • 893 • 43

facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24 • 17.6k • 41

Zyphra/Zamba2-2.7B-instruct

Text Generation • Updated Oct 18 • 2.31k • 78

princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

Updated about 1 month ago • 24k • 17

jxm/cde-small-v1

Feature Extraction • Updated Oct 30 • 10.6k • 274

PrincetonPLI/Instruct-SkillMix-SDD

Viewer • Updated Sep 9 • 8k • 71 • 4

THUDM/cogvlm2-llama3-caption

Video-Text-to-Text • Updated Sep 26 • 3.42k • 66

julien040/hacker-news-posts

Viewer • Updated Jun 6, 2023 • 4.01M • 86 • 5

princeton-nlp/Llama-3-8B-ProLong-512k-Base

Updated about 1 month ago • 181 • 6

LLM360/TxT360

Preview • Updated 23 days ago • 79.6k • 213

bingbangboom/flux-waterscape

Text-to-Image • Updated Oct 10 • 294 • 13

facebook/Self-taught-evaluator-DPO-data

Viewer • Updated Sep 30 • 57.5k • 79 • 30

facebook/layerskip-llama2-13B

Text Generation • Updated Oct 19 • 41 • 5

ibm-granite/granite-8b-code-instruct-accelerator

Updated May 29 • 3 • 1

peakji/steiner-32b-preview

Updated Oct 21 • 45 • 40

CohereForAI/aya-expanse-32b

Text Generation • Updated 30 days ago • 33.6k • 175

CohereForAI/aya-expanse-8b

Text Generation • Updated Oct 30 • 46.8k • 294

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10 • 3

McGill-NLP/FaithDial

Viewer • Updated Feb 5, 2023 • 32.3k • 297 • 17

relaxml/Llama-3.1-8b-Instruct-QTIP-4Bit

Updated Oct 28 • 81 • 2

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13 • 3

GAIR/o1-journey

Viewer • Updated Oct 16 • 327 • 1.09k • 92

marcelbinz/Psych-101

Viewer • Updated 28 days ago • 60.1k • 253 • 37

nvidia/Nemotron-4-Mini-Hindi-4B-Base

Updated Oct 23 • 7 • 10

nvidia/Nemotron-4-Mini-Hindi-4B-Instruct

Updated 16 days ago • 44 • 12

Etched/oasis-500m

Updated 26 days ago • 5.48k • 419

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 4 days ago • 90.9k • 393

tencent/Tencent-Hunyuan-Large

Text Generation • Updated 7 days ago • 239 • 476

THUDM/webrl-llama-3.1-8b

Updated 25 days ago • 200 • 3

THUDM/webrl-glm-4-9b

Updated 25 days ago • 129 • 7

hbseong/HarmAug-Guard

Text Classification • Updated Oct 14 • 1.08k • 33

BAAI/IndustryCorpus2

Viewer • Updated 15 days ago • 826M • 6.73k • 35

qq8933/OpenLongCoT-Pretrain

Viewer • Updated Oct 28 • 103k • 661 • 83

microsoft/maira-2

Text Generation • Updated Oct 21 • 2.2k • 33

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 23 days ago • 35

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated about 1 month ago • 1.05M • 3.59k • 377

Nexusflow/Athene-V2-Chat

Text Generation • Updated 4 days ago • 3.48k • 126

Nexusflow/Athene-V2-Agent

Text Generation • Updated 9 days ago • 1.96k • 74

numind/NuExtract-1.5-tiny

Text Generation • Updated 13 days ago • 3.1k • 12

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 23 days ago • 48

allenai/ACE2-ERA5

Updated 10 days ago • 1

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published 9 days ago • 9

nvidia/Hymba-1.5B-Base

Text Generation • Updated 2 days ago • 5.82k • 96

AIDC-AI/Marco-o1

Text Generation • Updated 8 days ago • 7.51k • 563

allenai/Llama-3.1-Tulu-3-70B

Text Generation • Updated 5 days ago • 1.03k • 42

nachoyawn/three-million-bluesky

Viewer • Updated 2 days ago • 3.01M • 20 • 10