Jade

euclaise

Memphis: Advancing language model reasoning without relying on proprietary model outputs

Memphis is a series of models that advances the state of human-data-only models, offering good performance without relying on proprietary model outputs (e.g. GPT-generated datasets). I've developed a new iterative finetuning procedure to improve the reasoning ability of these models beyond what is possible with SFT alone on the same data.

Currently, I've released two models: Memphis-CoT-3B and Memphis-scribe-3B.

To build these models, I created new datasets (see the loading example after this list):
- euclaise/reddit-instruct : A dataset of instruction/QA-like data scraped from Reddit. A curated version, filtered using Lilac and neural embedding models, is available at euclaise/reddit-instruct-curated
- euclaise/TinyCoT : A meta-dataset that aggregates a variety of human-sourced reasoning data. It is a curated version of my previous MegaCoT dataset euclaise/MegaCoT, whose 629k responses are cut down to 28k for TinyCoT. There's also an intermediate version, euclaise/MiniCoT, with 129k responses.
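
All of these are hosted on the Hugging Face Hub, so they can be pulled down directly with the datasets library. A minimal sketch (the "train" split name is an assumption and may differ per dataset):

```python
from datasets import load_dataset

# Quick look at two of the datasets listed above; the "train" split name
# is assumed here and may differ per dataset.
reddit_instruct = load_dataset("euclaise/reddit-instruct", split="train")
tiny_cot = load_dataset("euclaise/TinyCoT", split="train")

print(reddit_instruct)  # column names and row count
print(tiny_cot[0])      # a single reasoning example
```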

Memphis-CoT is trained on reddit-instruct, a filtered version of oasst2 (sablo/oasst2_curated), and TinyCoT. Multiple iterations were performed on TinyCoT, while reddit-instruct and oasst2 were only used for the initial model.
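
As a rough illustration of that split (not the actual training script), the iteration-0 mixture versus later iterations could be assembled like this; the to_text mapping is a hypothetical placeholder, since each source has its own schema:

```python
from datasets import load_dataset, concatenate_datasets

def to_text(example):
    # Hypothetical formatting step: each source has a different schema,
    # so assume a real prompt/response template is applied here.
    return {"text": str(example)}

reddit  = load_dataset("euclaise/reddit-instruct", split="train").map(to_text)
oasst2  = load_dataset("sablo/oasst2_curated", split="train").map(to_text)
tinycot = load_dataset("euclaise/TinyCoT", split="train").map(to_text)

# Iteration 0 sees all three sources; later iterations reuse TinyCoT only.
iteration_0_data = concatenate_datasets(
    [d.select_columns(["text"]) for d in (reddit, oasst2, tinycot)]
)
later_iteration_data = tinycot.select_columns(["text"])
```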

Memphis-scribe further finetunes Memphis-CoT on more creative tasks, using 18 different datasets, including euclaise/WritingPrompts_curated and lemonilia/LimaRP.

To prevent catastrophic forgetting, I used weight averaging between iterations.
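
For context, weight averaging between iterations can be as simple as linearly interpolating the parameter tensors of two checkpoints. A minimal sketch with transformers and torch; the 0.5 interpolation weight and checkpoint paths are placeholders, not the values used for Memphis:

```python
import torch
from transformers import AutoModelForCausalLM

def average_checkpoints(path_a: str, path_b: str, alpha: float = 0.5):
    """Linearly interpolate two checkpoints of the same architecture.

    alpha is the weight on checkpoint A; 0.5 gives a plain average.
    """
    model_a = AutoModelForCausalLM.from_pretrained(path_a)
    model_b = AutoModelForCausalLM.from_pretrained(path_b)

    state_a = model_a.state_dict()
    state_b = model_b.state_dict()

    with torch.no_grad():
        merged = {
            name: alpha * state_a[name] + (1.0 - alpha) * state_b[name]
            for name in state_a
        }
    model_a.load_state_dict(merged)
    return model_a  # now carries the averaged weights

# e.g. merged_model = average_checkpoints("ckpt-iter1", "ckpt-iter2")
```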

- euclaise/Memphis-CoT-3B
- euclaise/Memphis-scribe-3B