ben burtenshaw's picture

ben burtenshaw

burtenshaw

·

AI & ML interests

None yet

Recent Activity

updated a Space 1 day ago

data-is-better-together/image-preferences-leaderboard

Reacted to davidberenstein1957's post with ➕ 1 day ago

Let’s make a generation of amazing image-generation models The best image generation models are trained on human preference datasets, where annotators have selected the best image from a choice of two. Unfortunately, many of these datasets are closed source so the community cannot train open models on them. Let’s change that! The community can contribute image preferences for an open-source dataset that could be used for building AI models that convert text to image, like the flux or stable diffusion families. The dataset will be open source so everyone can use it to train models that we can all use. Blog: https://huggingface.co/blog/burtenshaw/image-preferences

Reacted to davidberenstein1957's post with 😎 1 day ago

Let’s make a generation of amazing image-generation models The best image generation models are trained on human preference datasets, where annotators have selected the best image from a choice of two. Unfortunately, many of these datasets are closed source so the community cannot train open models on them. Let’s change that! The community can contribute image preferences for an open-source dataset that could be used for building AI models that convert text to image, like the flux or stable diffusion families. The dataset will be open source so everyone can use it to train models that we can all use. Blog: https://huggingface.co/blog/burtenshaw/image-preferences

View all activity

Articles

Let’s make a generation of amazing image generation models

Zero to Hero with the TRL learning link bomb 💣

Low Code Large Language Model Alignment

Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub — No Code Required

How to build a custom text classifier without days of human labeling

How to optimize your data labelling project with custom interfaces

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Organizations

Posts 1

Post

1432

SFT + Quantisation + Unsloth is a super easy way of squeezing extra performance out of an LLM at low latencies. Here are some hand y resources to bootstrap your projects.

Here's a filtered dataset from Helpsteer2 with the most correct and coherent samples: burtenshaw/helpsteer-2-plus
This is a SFT finetuned model: ttps://huggingface.co/burtenshaw/gemma-help-tiny-sft
This is the notebook I use to train the model: https://colab.research.google.com/drive/17oskw_5lil5C3jCW34rA-EXjXnGgRRZw?usp=sharing
Here's a load of Unsloth notebook on finetuning and inference: https://docs.unsloth.ai/get-started/unsloth-notebooks

Collections 3

Papers 1

arxiv:2408.16961

spaces 24

Martinique

Create Dataset Ui

My Argilla

Argilla Fosllms

Argilla UI Demo Space (login: argilla/1234)

Argilla Llamaindex Monitor

models 11

burtenshaw/smol-vlm-trl-sft-ChartQA

Updated about 15 hours ago

burtenshaw/code-llama-3-2-1b-commerce

Text Generation • Updated 8 days ago • 25

burtenshaw/code-smol2-text-to-sql

Updated 9 days ago • 10

burtenshaw/Qwen2.5-3B-Instruct-GGUF

Updated 28 days ago • 5

burtenshaw/gemma-help-tiny-sft

Text Generation • Updated Aug 9 • 20 • 1

burtenshaw/Qwen1.5-0.5B-dpo-mix-7k

Text Generation • Updated Apr 3 • 8

burtenshaw/notus-merged-with-code-mistral-so-its-better-at-coding

Updated Apr 2 • 3

burtenshaw/Qwen1.5-0.5B-dpo-mix-7k-GGUF

burtenshaw/Qwen1.5-0.5B-dpo-mix-7k-5000

Text Generation • Updated Mar 29 • 12

burtenshaw/Qwen1.5-0.5B-dpo-mix-7k-3000

Text Generation • Updated Mar 29 • 12

datasets 21

burtenshaw/ohp-test-conversation

Preview • Updated 1 day ago • 7

burtenshaw/dataset-diff-test-changed

Viewer • Updated 29 days ago • 3 • 40

burtenshaw/dataset-diff-test

Viewer • Updated 29 days ago • 3 • 36

burtenshaw/most_used_models

Viewer • Updated Oct 23 • 250 • 52 • 1

burtenshaw/exam_questions

Viewer • Updated Oct 22 • 7 • 33

burtenshaw/pc-components-reviews-vectors

Viewer • Updated Oct 17 • 200 • 45

burtenshaw/fosllms-week-1-demo

Viewer • Updated Oct 16 • 12 • 42

burtenshaw/yahoo_answers_topics

Viewer • Updated Oct 3 • 100 • 55

burtenshaw/image-search-queries

Viewer • Updated Sep 10 • 199 • 45

burtenshaw/document-similarity

Viewer • Updated Sep 5 • 20 • 38