Bram Vanroy's picture

Bram Vanroy PRO

BramVanroy

·

https://bramvanroy.github.io/

AI & ML interests

Artificial intelligence, natural language processing, computational linguistics

Recent Activity

liked a Space 6 days ago

argilla/synthetic-data-generator

New activity 6 days ago

argilla/synthetic-data-generator:Hard to read in dark mode

New activity 8 days ago

HPLT/hplt_bert_base_fr:Adding `safetensors` variant of this model

View all activity

Organizations

Posts 11

Post

1561

The InstructGPT paper mentions that they insert 10% pretraining data during SFT, which they find improves the effect of PPO (IIUC). Has anyone else done later ablations on this? I've only seen the inverse suggested, mixing in SFT data during pretraining.

Post

2257

All my models seem to be plagued by infinite lists. When you ask a question that requires it to write a list, it most often keeps adding bullet points or enumeration. I am wondering whether this is a result of using chatty GPT-4 as DPO preferences. Any thoughts?

Collections 7

Papers 1

arxiv:2312.12852

spaces 5

Running on Zero

Fietje

An efficient, open LMM for Dutch

Text To AMR

Dutch Simplification

Open Dutch LLM Leaderboard

MATEO

models 38

BramVanroy/fietje-2

Text Generation • Updated 30 days ago • 705 • 6

BramVanroy/fietje-2-instruct

Text Generation • Updated 30 days ago • 1.11k • 2

BramVanroy/fietje-2-chat

Text Generation • Updated 30 days ago • 1.65k • 1

BramVanroy/GEITje-7B-ultra

Text Generation • Updated 30 days ago • 808 • 37

BramVanroy/GEITje-7B-ultra-GGUF

Updated Sep 5 • 600 • 6

BramVanroy/fietje-2-chat-gguf

Updated Aug 27 • 120 • 4

BramVanroy/fietje-2-instruct-gguf

Updated Aug 27 • 86 • 2

BramVanroy/fietje-2-gguf

Updated Aug 27 • 102 • 1

BramVanroy/tweety-7b-dutch-v24a-GGUF

Updated May 9 • 50 • 1

BramVanroy/fietje-3-mini-4k-instruct-GGUF

Updated May 5 • 57 • 2

datasets 26

BramVanroy/lmsys-20240814-nl

Viewer • Updated Oct 21 • 2.75k • 22

BramVanroy/en-to-la-instruct

Viewer • Updated Aug 23 • 45

BramVanroy/stack_md_lid

Viewer • Updated Aug 22 • 21M • 426 • 4

BramVanroy/Openhermes-2.5-dutch-46k-format

Viewer • Updated Aug 21 • 43.7k • 57

BramVanroy/fietje-2-data

Viewer • Updated Jun 4 • 13.8M • 83

BramVanroy/occiglot-fineweb-v0.5-nl

Viewer • Updated Jun 3 • 16.1M • 127 • 1

BramVanroy/no_robots_dutch

Viewer • Updated Jun 1 • 8.61k • 88 • 2

BramVanroy/ultra_feedback_dutch_cleaned

Viewer • Updated May 13 • 183k • 276 • 3

BramVanroy/WildChat-1M-filtered-gpt-4

Viewer • Updated May 4 • 139k • 50

BramVanroy/orca_dpo_pairs_dutch_cleaned

Viewer • Updated Apr 24 • 31.6k • 51 • 2