A collection of models trained using deep RL for a variety of games.
Matt Boraske
MattBoraske
AI & ML interests
Reinforcement Learning, Natural Language Processing, LLM Finetuning
Collections: 3
Datasets curated from Reddit's r/amithea**hole subreddit, and models fine-tuned on them using QLoRA.
- MattBoraske/llama-2-7b-chat-reddit-AITA-multiclass • Text Generation • Updated • 3
- MattBoraske/llama-2-7b-chat-reddit-AITA-multiclass-top-2k • Text Generation • Updated • 2
- MattBoraske/llama-2-7b-chat-reddit-AITA-binary • Text Generation • Updated • 1
- MattBoraske/llama-2-7b-chat-reddit-AITA-binary-top-2k • Text Generation • Updated • 4
Models: 35
MattBoraske/llama-2-13b-chat-reddit-AITA-benign-consenting • Text Generation • Updated • 6
MattBoraske/llama-2-13b-chat-reddit-AITA-consenting • Text Generation • Updated • 1
MattBoraske/llama-2-13b-chat-reddit-AITA-benign • Text Generation • Updated • 1
MattBoraske/llama-2-13b-chat-reddit-AITA-all-samples • Text Generation • Updated • 4
MattBoraske/flan-t5-xxl-reddit-AITA-binary-top-2k • Text2Text Generation • Updated • 2
MattBoraske/flan-t5-xxl-reddit-AITA-binary • Text2Text Generation • Updated • 5
MattBoraske/flan-t5-xxl-reddit-AITA-multiclass-top-2k • Text2Text Generation • Updated • 1
MattBoraske/flan-t5-xxl-reddit-AITA-multiclass • Text2Text Generation • Updated
MattBoraske/flan-t5-xl-reddit-AITA-binary-top-2k • Text2Text Generation • Updated
MattBoraske/flan-t5-xl-reddit-AITA-binary • Text2Text Generation • Updated • 2