DISLab
/

SummLlama3-8B

Model card Files Files and versions

Hwanjun commited on Oct 15

Commit

0d555ec

•

1 Parent(s): 6645da9

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -7,13 +7,18 @@ pipeline_tag: summarization
   <b style="font-size: 40px;">SummLlama3-8B</b>
 </div>
-Are you looking for a summarizer that can generate more **human-preferred summaries**?
 Our **SummLlama3-8B** could be exactly what you need!
-SummLlama3 is initialized from Llama3-8B-Instruct, with additional training using Direct Preference Optimization (DPO) based on human-like summarization feedback.
-It outperforms the nearly 10x larger Llama3-70B-Instruct while offering much faster inference speed.
 Please refer to [our paper](link) to catch up how to exploit LLM-generated feedback in the context of text summarization.

   <b style="font-size: 40px;">SummLlama3-8B</b>
 </div>
+Are you looking for a summarizer that can generate more **human-preferred summaries** across multiple domains?
 Our **SummLlama3-8B** could be exactly what you need!
+SummLlama3 is initialized from Llama3-8B-Instruct, with additional training using Direct Preference Optimization (DPO) based on large-scale (over 100K) summarization feedback.
+The feedback encompasses a wide range of input documents, from short to lengthy texts, including both dialogue and non-dialogue formats, and spans across seven distinct domains:
+- Four non-dialouge domains: News, Lifestyle, Report, Medical
+- Three dialogue domains: Daily Life, Interview, Meeting
+Surprisingly, it outperforms the nearly 10x larger Llama3-70B-Instruct while offering much faster inference speed.
 Please refer to [our paper](link) to catch up how to exploit LLM-generated feedback in the context of text summarization.