Transducens/kind_teacher


Rastapar committed 3 days ago

Commit 2ce30d3 (1 parent: ce8f196)

Update README.md

Files changed (1)

README.md +12 -0

@@ -28,6 +28,18 @@ that facilitates the fine-tuning of various well-known LLMs on custom data.
 Parameter-efficient fine-tuning is achieved via the QLoRA method [Dettmers et al., 2023](https://proceedings.neurips.cc/paper_files/paper/2023/file/1feb87871436031bdc0f2beaa62a049b-Paper-Conference.pdf).
+Number of conversation turns and words in the original datasets and after splitting long conversations:
+| **Dataset**      | **Turns (Original)** | **Words (Original)** | **Turns (Split turns)** | **Words (Split turns)** |
+|------------------|:--------------------:|:--------------------:|:-----------------------:|:-----------------------:|
+| TSCC v2          |        570           |        788k          |         1074            |         786k            |
+| CIMA             |       1135           |         44k          |         1135            |          38k            |
+| MathDial         |       2861           |        923k          |         2876            |         879k            |
+| Multicultural    |         5            |        614k          |          643            |         614k            |
+| Uptake           |        774           |         35k          |          775            |          34k            |
+| **Total**        |     **5345**         |     **2404k**        |      **6503**           |      **2351k**          |
 ## Usage Guide
 This project was executed on an Ubuntu 22.04.3 system running Linux kernel 6.8.0-40-generic.