--- datasets: - nyu-mll/glue - paws - hlgd - quora - tasksource/parade - tasksource/apt - medical_questions_pairs language: - en --- | Dataset Name | Test Accuracy | |--------------------------|---------------| | glue/mrpc | 0.856 | | glue/qqp | 0.876 | | hlgd | 0.898 | | paws/labeled_final | 0.952 | | paws/labeled_swap | 0.968 | | medical_questions_pairs | 0.8562 | | parade | 0.732 | | apt | 0.824 | ``` @article{sileo2023tasksource, title={tasksource: A Dataset Harmonization Framework for Streamlined NLP Multi-Task Learning and Evaluation}, author={Sileo, Damien}, journal={arXiv preprint arXiv:2301.05948}, year={2023} } ``` (Accepted at LREC-COLING 2024)