Spaces:

openlifescienceai
/

open_medical_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

aaditya commited on Jun 2

Commit

32998b8

•

1 Parent(s): ec19ecf

Update src/about.py

Browse files

Files changed (1) hide show

src/about.py +7 -2

src/about.py CHANGED Viewed

@@ -51,7 +51,8 @@ The datasets cover various aspects of medicine such as general medical knowledge
 The main evaluation metric used is Accuracy (ACC). Submit a model for automated evaluation on the "Submit" page. If you have comments or suggestions on additional medical datasets to include, please reach out to us in our discussion forum.
-Notice: The models on this leaderboard are not approved for clinical use and are intended for research purposes only. Please refer to the "Advisory Notice" section in the "About" page.
 The backend of the Open Medical LLM Leaderboard uses the Eleuther AI Language Model Evaluation Harness. More technical details can be found in the "About" page.
@@ -70,8 +71,12 @@ Evaluating the medical knowledge and clinical reasoning capabilities of LLMs is
 <h2 style="color: #2c3e50;">Advisory Notice</h2>
 The Open Medical-LLM Leaderboard showcases medical models intended solely for research and development purposes. It is important to be aware of the following:
 Regulatory Status: The models listed on this leaderboard have not been approved or registered by any regulatory authorities, including the US FDA, the European Medicines Agency (EMA), Health Canada, or the Therapeutic Goods Administration (TGA) in Australia. They are not listed in the US FDA Database for approved AI in healthcare or the EUDAMED database.
 Disclaimer: These models are not intended for direct patient care, clinical decision support, or any other professional medical purposes. Their use should be limited to research, development, and exploratory applications by qualified individuals who understand their limitations and the regulatory requirements.
@@ -80,7 +85,7 @@ Risk Warning: The outputs of these models may contain inaccuracies, biases, or m
 Research Tool Only: The models on this leaderboard are intended solely as research tools to assist healthcare professionals and should never be considered a replacement for the professional judgment and expertise of a qualified medical doctor.
-Further Validation Needed: Proper adaptation and validation of these models for specific medical use cases would require significant additional work, including:
 1) Thorough testing and evaluation in relevant clinical scenarios.
 2) Alignment with evidence-based guidelines and best practices.

 The main evaluation metric used is Accuracy (ACC). Submit a model for automated evaluation on the "Submit" page. If you have comments or suggestions on additional medical datasets to include, please reach out to us in our discussion forum.
+Evaluation Purpose: The primary role of this leaderboard is to assess and compare the performance of the models. It does not facilitate the distribution, deployment, or clinical use of these models.
+The models on this leaderboard are not approved for clinical use and are intended for research purposes only. Please refer to the "Advisory Notice" section in the "About" page.
 The backend of the Open Medical LLM Leaderboard uses the Eleuther AI Language Model Evaluation Harness. More technical details can be found in the "About" page.
 <h2 style="color: #2c3e50;">Advisory Notice</h2>
 The Open Medical-LLM Leaderboard showcases medical models intended solely for research and development purposes. It is important to be aware of the following:
+Evaluation Purpose: The primary role of this leaderboard is to assess and compare the performance of the models. It does not facilitate the distribution, deployment, or clinical use of these models.
 Regulatory Status: The models listed on this leaderboard have not been approved or registered by any regulatory authorities, including the US FDA, the European Medicines Agency (EMA), Health Canada, or the Therapeutic Goods Administration (TGA) in Australia. They are not listed in the US FDA Database for approved AI in healthcare or the EUDAMED database.
 Disclaimer: These models are not intended for direct patient care, clinical decision support, or any other professional medical purposes. Their use should be limited to research, development, and exploratory applications by qualified individuals who understand their limitations and the regulatory requirements.
 Research Tool Only: The models on this leaderboard are intended solely as research tools to assist healthcare professionals and should never be considered a replacement for the professional judgment and expertise of a qualified medical doctor.
+Further Validation Needed: Proper adaptation and validation of these models for specific medical use cases would require significant additional work, some of them including:
 1) Thorough testing and evaluation in relevant clinical scenarios.
 2) Alignment with evidence-based guidelines and best practices.