Update src/about.py
- src/about.py +25 -0
src/about.py
CHANGED
@@ -68,6 +68,31 @@ Evaluating the medical knowledge and clinical reasoning capabilities of LLMs is

 We evaluate the models on 9 medical Q&A datasets using the <a href="https://github.com/EleutherAI/lm-evaluation-harness" target="_blank"> Eleuther AI Language Model Evaluation Harness </a>, a unified framework to test language models on different tasks.

+<h2 style="color: #2c3e50;">Advisory Notice</h2>
+
+The Open Medical-LLM Leaderboard showcases medical models intended solely for research and development purposes. It is important to be aware of the following:
+
+Regulatory Status: The models listed on this leaderboard have not been approved or registered by any regulatory authorities, including the US FDA, the European Medicines Agency (EMA), Health Canada, or the Therapeutic Goods Administration (TGA) in Australia. They are not listed in the US FDA Database for approved AI in healthcare or the EUDAMED database.
+
+Disclaimer: These models are not intended for direct patient care, clinical decision support, or any other professional medical purposes. Their use should be limited to research, development, and exploratory applications by qualified individuals who understand their limitations and the regulatory requirements.
+
+Risk Warning: The outputs of these models may contain inaccuracies, biases, or misalignments that could pose risks if relied upon for medical decision-making. The models' performance has not been rigorously evaluated in randomized controlled trials or real-world healthcare environments.
+
+Research Tool Only: The models on this leaderboard are intended solely as research tools to assist healthcare professionals and should never be considered a replacement for the professional judgment and expertise of a qualified medical doctor.
+
+Further Validation Needed: Proper adaptation and validation of these models for specific medical use cases would require significant additional work, including:
+
+1) Thorough testing and evaluation in relevant clinical scenarios.
+2) Alignment with evidence-based guidelines and best practices.
+3) Mitigation of potential biases and failure modes.
+4) Integration with human oversight and interpretation.
+5) Compliance with regulatory and ethical standards.
+
+For any legal inquiries or concerns, please contact the authors of the MedPaLM papers directly.
+
+Always consult a qualified healthcare provider for personal medical needs.
+
+
 <h2 style="color: #2c3e50;">About Open Life Science AI</h2>
 An Open Life Science Project to Benchmark and Track AI Progress, Share Models and Datasets in the Life Science Field.
 <a href="https://openlifescience.ai/" target="_blank"> More info </a>