Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,6 @@ base_model:
|
|
18 |
# ChemSolubilityBERTa
|
19 |
## Model Description
|
20 |
ChemSolubilityBERTa is a prototype designed to predict the aqueous solubility of chemical compounds from their SMILES representations. Based on ChemBERTa, a BERT-like transformer-based architecture, ChemBERTa pre-trained on 77M SMILES strings for molecular property prediction. We adapted ChemBERTa to predict solubility values by fine-tuning ChemBERTa with the ESOL (Estimated SOLubility) dataset, a water solubility prediction dataset of 1,128 samples. A user inputs a SMILES string, and the model outputs a log solubility value (log mol/L).
|
21 |
-
|
22 |
You can read the full paper [here](./01_ChemSolubilityBERTa.pdf).
|
23 |
|
24 |
## Fine-Tuning Details
|
@@ -42,9 +41,17 @@ inputs = tokenizer(smiles_string, return_tensors='pt')
|
|
42 |
outputs = model(**inputs)
|
43 |
solubility = outputs.logits.item()
|
44 |
print(f"Predicted solubility: {solubility}")
|
|
|
|
|
45 |
|
46 |
-
|
47 |
-
|
48 |
-
This model is licensed under the [MIT License](https://opensource.org/licenses/MIT).
|
49 |
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
|
|
|
|
|
|
18 |
# ChemSolubilityBERTa
|
19 |
## Model Description
|
20 |
ChemSolubilityBERTa is a prototype designed to predict the aqueous solubility of chemical compounds from their SMILES representations. Based on ChemBERTa, a BERT-like transformer-based architecture, ChemBERTa pre-trained on 77M SMILES strings for molecular property prediction. We adapted ChemBERTa to predict solubility values by fine-tuning ChemBERTa with the ESOL (Estimated SOLubility) dataset, a water solubility prediction dataset of 1,128 samples. A user inputs a SMILES string, and the model outputs a log solubility value (log mol/L).
|
|
|
21 |
You can read the full paper [here](./01_ChemSolubilityBERTa.pdf).
|
22 |
|
23 |
## Fine-Tuning Details
|
|
|
41 |
outputs = model(**inputs)
|
42 |
solubility = outputs.logits.item()
|
43 |
print(f"Predicted solubility: {solubility}")
|
44 |
+
```
|
45 |
+
## Citation and Usage
|
46 |
|
47 |
+
If you use ChemSolubilityBERTa in your research or projects, please cite the following:
|
|
|
|
|
48 |
|
49 |
+
@misc{ChemSolubilityBERTa,
|
50 |
+
author = {Farooq Khan},
|
51 |
+
title = {ChemSolubilityBERTa: A Transformer-Based Model for Predicting Aqueous Solubility from SMILES},
|
52 |
+
year = {2024},
|
53 |
+
url = {https://huggingface.co/khanfs/ChemSolubilityBERTa}
|
54 |
+
}
|
55 |
|
56 |
+
## License
|
57 |
+
This model is licensed under the [MIT License](https://opensource.org/licenses/MIT).
|