khanfs commited on
Commit
bd6b1d9
1 Parent(s): 2bd131f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -4
README.md CHANGED
@@ -18,7 +18,6 @@ base_model:
18
  # ChemSolubilityBERTa
19
  ## Model Description
20
  ChemSolubilityBERTa is a prototype designed to predict the aqueous solubility of chemical compounds from their SMILES representations. Based on ChemBERTa, a BERT-like transformer-based architecture, ChemBERTa pre-trained on 77M SMILES strings for molecular property prediction. We adapted ChemBERTa to predict solubility values by fine-tuning ChemBERTa with the ESOL (Estimated SOLubility) dataset, a water solubility prediction dataset of 1,128 samples. A user inputs a SMILES string, and the model outputs a log solubility value (log mol/L).
21
-
22
  You can read the full paper [here](./01_ChemSolubilityBERTa.pdf).
23
 
24
  ## Fine-Tuning Details
@@ -42,9 +41,17 @@ inputs = tokenizer(smiles_string, return_tensors='pt')
42
  outputs = model(**inputs)
43
  solubility = outputs.logits.item()
44
  print(f"Predicted solubility: {solubility}")
 
 
45
 
46
- ##How to Use
47
-
48
- This model is licensed under the [MIT License](https://opensource.org/licenses/MIT).
49
 
 
 
 
 
 
 
50
 
 
 
 
18
  # ChemSolubilityBERTa
19
  ## Model Description
20
  ChemSolubilityBERTa is a prototype designed to predict the aqueous solubility of chemical compounds from their SMILES representations. Based on ChemBERTa, a BERT-like transformer-based architecture, ChemBERTa pre-trained on 77M SMILES strings for molecular property prediction. We adapted ChemBERTa to predict solubility values by fine-tuning ChemBERTa with the ESOL (Estimated SOLubility) dataset, a water solubility prediction dataset of 1,128 samples. A user inputs a SMILES string, and the model outputs a log solubility value (log mol/L).
 
21
  You can read the full paper [here](./01_ChemSolubilityBERTa.pdf).
22
 
23
  ## Fine-Tuning Details
 
41
  outputs = model(**inputs)
42
  solubility = outputs.logits.item()
43
  print(f"Predicted solubility: {solubility}")
44
+ ```
45
+ ## Citation and Usage
46
 
47
+ If you use ChemSolubilityBERTa in your research or projects, please cite the following:
 
 
48
 
49
+ @misc{ChemSolubilityBERTa,
50
+ author = {Farooq Khan},
51
+ title = {ChemSolubilityBERTa: A Transformer-Based Model for Predicting Aqueous Solubility from SMILES},
52
+ year = {2024},
53
+ url = {https://huggingface.co/khanfs/ChemSolubilityBERTa}
54
+ }
55
 
56
+ ## License
57
+ This model is licensed under the [MIT License](https://opensource.org/licenses/MIT).