Junde commited on
Commit
5f6652b
1 Parent(s): 206585f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -25,8 +25,7 @@ Please visit our [repo](https://github.com/Eikor/InstructPLM) and [paper](https:
25
  year = {2024},
26
  doi = {10.1101/2024.04.17.589642},
27
  publisher = {Cold Spring Harbor Laboratory},
28
- abstract = {Large language models are renowned for their efficacy in capturing intricate patterns, including co-evolutionary relationships, and underlying protein languages. However, current methodologies often fall short in illustrating the emergence of genomic insertions, duplications, and insertion/deletions (indels), which account for approximately 14\% of human pathogenic mutations. Given that structure dictates function, mutated proteins with similar structures are more likely to persist throughout biological evolution. Motivated by this, we leverage cross-modality alignment and instruct fine-tuning techniques inspired by large language models to align a generative protein language model with protein structure instructions. Specifically, we present a method for generating variable-length and diverse proteins to explore and simulate the complex evolution of life, thereby expanding the repertoire of options for protein engineering. Our proposed protein LM-based approach, InstructPLM, demonstrates significant performance enhancements both in silico and in vitro. On native protein backbones, it achieves a perplexity of 2.68 and a sequence recovery rate of 57.51, surpassing ProteinMPNN by 39.2\% and 25.1\%, respectively. Furthermore, we validate the efficacy of our model by redesigning PETase and L-MDH. For PETase, all fifteen designed variable-length PETase exhibit depolymerization activity, with eleven surpassing the activity levels of the wild type. Regarding L-MDH, an enzyme lacking an experimentally determined structure, InstructPLM is able to design functional enzymes with an AF2-predicted structure. Code and model weights of InstructPLM are publicly available.Competing Interest StatementThe authors have declared no competing interest.},
29
- URL = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.17.589642},
30
  eprint = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.17.589642.full.pdf},
31
  journal = {bioRxiv}
32
  }
 
25
  year = {2024},
26
  doi = {10.1101/2024.04.17.589642},
27
  publisher = {Cold Spring Harbor Laboratory},
28
+ URL = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.17.589642},
 
29
  eprint = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.17.589642.full.pdf},
30
  journal = {bioRxiv}
31
  }