mjbuehler commited on
Commit
eaad9f2
1 Parent(s): a24aab1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -36,7 +36,7 @@ Cephalo is a series of multimodal materials science and engineering focused visi
36
 
37
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/kl5GWBP9WS0D4uwd1t3S7.png)
38
 
39
- This version of Cephalo, lamm-mit/Cephalo-Idefics2-3x8b-beta, is a Mixture-of-Expert model based on variants and fine-tuned versions of the Idefics-2 model. The basic model architecture is as follows:
40
 
41
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/b7BK8ZtDzTMsyFDi0wP3w.png)
42
 
@@ -67,7 +67,7 @@ def count_parameters(model):
67
 
68
  device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
69
 
70
- model_name_moe = f"lamm-mit/Cephalo-Idefics2-3x8b-beta"
71
  config = AutoConfig.from_pretrained(model_name_moe, trust_remote_code=True)
72
  processor = AutoProcessor.from_pretrained(model_name_moe, trust_remote_code=True)
73
  moe_model = AutoModelForCausalLM.from_pretrained(
 
36
 
37
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/kl5GWBP9WS0D4uwd1t3S7.png)
38
 
39
+ This version of Cephalo, lamm-mit/Cephalo-Idefics2-vision-3x8b-beta, is a Mixture-of-Expert model based on variants and fine-tuned versions of the Idefics-2 model. The basic model architecture is as follows:
40
 
41
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/b7BK8ZtDzTMsyFDi0wP3w.png)
42
 
 
67
 
68
  device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
69
 
70
+ model_name_moe = f"lamm-mit/Cephalo-Idefics2-vision-3x8b-beta"
71
  config = AutoConfig.from_pretrained(model_name_moe, trust_remote_code=True)
72
  processor = AutoProcessor.from_pretrained(model_name_moe, trust_remote_code=True)
73
  moe_model = AutoModelForCausalLM.from_pretrained(