metadata
license: apache-2.0
datasets:
- mosaicml/dolly_hhrlhf
language:
- en
library_name: transformers
pipeline_tag: text-generation
OPEN-LLAMA-300btkn-instruction-following-v1
import os
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = 'VMware/open-llama-300btkn-instruction-following-v1'
model_path = '/home/gollapudit/peft/open_lm_m_dolly_hhrlf'
tokenizer = AutoTokenizer.from_pretrained(model_name, add_bos_token = True)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype= torch.bfloat16, device_map = 'auto')
prompt_template = "Below is an instruction that describes a task. Write a descriptive response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response:"
prompt= 'how do I bake a cake?'
inputt = prompt_template.format(instruction= prompt)
input_ids = tokenizer(inputt, return_tensors="pt").input_ids.to("cuda")
output1 = model.generate(input_ids, max_length=512)
input_length = input_ids.shape[1]
output1 = output1[:, input_length:]
output= tokenizer.decode(output1[0])
print(output)
'''
Baking a cake is a simple process. You will need to prepare a cake mixture, then bake it in the oven. You can add various ingredients to the cake mixture, such as fruit, nuts, or spices, to make it flavorful. Baking a cake can be fun, as it creates a delicious dessert!</s>
'''