Training procedure
The following bitsandbytes quantization config was used during training:
- quant_method: bitsandbytes
- load_in_8bit: False
- load_in_4bit: True
- llm_int8_threshold: 6.0
- llm_int8_skip_modules: None
- llm_int8_enable_fp32_cpu_offload: False
- llm_int8_has_fp16_weight: False
- bnb_4bit_quant_type: nf4
- bnb_4bit_use_double_quant: True
- bnb_4bit_compute_dtype: float16
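For illustration, this config corresponds roughly to the following `BitsAndBytesConfig` when loading the base model with Transformers. This is a sketch assuming the standard Transformers/bitsandbytes integration, not code from this repository:

```python
# A sketch (not from this repository) of the equivalent BitsAndBytesConfig.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # load_in_4bit: True
    bnb_4bit_quant_type="nf4",             # bnb_4bit_quant_type: nf4
    bnb_4bit_use_double_quant=True,        # bnb_4bit_use_double_quant: True
    bnb_4bit_compute_dtype=torch.float16,  # bnb_4bit_compute_dtype: float16
)

# Loading the gated base checkpoint requires accepting Meta's license on the Hub.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    quantization_config=bnb_config,
    device_map="auto",
)
```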
Goal
The goal of this project is to fine-tune the Llama2-7b language model for code generation using the QLoRA method. The model takes natural language as input and returns code as output. We first iterate on the base Llama-2-7b model with prompting, and then instruction-fine-tune it.
As an example, if we prompt the model with this instruction:
Instruction: Create an array of length 5 which contains all even numbers between 1 and 10.
We want the model to produce exactly this response:
Response: array = [2, 4, 6, 8, 10]
QLoRA Method for Fine-Tuning
The QLoRA method for fine-tuning large language models (LLMs) is a parameter-efficient approach that uses 4-bit quantization to reduce the memory and computational requirements of fine-tuning. QLoRA is implemented in the PEFT library, which is built on top of the Hugging Face Transformers library.
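A minimal sketch of what a QLoRA setup looks like with PEFT, assuming the 4-bit model loaded above; the LoRA hyperparameters here are illustrative assumptions, not values taken from this project:

```python
# Sketch of attaching LoRA adapters to the 4-bit quantized model loaded above.
# All hyperparameters below are illustrative assumptions.
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=8,                                  # assumed adapter rank
    lora_alpha=16,                        # assumed scaling factor
    target_modules=["q_proj", "v_proj"],  # a common choice for Llama-style models
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small adapter matrices are trainable
```

Because only the low-rank adapter weights are updated while the 4-bit base weights stay frozen, fine-tuning fits on a single consumer GPU.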
Ludwig Data Format
Ludwig requires data to be in a specific format. The main components of the data format are:
- `input_features`: Defines the input features of the model. Each feature must have a `name` and `type`.
- `output_features`: Defines the output features of the model. Similar to input features, each output feature must have a `name` and `type`.
Here is an example of a simple Ludwig config:
```yaml
input_features:
  - name: instruction
    type: text
output_features:
  - name: output
    type: text
```
This config tells Ludwig to use the `instruction` column in the dataset as an input feature and the `output` column as an output feature.
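For reference, a minimal sketch of how such a config can be used with Ludwig's Python API; the dataset filename and its columns (`instruction`, `output`) are assumptions for illustration:

```python
# Sketch of training with Ludwig's Python API; the dataset file and its
# column names are hypothetical.
import pandas as pd
from ludwig.api import LudwigModel

df = pd.read_json("code_instructions.json")  # hypothetical dataset file

model = LudwigModel(config="config.yaml")    # the YAML config shown above
train_stats = model.train(dataset=df)
```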
Prerequisites
- Python 3.x
- Ludwig
- GPU (recommended for faster training)
Setup and Installation
- Clone the repository:

```bash
git clone https://github.com/omid-sar/Llama2-7B-Fine-Tuning--Google-Colab-.git
```

- Open the notebook `Fine_Tuning_Llama2_7b(_Ludwig).ipynb` in Google Colab or a local Jupyter environment.
- Install the required libraries:

```bash
pip install ludwig
```

- Follow the instructions in the notebook to download any additional datasets or models.
Usage
- Run the cells in the notebook sequentially, following the instructions and comments provided.
- Modify the model configuration, training parameters, or input data as needed to suit your use case.
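Once training finishes, inference can look roughly like this (a sketch using Ludwig's `predict` API on the model trained above; the prompt and DataFrame layout are illustrative):

```python
# Sketch of inference with the trained Ludwig model; the prompt is illustrative.
import pandas as pd

test_df = pd.DataFrame(
    [{"instruction": "Create an array of length 5 which contains all even "
                     "numbers between 1 and 10."}]
)
predictions, _ = model.predict(dataset=test_df)
print(predictions.head())  # prediction column names vary by Ludwig version
```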
Base model: `meta-llama/Llama-2-7b-hf`