bloom-1b7-it / README.md
metadata
license: bigscience-bloom-rail-1.0
datasets:
  - swap-uniba/itwiki-march-2024
language:
  - it
tags:
  - bloom
  - italian

Model Card for bloom-1b7-it

This model is a language-adapted version of the original bloom-1b7 model. We continued pre-training on Italian data without adapting the vocabulary, using about 2.8M documents from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024). The model was trained for one epoch using LoRA and SFT.

Model Details

Model Description

  • Developed by: SWAP Research Group, Department of Computer Science, University of Bari Aldo Moro
  • Model type: BLOOM
  • Language(s) (NLP): Italian
  • License: bigscience-bloom-rail-1.0
  • Finetuned from model: bloom-1b7
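Since the model type is BLOOM, it should load through the standard causal language model classes of the `transformers` library. The following is a minimal sketch, not an official usage recipe: the repository id `swap-uniba/bloom-1b7-it` is an assumption inferred from the file path and the dataset namespace, and the helper names `generation_kwargs` and `generate` are hypothetical.

```python
# Assumption: this Hub repository id is inferred from the file path and the
# dataset namespace; it is not stated anywhere in this card.
MODEL_ID = "swap-uniba/bloom-1b7-it"


def generation_kwargs(max_new_tokens=50):
    """Greedy decoding settings, chosen here only for a quick smoke test."""
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def generate(prompt, model_id=MODEL_ID, max_new_tokens=50):
    """Continue an Italian prompt with the adapted model (hypothetical helper)."""
    # Imported lazily so the heavy dependencies (transformers, torch) are
    # only needed when text is actually generated.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, **generation_kwargs(max_new_tokens))
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("La capitale d'Italia è")` downloads the weights on first use. Because the model was adapted by continued pre-training on Wikipedia text, it is best treated as a text-completion model rather than a chat model.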

Training Details

Training Data

About 2.8M documents obtained from Italian Wikimedia dumps (swap-uniba/itwiki-march-2024).

Training Procedure

One epoch of training with LoRA (Low-Rank Adaptation) and SFT (supervised fine-tuning).
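To make the LoRA step concrete, here is a small pure-Python sketch of the general technique: the pretrained weight matrix W stays frozen, and only two low-rank factors A and B are trained, so the layer computes W x + (alpha / r) B A x. The card does not state the rank, the scaling factor, or which layers were adapted, so every number below is an illustrative assumption, and the function names are hypothetical.

```python
def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def lora_forward(W, A, B, x, alpha, rank):
    """Compute W x + (alpha / rank) * B (A x); W is frozen, A and B are trained."""
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


def lora_trainable_params(d_in, d_out, rank):
    """Parameter count of A (rank x d_in) plus B (d_out x rank)."""
    return rank * (d_in + d_out)


# Illustrative sizes only (assumptions, not values taken from this card).
d, r = 2048, 8
print(lora_trainable_params(d, d, r))  # trainable parameters with LoRA
print(d * d)                           # parameters in the full d x d matrix

# B is usually initialised to zero, so the adapted layer starts out
# identical to the frozen one.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 1.0]]
B_zero = [[0.0], [0.0]]
print(lora_forward(W, A, B_zero, [1.0, 2.0], alpha=2.0, rank=1))
```

With these illustrative sizes the low-rank factors hold well under 1% of the parameters of the full matrix, which is what makes adapting a 1.7B-parameter model affordable.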

Training Hyperparameters

  • Training regime: fp16
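For readers unfamiliar with the fp16 regime, the sketch below uses only the Python standard library to show how coarse IEEE 754 half precision is. It is an illustration of the number format, not of the training setup: the card does not say whether mixed precision or pure fp16 was used, and `to_fp16` is a hypothetical helper name.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    # The "e" format code packs a float into 2 bytes (binary16).
    return struct.unpack("<e", struct.pack("<e", x))[0]


print(to_fp16(0.5))      # small powers of two are represented exactly
print(to_fp16(0.1))      # most decimal fractions are rounded noticeably
print(to_fp16(2049.0))   # above 2048, not every integer is representable
print(to_fp16(65504.0))  # the largest finite fp16 value
```

The narrow range and precision are why fp16 training typically relies on techniques such as loss scaling to avoid overflow and underflow.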

Model Card Authors

Pierpaolo Basile, University of Bari Aldo Moro, Italy.

Model Card Contact

Pierpaolo Basile, University of Bari Aldo Moro, Italy.