Piper recording studio dataset reference
Hi, I recorded my own voice with piper recording studio to create an Italian dataset (I read all 1000+ prompts), and then I trained a medium model (fine-tuned from the lessac model). I would like to submit it, but I don't know what dataset reference I should put in the MODEL_CARD. Do I also have to upload the dataset somewhere?
You don't have to upload it, but it would be a great contribution :)
If you're willing to donate your data to the public domain, I'd be happy to host it here: https://github.com/NabuCasa/voice-datasets
Otherwise, you can just say it came from a personal dataset.
Thank you for the answer! Since I really appreciate your work, I've decided to give something back. I've created the PR for the model, and you can find the dataset here: https://huggingface.co/datasets/paolapersico1/Voice-Dataset-Italian. I hope everything's okay :)
Looks great, this is much appreciated! Would you like the dataset to be called "paola"?
Yes, thank you!