carlosdanielhernandezmena commited on
Commit
213ad52
1 Parent(s): 473c53f

Adding info to the README for the first time.

Browse files
Files changed (1) hide show
  1. README.md +71 -3
README.md CHANGED
@@ -1,3 +1,71 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - ca
5
+ datasets:
6
+ - projecte-aina/3catparla_asr
7
+ tags:
8
+ - audio
9
+ - automatic-speech-recognition
10
+ - catalan
11
+ - faster-whisper
12
+ - whisper-large-v3
13
+ - catalonia
14
+ - barcelona-supercomputing-center
15
+ - projecte-aina
16
+ - 3catparla
17
+ ---
18
+ # faster-whisper-large-v3-ca-3catparla
19
+
20
+ This is a faster-whisper version of [projecte-aina/whisper-large-v3-ca-3catparla](https://huggingface.co/projecte-aina/whisper-large-v3-ca-3catparla).
21
+
22
+ The specific dataset used to create the model is the [3CatParla](https://huggingface.co/datasets/projecte-aina/3catparla_asr).
23
+
24
+ The model was created like described in [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master):
25
+
26
+ ```bash
27
+ ct2-transformers-converter --model projecte-aina/whisper-large-v3-ca-3catparla
28
+ --output_dir faster-whisper-large-v3-ca-3catparla
29
+ --copy_files preprocessor_config.json
30
+ --quantization float16
31
+ ```
32
+
33
+ # Usage
34
+
35
+ ```python
36
+ from faster_whisper import WhisperModel
37
+
38
+ model_size = "projecte-aina/faster-whisper-large-v3-ca-3catparla"
39
+
40
+ # Run on GPU with FP16
41
+ model = WhisperModel(model_size, device="cuda", compute_type="float16")
42
+
43
+ # or run on GPU with INT8
44
+ #model = WhisperModel(model_size, device="cuda", compute_type="int8_float16")
45
+ # or run on CPU with INT8
46
+ # model = WhisperModel(model_size, device="cpu", compute_type="int8")
47
+
48
+ segments, info = model.transcribe("audio_in_catalan.mp3", beam_size=5, task="translate",language="ca")
49
+
50
+ print("Detected language '%s' with probability %f" % (info.language, info.language_probability))
51
+
52
+ for segment in segments:
53
+ print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
54
+ ```
55
+
56
+ # BibTeX entry and citation info
57
+ * When publishing results based on these models please refer to:
58
+ ```bibtex
59
+ @misc{mena2024fastwhis3catparla,
60
+ title={Acoustic Model in Catalan: faster-whisper-large-v3-ca-3catparla.},
61
+ author={Hernandez Mena, Carlos Daniel},
62
+ organization={Barcelona Supercomputing Center},
63
+ url={https://huggingface.co/projecte-aina/faster-whisper-large-v3-ca-3catparla},
64
+ year={2024},
65
+ }
66
+ ```
67
+
68
+ # Acknowledgements
69
+
70
+ This model has been promoted and financed by the Government of Catalonia through the Aina project.
71
+