carlosdanielhernandezmena commited on
Commit
5cd0e04
1 Parent(s): 213ad52

Adding more information to the model card

Browse files
Files changed (1) hide show
  1. README.md +76 -12
README.md CHANGED
@@ -17,20 +17,55 @@ tags:
17
  ---
18
  # faster-whisper-large-v3-ca-3catparla
19
 
20
- This is a faster-whisper version of [projecte-aina/whisper-large-v3-ca-3catparla](https://huggingface.co/projecte-aina/whisper-large-v3-ca-3catparla).
 
 
21
 
22
- The specific dataset used to create the model is the [3CatParla](https://huggingface.co/datasets/projecte-aina/3catparla_asr).
 
 
 
 
 
 
23
 
24
- The model was created like described in [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master):
25
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
26
  ```bash
27
- ct2-transformers-converter --model projecte-aina/whisper-large-v3-ca-3catparla
28
- --output_dir faster-whisper-large-v3-ca-3catparla
29
- --copy_files preprocessor_config.json
30
- --quantization float16
 
 
 
 
 
31
  ```
32
 
33
- # Usage
 
34
 
35
  ```python
36
  from faster_whisper import WhisperModel
@@ -53,8 +88,21 @@ for segment in segments:
53
  print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
54
  ```
55
 
56
- # BibTeX entry and citation info
57
- * When publishing results based on these models please refer to:
 
 
 
 
 
 
 
 
 
 
 
 
 
58
  ```bibtex
59
  @misc{mena2024fastwhis3catparla,
60
  title={Acoustic Model in Catalan: faster-whisper-large-v3-ca-3catparla.},
@@ -65,7 +113,23 @@ for segment in segments:
65
  }
66
  ```
67
 
68
- # Acknowledgements
 
 
 
 
 
 
 
 
 
 
 
 
 
 
69
 
70
- This model has been promoted and financed by the Government of Catalonia through the Aina project.
 
71
 
 
 
17
  ---
18
  # faster-whisper-large-v3-ca-3catparla
19
 
20
+ ## Table of Contents
21
+ <details>
22
+ <summary>Click to expand</summary>
23
 
24
+ - [Model description](#model-description)
25
+ - [Intended uses and limitations](#intended-uses-and-limitations)
26
+ - [How to use](#how-to-use)
27
+ - [Training](#training)
28
+ - [Evaluation](#evaluation)
29
+ - [Citation](#citation)
30
+ - [Additional information](#additional-information)
31
 
32
+ </details>
33
 
34
+ ## Summary
35
+
36
+ The "faster-whisper-large-v3-ca-3catparla" is an acoustic model based on a [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master) version of [projecte-aina/whisper-large-v3-ca-3catparla](https://huggingface.co/projecte-aina/whisper-large-v3-ca-3catparla) suitable for Automatic Speech Recognition in Catalan.
37
+
38
+ ## Model Description
39
+
40
+ The "faster-whisper-large-v3-ca-3catparla" is the result of converting the [projecte-aina/whisper-large-v3-ca-3catparla](https://huggingface.co/projecte-aina/whisper-large-v3-ca-3catparla) into a lighter model using a python module called [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master).
41
+
42
+ The specific dataset used to create the [projecte-aina/whisper-large-v3-ca-3catparla](https://huggingface.co/projecte-aina/whisper-large-v3-ca-3catparla) model is called ["3CatParla"](https://huggingface.co/datasets/projecte-aina/3catparla_asr).
43
+
44
+ ## Intended Uses and Limitations
45
+
46
+ This model can used for Automatic Speech Recognition (ASR) in Catalan. The model is intended to transcribe audio files in Catalan to plain text without punctuation.
47
+
48
+ ## How to Get Started with the Model
49
+
50
+ ### Installation
51
+
52
+ In order to use this model, you may install [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master)
53
+
54
+ Create a virtual environment:
55
  ```bash
56
+ python -m venv /path/to/venv
57
+ ```
58
+ Activate the environment:
59
+ ```bash
60
+ source /path/to/venv/bin/activate
61
+ ```
62
+ Install the modules:
63
+ ```bash
64
+ pip install faster-whisper
65
  ```
66
 
67
+ ### For Inference
68
+ In order to transcribe audio in Catalan using this model, you can follow this example:
69
 
70
  ```python
71
  from faster_whisper import WhisperModel
 
88
  print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
89
  ```
90
 
91
+ ## Conversion Details
92
+
93
+ ### Conversion procedure
94
+
95
+ This model is not a direct result of training. It is a conversion of a [Whisper](https://huggingface.co/openai/whisper-large-v3) model using [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master). The procedure to create the model is as follows:
96
+
97
+ ```bash
98
+ ct2-transformers-converter --model projecte-aina/whisper-large-v3-ca-3catparla
99
+ --output_dir faster-whisper-large-v3-ca-3catparla
100
+ --copy_files preprocessor_config.json
101
+ --quantization float16
102
+ ```
103
+
104
+ ## Citation
105
+ If this model contributes to your research, please cite the work:
106
  ```bibtex
107
  @misc{mena2024fastwhis3catparla,
108
  title={Acoustic Model in Catalan: faster-whisper-large-v3-ca-3catparla.},
 
113
  }
114
  ```
115
 
116
+ ## Additional Information
117
+
118
+ ### Author
119
+
120
+ The conversion process was perform during July (2024) in the [Language Technologies Unit](https://huggingface.co/BSC-LT) of the [Barcelona Supercomputing Center](https://www.bsc.es/) by [Carlos Daniel Hernández Mena](https://huggingface.co/carlosdanielhernandezmena).
121
+
122
+ ### Contact
123
+ For further information, please send an email to <langtech@bsc.es>.
124
+
125
+ ### Copyright
126
+ Copyright(c) 2024 by Language Technologies Unit, Barcelona Supercomputing Center.
127
+
128
+ ### License
129
+
130
+ [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0)
131
 
132
+ ### Funding
133
+ This work has been promoted and financed by the Generalitat de Catalunya through the [Aina project](https://projecteaina.cat/).
134
 
135
+ The conversion of the model was possible thanks to the compute time provided by [Barcelona Supercomputing Center](https://www.bsc.es/) through MareNostrum 5.