Gaspardno committed on
Commit
e10feb5
1 Parent(s): 23b20cb

Upload README.md with huggingface_hub

  1. README.md +95 -0
README.md ADDED
@@ -0,0 +1,95 @@
+ ---
+ base_model: google/flan-t5-small
+ datasets:
+ - svakulenk0/qrecc
+ - taskmaster2
+ - djaym7/wiki_dialog
+ - deepmind/code_contests
+ - lambada
+ - gsm8k
+ - aqua_rat
+ - esnli
+ - quasc
+ - qed
+ language:
+ - en
+ - fr
+ - ro
+ - de
+ - multilingual
+ license: apache-2.0
+ tags:
+ - text2text-generation
+ - llama-cpp
+ - gguf-my-repo
+ widget:
+ - text: 'Translate to German: My name is Arthur'
+   example_title: Translation
+ - text: Please answer to the following question. Who is going to be the next Ballon
+     d'or?
+   example_title: Question Answering
+ - text: 'Q: Can Geoffrey Hinton have a conversation with George Washington? Give the
+     rationale before answering.'
+   example_title: Logical reasoning
+ - text: Please answer the following question. What is the boiling point of Nitrogen?
+   example_title: Scientific knowledge
+ - text: Answer the following yes/no question. Can you write a whole Haiku in a single
+     tweet?
+   example_title: Yes/no question
+ - text: Answer the following yes/no question by reasoning step-by-step. Can you write
+     a whole Haiku in a single tweet?
+   example_title: Reasoning task
+ - text: 'Q: ( False or not False or False ) is? A: Let''s think step by step'
+   example_title: Boolean Expressions
+ - text: The square root of x is the cube root of y. What is y to the power of 2, if
+     x = 4?
+   example_title: Math reasoning
+ - text: 'Premise: At my age you will probably have learnt one lesson. Hypothesis: It''s
+     not certain how many lessons you''ll learn by your thirties. Does the premise
+     entail the hypothesis?'
+   example_title: Premise and hypothesis
+ ---
+
+ # Gaspardno/flan-t5-small-Q4_K_M-GGUF
+ This model was converted to GGUF format from [`google/flan-t5-small`](https://huggingface.co/google/flan-t5-small) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
+ Refer to the [original model card](https://huggingface.co/google/flan-t5-small) for more details on the model.
+
+ ## Use with llama.cpp
+ Install llama.cpp through brew (works on Mac and Linux):
+
+ ```bash
+ brew install llama.cpp
+ ```
+ Invoke the llama.cpp server or the CLI.
+
+ ### CLI:
+ ```bash
+ llama-cli --hf-repo Gaspardno/flan-t5-small-Q4_K_M-GGUF --hf-file flan-t5-small-q4_k_m.gguf -p "The meaning to life and the universe is"
+ ```
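
FLAN-T5 is instruction-tuned, so an instruction-style prompt, such as the translation example in this card's widget metadata, may be a more representative test than an open-ended completion. The sketch below wraps the CLI call in a small shell function. The names `ask_flan` and `LLAMA_CLI` and the `-n 64` token limit are illustrative assumptions, not part of the original instructions.

```bash
# Repo and file names are the ones used in the CLI command of this card.
REPO="Gaspardno/flan-t5-small-Q4_K_M-GGUF"
FILE="flan-t5-small-q4_k_m.gguf"

# Hypothetical helper: run one instruction prompt through llama-cli.
# LLAMA_CLI is an assumed override for the binary name or path
# (setting it to `echo` gives a dry run that only prints the arguments).
ask_flan() {
  "${LLAMA_CLI:-llama-cli}" --hf-repo "$REPO" --hf-file "$FILE" -n 64 -p "$1"
}
```

With llama.cpp installed, `ask_flan "Translate to German: My name is Arthur"` would send the widget's translation prompt to the model. The quality of the answer from a small quantized checkpoint is not something this card documents.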
+
+ ### Server:
+ ```bash
+ llama-server --hf-repo Gaspardno/flan-t5-small-Q4_K_M-GGUF --hf-file flan-t5-small-q4_k_m.gguf -c 2048
+ ```
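
Once the server is running, it can be queried over HTTP. The following is a minimal sketch, assuming the server listens on llama-server's default address `http://127.0.0.1:8080` and exposes the `/completion` endpoint. The helper names `completion_body` and `ask_server` and the `SERVER_URL` variable are hypothetical. This card does not confirm that a given llama.cpp build can serve encoder-decoder models such as T5, so treat this as something to try rather than a guaranteed path.

```bash
# Assumed server address; override SERVER_URL if the server runs elsewhere.
SERVER_URL="${SERVER_URL:-http://127.0.0.1:8080}"

# Hypothetical helper: build the JSON body for the /completion endpoint.
# The prompt is inserted verbatim, so it must not contain double quotes
# or backslashes (no JSON escaping is done here).
completion_body() {
  printf '{"prompt": "%s", "n_predict": %s}' "$1" "$2"
}

# Hypothetical helper: POST a prompt to the server and print the raw JSON reply.
# The second argument (token limit) defaults to 64.
ask_server() {
  curl -s "$SERVER_URL/completion" \
    -H "Content-Type: application/json" \
    -d "$(completion_body "$1" "${2:-64}")"
}
```

For example, `ask_server "Translate to German: My name is Arthur"` would post the widget's translation prompt. Keeping the body construction in its own function makes it easy to inspect the payload without a running server.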
+
+ Note: You can also use this checkpoint directly by following the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the llama.cpp repo.
+
+ Step 1: Clone llama.cpp from GitHub.
+ ```bash
+ git clone https://github.com/ggerganov/llama.cpp
+ ```
+
+ Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag, along with any other hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux).
+ ```bash
+ cd llama.cpp && LLAMA_CURL=1 make
+ ```
+
+ Step 3: Run inference through the `llama-cli` binary.
+ ```bash
+ ./llama-cli --hf-repo Gaspardno/flan-t5-small-Q4_K_M-GGUF --hf-file flan-t5-small-q4_k_m.gguf -p "The meaning to life and the universe is"
+ ```
+ or start the server:
+ ```bash
+ ./llama-server --hf-repo Gaspardno/flan-t5-small-Q4_K_M-GGUF --hf-file flan-t5-small-q4_k_m.gguf -c 2048
+ ```