raygx/BERT-NepSA-T2

Text Classification

generated_from_keras_callback

raygx committed on Jul 22, 2023

Commit 0182711 (1 parent: 25fcc9e)

Upload TFBertForSequenceClassification

Files changed (3)

README.md +5 -13
config.json +1 -1
tf_model.h5 +1 -1

README.md CHANGED

@@ -1,5 +1,6 @@
 ---
 license: mit
 tags:
 - generated_from_keras_callback
 model-index:
@@ -14,9 +15,7 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Shushant/nepaliBERT](https://huggingface.co/Shushant/nepaliBERT) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.5775
-- Validation Loss: 0.6255
-- Epoch: 4
 ## Model description
@@ -35,23 +34,16 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 1e-06, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.001}
 - training_precision: float32
 ### Training results
-| Train Loss | Validation Loss | Epoch |
-|:----------:|:---------------:|:-----:|
-| 0.8096     | 0.7200          | 0     |
-| 0.6789     | 0.6691          | 1     |
-| 0.6341     | 0.6525          | 2     |
-| 0.6028     | 0.6266          | 3     |
-| 0.5775     | 0.6255          | 4     |
 ### Framework versions
-- Transformers 4.29.2
 - TensorFlow 2.12.0
-- Datasets 2.12.0
 - Tokenizers 0.13.3

 ---
 license: mit
+base_model: Shushant/nepaliBERT
 tags:
 - generated_from_keras_callback
 model-index:
 This model is a fine-tuned version of [Shushant/nepaliBERT](https://huggingface.co/Shushant/nepaliBERT) on an unknown dataset.
 It achieves the following results on the evaluation set:
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 1e-06, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.0001}
 - training_precision: float32
 ### Training results
 ### Framework versions
+- Transformers 4.31.0
 - TensorFlow 2.12.0
+- Datasets 2.13.1
 - Tokenizers 0.13.3
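
The removed and added `optimizer` lines in README.md are long and nearly identical, so the changed value is easy to miss. A minimal sketch, assuming the two dicts are copied verbatim from the `-` and `+` lines, that parses them and reports which keys differ. The helper name `changed_keys` and the constants `OLD` and `NEW` are illustrative and not part of the repository.

```python
import ast

# Optimizer dicts copied from the removed (-) and added (+) README.md lines.
OLD = ("{'name': 'AdamWeightDecay', 'learning_rate': 1e-06, 'decay': 0.0, "
       "'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, "
       "'weight_decay_rate': 0.001}")
NEW = ("{'name': 'AdamWeightDecay', 'learning_rate': 1e-06, 'decay': 0.0, "
       "'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, "
       "'weight_decay_rate': 0.0001}")


def changed_keys(old: dict, new: dict) -> dict:
    """Return {key: (old_value, new_value)} for every key whose value differs."""
    keys = old.keys() | new.keys()
    return {k: (old.get(k), new.get(k))
            for k in sorted(keys)
            if old.get(k) != new.get(k)}


# literal_eval safely parses the Python-literal dict strings (no code execution).
old_cfg = ast.literal_eval(OLD)
new_cfg = ast.literal_eval(NEW)
diff = changed_keys(old_cfg, new_cfg)
print(diff)  # {'weight_decay_rate': (0.001, 0.0001)}
```

Under these assumptions, the only optimizer setting that changed in this commit is `weight_decay_rate`, which drops from 0.001 to 0.0001.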

config.json CHANGED

@@ -28,7 +28,7 @@
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.29.2",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.31.0",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dec01be38fb7f7166034dbc4ec529597c582684d168ed3dbecc5936e6c304635
 size 438226204

 version https://git-lfs.github.com/spec/v1
+oid sha256:587869e988bb65b06ae2bd664df30a331f2ef2e28af131d05b987e77fc7f9cb6
 size 438226204
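
The `tf_model.h5` entries above are Git LFS pointer files: they record the SHA-256 digest (`oid`) and byte size of the real weights file, which is why the size stays the same while the `oid` changes. A minimal sketch, assuming a locally downloaded copy of the weights, of how a file could be checked against the new pointer. The function names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical helpers, not part of Git LFS or the Hugging Face tooling.

```python
import hashlib
import os

# New-side pointer text, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:587869e988bb65b06ae2bd664df30a331f2ef2e28af131d05b987e77fc7f9cb6
size 438226204
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Check a local file's size and digest against a parsed pointer."""
    if os.path.getsize(path) != pointer["size"]:
        return False  # cheap check first, avoids hashing a wrong-sized file
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Stream in chunks so a ~438 MB file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
```

With a downloaded copy in place, `file_matches_pointer("tf_model.h5", pointer)` would return `True` only if the file is byte-for-byte the one this commit uploaded; the local path is an assumption.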