Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
bobox
/
DeBERTaV3-small-ST-AdaptiveLayer-Norm-ep2
like
0
Sentence Similarity
sentence-transformers
PyTorch
stanfordnlp/snli
English
deberta-v2
feature-extraction
Generated from Trainer
dataset_size:67190
loss:AdaptiveLayerLoss
loss:MultipleNegativesRankingLoss
Eval Results
Inference Endpoints
arxiv:
1908.10084
arxiv:
2402.14776
arxiv:
1705.00652
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeBERTaV3-small-ST-AdaptiveLayer-Norm-ep2
1 contributor
History:
3 commits
bobox
n_layers_per_step = 1, last_layer_weight = 1 * model_layers,, prior_layers_weight= 0.05, kl_div_weight = 2, kl_temperature= 0.9,
aa1484c
verified
6 months ago
1_Pooling
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
Safe
23.6 kB
n_layers_per_step = 1, last_layer_weight = 1 * model_layers,, prior_layers_weight= 0.05, kl_div_weight = 2, kl_temperature= 0.9,
6 months ago
added_tokens.json
Safe
23 Bytes
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
config.json
Safe
860 Bytes
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
config_sentence_transformers.json
Safe
195 Bytes
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
modules.json
Safe
229 Bytes
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
565 MB
LFS
n_layers_per_step = 1, last_layer_weight = 1 * model_layers,, prior_layers_weight= 0.05, kl_div_weight = 2, kl_temperature= 0.9,
6 months ago
sentence_bert_config.json
Safe
53 Bytes
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
special_tokens_map.json
Safe
286 Bytes
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
spm.model
Safe
2.46 MB
LFS
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
tokenizer.json
Safe
8.66 MB
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago
tokenizer_config.json
Safe
1.28 kB
n_layers_per_step = 1, last_layer_weight = 1.5 * model_layers,, prior_layers_weight= 1, kl_div_weight = 2, kl_temperature= 1,
6 months ago