Commit History

feat: use property instead of setter
6aad619

Markus28 committed on

feat: return from_bert for from_pretrained
5549314

Markus28 committed on

support-fast-tokenizer (#6)
ed1b276
verified

jupyterjazz committed on

feat: made from_bert work
851184a

Markus28 committed on

feat: choose flash attention heuristically if not set explicitly
2e2b8d0

Markus28 committed on

feat: select first LoRA upon initialization
fabeb13

Markus28 committed on

feat: formatting and type hints
617fe56

Markus28 committed on

fix: use proper initialization for embedding layer
850b9a2

Markus28 committed on

fix: fixed typo
5c4e4bf

Markus28 committed on

feat: added LoRA
8561a1f

Markus28 committed on

feat: assert return_dict
326b1c4

Markus28 committed on

fix: same assertions in other models
c1d92c9

Markus28 committed on

fix: assert is None for other kwargs too
3f5615c

Markus28 committed on

feat: added head_mask
599c64e

Markus28 committed on

added classifier dropout
767b681

Markus28 committed on

fix: formatting
ae4c28c

Markus28 committed on

fix: formatting
f115a1d

Markus28 committed on

feat: added further GLUE models
ec37ae5

Markus28 committed on

feat: added BertForSequenceClassification
ba24fb1

Markus28 committed on

fix: cast mask to bool
ca5f516

Markus28 committed on

reference the flash attention GitHub
eec6c0e

Markus28 committed on

fix: move flash components into top-level
5944ec8

Markus28 committed on

feat: try to fix import error
4c4562b

Markus28 committed on

feat: moved flash attention code into this repository
46df05d

Markus28 committed on

feat: added encode method
32458be

Markus28 committed on

fix: try to skip initialization of task type embeddings
3b35eab

Markus28 committed on

fix: try to skip initialization of task type embeddings
95ca1a8

Markus28 committed on

feat: added option for QK normalization
463061d

Markus28 committed on

fix: removed obscure config options
2e69073

Markus28 committed on

feat: added small config
149d26f

Markus28 committed on

feat: implement task type embeddings (#1)
8adf551
verified

Markus28 committed on

feat: added back option not to use flash attention
d4d5621

Markus28 committed on

feat: support gradient checkpointing
75d7a16

Markus28 committed on

Added additional config options
5b58f09

Markus28 committed on

removed unused imports
5e7b835

Markus28 committed on

removed __init__ from BertPretrainedModel
44fd417

Markus28 committed on

added config_class and base_model_prefix
45b2292

Markus28 committed on

Fixed typo
80472cb

Markus28 committed on

Fixed typo
6fb6577

Markus28 committed on

Try to subclass PretrainedModel
e209593

Markus28 committed on

Try to subclass PretrainedModel
2b23340

Markus28 committed on

strict=True for debugging
a0c289c

Markus28 committed on

try to simplify checkpointing
4c68a4c

Markus28 committed on

changed model_type
c35343d

Markus28 committed on

feat: added dense_seq_output to config
75a4e4d

Markus28 committed on

removed debugging
c2d8dc3

Markus28 committed on

debugging
c4185ce

Markus28 committed on