Commit History

changed model_type
c35343d

Markus28 commited on

feat: added dense_seq_output to config
75a4e4d

Markus28 commited on

removed debugging
c2d8dc3

Markus28 commited on

debugging
c4185ce

Markus28 commited on

debugging
a1e1eff

Markus28 commited on

debugging assertion
4d2995d

Markus28 commited on

fix: fixed get_input_embeddings method
7e06371

Markus28 commited on

feat: added get_input_embeddings method to BertForPreTraining
bb281f0

Markus28 commited on

feat: fixed _from_config
18eed80

Markus28 commited on

feat: changed model_type
eeb05a3

Markus28 commited on

removed from_config
0ce78aa

Markus28 commited on

fix: try to get from_config to work
871fd36

Markus28 commited on

feat: added from_config, also pass additional kwargs from config to model
4164fd6

Markus28 commited on

feat: updated modeling_bert.py to allow MLM-only training
0f43653

Markus28 commited on

feat: reverted monkey patch
3160695

Markus28 commited on

feat: try to monkey-patch index_first_axis
ed92835

Markus28 commited on

feat: try to fix compilation
03d8e7c

Markus28 commited on

feat: added debug print
63832b9

Markus28 commited on

feat: updated .to() override to handle kwargs
e86d612

Markus28 commited on

Revert "feat updated debug pring"
a62c2ab

Markus28 commited on

feat updated debug pring
d21ee1b

Markus28 commited on

fix: try to fix .to(torch.float16) with ALiBi
adf376f

Markus28 commited on

Revert "feat: added back option to disable flash attention"
b7ee9c4

Markus28 commited on

feat: added back option to disable flash attention
a2c07ba

Markus28 commited on

fix: always use flash attention
bfc0b2d

Markus28 commited on

fix: fix typo
fde927f

Markus28 commited on

fix: try to fix AutoModel.from_config
318cbce

Markus28 commited on

fix: try to fix AutoModel.from_pretrained
4c4f805

Markus28 commited on

fix: try to fix import error
8c27502

Markus28 commited on

feat: added top-level docstring, made it compatible with AutoModel
f8b62b4

Markus28 commited on

initial commit
87b642a

Markus28 commited on