jinaai/xlm-roberta-flash-implementation
Transformers
94 languages
xlm-roberta
Inference Endpoints
License: cc-by-nc-4.0
🇪🇺 Region: EU
refs/pr/25
Commit History
493416f  feat: merge with recent changes (jupyterjazz; committed on Aug 2)
6a92924  feat-routing (#26) (verified; jupyterjazz, makram93; committed on Aug 2)
955fea2  fix-rope-inference (#28) (verified; jupyterjazz; committed on Aug 1)
d9d8306  fix: residual is kept in kwargs (jupyterjazz; committed on Jul 30)
4e13c90  refactor: kwargs comprehension (jupyterjazz; committed on Jul 29)
acffa62  fix: remove prints (jupyterjazz; committed on Jul 29)
ae40cb9  fix: 0 is not none (jupyterjazz; committed on Jul 29)
3eb20d0  refactor: modify encode (jupyterjazz; committed on Jul 29)
509511d  refactor: finalize impl (jupyterjazz; committed on Jul 29)
eefe43c  poc (jupyterjazz; committed on Jul 29)
7ad815b  no-flash-attention-during-inference (#22) (verified; jupyterjazz; committed on Jul 23)
6cc0f51  draft (jupyterjazz; committed on Jul 23)
4d09ca8  Rename config to config.json (verified; jupyterjazz; committed on Jul 16)
721e106  Create config (verified; bwang0911; committed on Jul 1)
a60f7c0  Delete config.json (verified; bwang0911; committed on Jun 30)
e3681c2  rope-embeddings (#20) (verified; jupyterjazz; committed on Jun 6)
ab85772  alibi (#19) (verified; jupyterjazz, Jackmin108; committed on Jun 4)
7c4a80c  lora bugfix (#16) (verified; jupyterjazz, Jackmin108; committed on May 24)
98c3cd2  Update config.json (#15) (verified; bwang0911; committed on May 21)
c8da5f5  Update tokenizer_config.json (#14) (verified; bwang0911; committed on May 21)
d230f23  Update modeling_xlm_roberta.py (#13) (verified; bwang0911; committed on May 20)
64c81c6  Update modeling_xlm_roberta.py (#12) (verified; bwang0911; committed on May 20)
8542ad8  truncate-embedding-dimension (#10) (verified; jupyterjazz; committed on May 16)
27d23b2  lora-multiple-adapters (#11) (verified; jupyterjazz; committed on May 15)
4b000ec  Refactor LoRA (#8) (verified; jupyterjazz; committed on May 14)
f9b3adb  support lora (#1) (verified; jupyterjazz; committed on Apr 22)
0bb73e5  Support for SequenceClassification (#7) (verified; michael-guenther; committed on Apr 22)
13c4251  Support torch_dtype and CLS pooling (#6) (verified; michael-guenther; committed on Apr 19)
102e7bc  Update modeling_xlm_roberta_for_glue.py (verified; koukandre; committed on Apr 17)
4dd727d  Create modeling_xlm_roberta_for_glue.py (#4) (verified; koukandre; committed on Apr 17)
424df3c  add-encode-function (#5) (verified; michael-guenther; committed on Apr 17)
290e593  support-cpu (#2) (verified; michael-guenther; committed on Apr 17)
807ba34  Update configuration_xlm_roberta.py (verified; jupyterjazz; committed on Apr 16)
9db6c6f  Update configuration_xlm_roberta.py (verified; jupyterjazz; committed on Apr 16)
77af1c7  add stochastic_depth (michael-guenther; committed on Apr 15)
1c61b96  support activation checkpointing (michael-guenther; committed on Apr 12)
95b4916  add mlm model and adjust naming (michael-guenther; committed on Apr 11)
eb21270  add script to convert weights (michael-guenther; committed on Apr 10)
3d87c79  add tokenizer class (michael-guenther; committed on Apr 9)
30e6a10  change config name (michael-guenther; committed on Apr 9)
2e3ebcb  upload model (michael-guenther; committed on Apr 9)
2aec9c9  initial commit (verified; michael-guenther; committed on Apr 9)