Hugging Face
David Dale
cointegrated
79 followers · 8 following
https://daviddale.ru/en
cointegrated
avidale
AI & ML interests
Research engineer at FAIR, Meta. Some pet projects on NLP for under-resourced languages. Interests: Machine translation, Chatbots, applied NLU, controllable text generation (in particular, text style transfer), miniature models.
cointegrated's activity
New activity in openlanguagedata/oldi_seed · about 17 hours ago
Tamasheq data contamination · 3 · #2 opened 28 days ago by ayymen
New activity in openlanguagedata/flores_plus · 1 day ago
(Beginner) Issues using the described method to load FLORES+ dataset · 5 · #15 opened 2 days ago by jaecbc
machine · 1 · #8 opened about 2 months ago by maryamelboraie
Misalignments in the Aranese subset (aran1260) · 1 · #11 opened about 1 month ago by OrianeN
New activity in openlanguagedata/flores_plus · 2 days ago
Fix misalignments in the Aranese subset (aran1260) · 1 · #13 opened 15 days ago by agaliano
Updated a dataset · 2 days ago
openlanguagedata/flores_plus
Viewer • Updated 2 days ago • 807k • 2.54k • 30
New activity in openlanguagedata/flores_plus · 2 days ago
Split dataset in subsets per language · 1 · #5 opened 3 months ago by thomas-ferraz
Liked a dataset · 12 days ago
rombodawg/Everything_Instruct_Multilingual
Viewer • Updated Oct 8, 2024 • 5.81M • 256 • 23
New activity in openlanguagedata/flores_plus · 20 days ago
[DRAFT] Fix orthography in the Russian dev set · 4 · #4 opened 4 months ago by cointegrated
Fix encoding at chv devtest · 4 · #9 opened about 2 months ago by alexantonov
Liked a dataset · about 1 month ago
google/wmt24pp
Viewer • Updated 12 days ago • 54.9k • 6.17k • 32
New activity in slone/nllb-rus-tyv-v1 · about 1 month ago
Adding `safetensors` variant of this model · #1 opened about 1 month ago by SFconvertbot
New activity in cointegrated/LaBSE-en-ru · about 1 month ago
Warn Some weights of the model checkpoint at cointegrated/LaBSE-en-ru were not used when initializing BertModel: · 1 · #4 opened 6 months ago by alashkov83
New activity in slone/LaBSE-shallow-distilled-bak · about 2 months ago
Adding `safetensors` variant of this model · #1 opened about 2 months ago by SFconvertbot
New activity in cointegrated/SONAR_200_text_encoder · about 2 months ago
can you please do the same for decoder · 1 · #2 opened 3 months ago by damerajee
New activity in slone/finugorbib · about 2 months ago
[bot] Conversion to Parquet · #1 opened about 2 months ago by parquet-converter
Liked a dataset · about 2 months ago
udmurtNLP/udmurt-russian-parallel-corpora
Viewer • Updated Feb 1 • 102k • 74 • 3
New activity in openlanguagedata/flores_plus · about 2 months ago
Added Dargwa dev set to flores_plus · 2 · #3 opened 4 months ago by Murtazali
Published a dataset · about 2 months ago
slone/finugorbib
Viewer • Updated Jan 27 • 849k • 159 • 1
Updated a dataset · about 2 months ago
slone/finugorbib
Viewer • Updated Jan 27 • 849k • 159 • 1