Reinforcement Learning related models
Davide Buoso
lambdavi
AI & ML interests
PhD Student @ VANDAL (Polytechnic University of Turin).
Interested in the intersection of Robotics and Generative AI.
Organizations
None yet
NLP
NLP related models.
-
lambdavi/span-marker-luke-base-conll2003
Token Classification • 0.3B • Updated • 11 • 2 -
lambdavi/deberta-v3-base_on5
Token Classification • 0.2B • Updated • 15 -
lambdavi/luke-base_finetuned_conll2003
Token Classification • 0.3B • Updated • 7 -
lambdavi/luke-base_on5
Token Classification • 0.3B • Updated • 10
RL
Reinforcement Learning related models
NLP
NLP related models.
-
lambdavi/span-marker-luke-base-conll2003
Token Classification • 0.3B • Updated • 11 • 2 -
lambdavi/deberta-v3-base_on5
Token Classification • 0.2B • Updated • 15 -
lambdavi/luke-base_finetuned_conll2003
Token Classification • 0.3B • Updated • 7 -
lambdavi/luke-base_on5
Token Classification • 0.3B • Updated • 10
models
18

lambdavi/span-marker-luke-legal
Token Classification
•
0.3B
•
Updated
•
27
•
3

lambdavi/legal-luke-base-ner
Token Classification
•
0.3B
•
Updated
•
15
•
1

lambdavi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated

lambdavi/ppo-Pyramids
Reinforcement Learning
•
Updated
•
5

lambdavi/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated

lambdavi/ddpg-PandaReach-v3
Reinforcement Learning
•
Updated

lambdavi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
12

lambdavi/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated

lambdavi/span-marker-luke-base-conll2003
Token Classification
•
0.3B
•
Updated
•
11
•
2

lambdavi/luke-base_finetuned_conll2003
Token Classification
•
0.3B
•
Updated
•
7
datasets
0
None public yet