Davide Buoso
lambdavi
AI & ML interests
PhD Student @ VANDAL (Polytechnic University of Turin).
Interested in the intersection of Robotics and Generative AI.
Organizations
None yet
NLP
NLP related models.
- lambdavi/span-marker-luke-base-conll2003
  Token Classification • 0.3B • Updated • 4 • 2
- lambdavi/deberta-v3-base_on5
  Token Classification • 0.2B • Updated • 2
- lambdavi/luke-base_finetuned_conll2003
  Token Classification • 0.3B • Updated • 2
- lambdavi/luke-base_on5
  Token Classification • 0.3B • Updated • 1
RL
Reinforcement Learning related models
models
18

lambdavi/span-marker-luke-legal
Token Classification • 0.3B • Updated • 5 • 3

lambdavi/legal-luke-base-ner
Token Classification • 0.3B • Updated • 4 • 1

lambdavi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning • Updated

lambdavi/ppo-Pyramids
Reinforcement Learning • Updated • 4

lambdavi/a2c-PandaReachDense-v3
Reinforcement Learning • Updated • 1

lambdavi/ddpg-PandaReach-v3
Reinforcement Learning • Updated

lambdavi/ppo-SnowballTarget
Reinforcement Learning • Updated • 14

lambdavi/Reinforce-CartPole-v1
Reinforcement Learning • Updated

lambdavi/span-marker-luke-base-conll2003
Token Classification • 0.3B • Updated • 4 • 2

lambdavi/luke-base_finetuned_conll2003
Token Classification • 0.3B • Updated • 2
datasets
0
None public yet