r PRO
oceansweep
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 12 hours ago
Use Property-Based Testing to Bridge LLM Code Generation and Validation
liked
a model
8 days ago
kyutai/stt-2.6b-en
Organizations
None yet
LLMs-Using
-
CohereLabs/c4ai-command-r-plus
Text Generation • 104B • Updated • 3.1k • • 1.74k -
microsoft/Phi-3-medium-128k-instruct
Text Generation • 14B • Updated • 13.2k • 381 -
crusoeai/Llama-3-8B-Instruct-Gradient-1048k-GGUF
8B • Updated • 2.35k • 71 -
gradientai/Llama-3-8B-Instruct-262k
Text Generation • 8B • Updated • 5.8k • 258
TTS
Music_Gen
Personal-Projects
Relevant-Papers-Midterm
-
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper • 2402.14848 • Published • 19 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 64 -
CRAG -- Comprehensive RAG Benchmark
Paper • 2406.04744 • Published • 49 -
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 45
Parametric-Compression
Modeling-Martial-Artists
GGUF-related
VLMs
-
openbmb/MiniCPM-V-2
Visual Question Answering • 3B • Updated • 4.36k • 475 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 993 • 28 -
HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 556k • 608 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper • 2311.06242 • Published • 94
LLM-Models
-
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • 47B • Updated • 372k • • 4.47k -
prince-canuma/WizardLM-2-8x22B
Text Generation • 141B • Updated • 38 • 5 -
nvidia/Nemotron-4-340B-Instruct
Updated • 84 • 680 -
jinaai/jina-reranker-v2-base-multilingual
Text Classification • 0.3B • Updated • 431k • 289
Datasweep
Papers
MAMBA-Models
Training-related
Coding
GGUF-related
LLMs-Using
-
CohereLabs/c4ai-command-r-plus
Text Generation • 104B • Updated • 3.1k • • 1.74k -
microsoft/Phi-3-medium-128k-instruct
Text Generation • 14B • Updated • 13.2k • 381 -
crusoeai/Llama-3-8B-Instruct-Gradient-1048k-GGUF
8B • Updated • 2.35k • 71 -
gradientai/Llama-3-8B-Instruct-262k
Text Generation • 8B • Updated • 5.8k • 258
VLMs
-
openbmb/MiniCPM-V-2
Visual Question Answering • 3B • Updated • 4.36k • 475 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 993 • 28 -
HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 556k • 608 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper • 2311.06242 • Published • 94
TTS
LLM-Models
-
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • 47B • Updated • 372k • • 4.47k -
prince-canuma/WizardLM-2-8x22B
Text Generation • 141B • Updated • 38 • 5 -
nvidia/Nemotron-4-340B-Instruct
Updated • 84 • 680 -
jinaai/jina-reranker-v2-base-multilingual
Text Classification • 0.3B • Updated • 431k • 289
Music_Gen
Datasweep
Personal-Projects
Papers
Relevant-Papers-Midterm
-
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper • 2402.14848 • Published • 19 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 64 -
CRAG -- Comprehensive RAG Benchmark
Paper • 2406.04744 • Published • 49 -
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 45
MAMBA-Models
Parametric-Compression
Training-related
Modeling-Martial-Artists