-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 48 -
Linear Transformers are Versatile In-Context Learners
Paper • 2402.14180 • Published • 7 -
Training-Free Long-Context Scaling of Large Language Models
Paper • 2402.17463 • Published • 23 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 611
Yang Lee
innovation64
AI & ML interests
AGI
Recent Activity
updated
a model
about 16 hours ago
innovation64/gemma-2-2B-it-thinking-function_calling-V0
published
a model
about 16 hours ago
innovation64/gemma-2-2B-it-thinking-function_calling-V0
upvoted
a
paper
2 days ago
Stop Overthinking: A Survey on Efficient Reasoning for Large Language
Models
Organizations
Collections
2
RAG research
-
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Paper • 2404.15676 • Published -
How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior
Paper • 2404.10198 • Published • 7 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 71 -
FaaF: Facts as a Function for the evaluation of RAG systems
Paper • 2403.03888 • Published
Papers
1
spaces
3
models
24

innovation64/gemma-2-2B-it-thinking-function_calling-V0
Updated

innovation64/llama3.1-nli
Updated

innovation64/llama3.1-8B-instruct-4bit-ruozhiba-4bit
Text Generation
•
Updated
•
21

innovation64/llama3.1-8B-instruct-4bit-ruozhiba-GGUF
Updated
•
488

innovation64/llama3.1-8B-instruct-4bit-ruozhiba-lora
Updated

innovation64/llama3.1-8B-instruct-4bit-ruozhiba-16
Text Generation
•
Updated
•
16

innovation64/speecht5_finetuned_voxpopuli_sl
Text-to-Speech
•
Updated
•
10

innovation64/whisper-tiny-dv
Automatic Speech Recognition
•
Updated
•
18

innovation64/distilhubert-finetuned-gtzan
Audio Classification
•
Updated
•
81

innovation64/poca-aSoccerTwos
Reinforcement Learning
•
Updated
•
5
datasets
None public yet