zhuqihao

zqh11

zqh11's activity

authored a paper 7 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 8 days ago • 270

liked a model 11 days ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 4 days ago • 17.1k • 622

authored a paper 6 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 54

authored 2 papers 8 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 62

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

updated a model 11 months ago

deepseek-ai/deepseek-coder-33b-instruct

Text Generation • Updated Mar 7, 2024 • 11.8k • 479

New activity in deepseek-ai/deepseek-coder-33b-instruct 11 months ago

Adding `safetensors` variant of this model

#24 opened 11 months ago by Calvinnncy97

authored a paper 12 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 84

updated a model 12 months ago

deepseek-ai/deepseek-math-7b-instruct

Text Generation • Updated Feb 6, 2024 • 7.7k • 106

New activity in deepseek-ai/deepseek-coder-6.7b-instruct 12 months ago

inference_params

#12 opened about 1 year ago by DataSoul

updated a model 12 months ago

deepseek-ai/deepseek-coder-7b-instruct-v1.5

Text Generation • Updated Feb 5, 2024 • 28.7k • 120

New activity in deepseek-ai/deepseek-coder-33b-instruct 12 months ago

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 196.00 MiB. GPU 0 has a total capacty of 79.11 GiB of which 29.56 MiB is free

#21 opened about 1 year ago by butujuzipi

Set global data for future chats

#17 opened about 1 year ago by Sasori7

[AUTOMATED] Model Memory Requirements

#18 opened about 1 year ago by model-sizer-bot

Fine tune the model with part of layers on GPU and rest on CPU

#11 opened about 1 year ago by vmirea

New activity in deepseek-ai/deepseek-coder-7b-base-v1.5 about 1 year ago

Update to deepseek-coder-7b-base-v1.5 in code

#1 opened about 1 year ago by bartowski

authored 2 papers about 1 year ago

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 54

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 43

New activity in deepseek-ai/deepseek-coder-33b-instruct about 1 year ago

Context length

#13 opened about 1 year ago by Rohith1016

liked a dataset about 1 year ago

cognitivecomputations/dolphin-coder

Viewer • Updated Dec 7, 2023 • 109k • 89 • 57