BigCodeArena (Running, 37): Compare two AI models by sending them code and seeing their responses.
BigCodeBench Evaluator (Running on CPU Upgrade, 18): Evaluate code samples using specified parameters.