---
language:
- code
license: bigcode-openrail-m
datasets:
- bigcode/the-stack-dedup
pipeline_tag: text-generation
tags:
- code
- shader
widget:
- text: void mainImage( out vec4 fragColor, in vec2 fragCoord )
  example_title: mainImage
  group: Shadertoy
model-index:
- name: santacoder-finetuned-the-stack-glsl
  results:
  - task:
      type: text-generation
      name: ShaderEval
    dataset:
      type: Vipitis/Shadertoys-fine
      name: Shadertoys-fine
      config: return_completion
      revision: 0.0.2
    metrics:
    - type: exact_match
      value: 0.567
      name: 300 samples, greedy decoding
      verified: false
    - type: exact_match
      value: 0.59749
      name: all samples, greedy decoding
      verified: false
---

[Santacoder](https://huggingface.co/bigcode/santacoder) finetuned on [The-Stack-dedup (GLSL subset)](https://huggingface.co/datasets/bigcode/the-stack-dedup/tree/main/data/glsl) for 1000 steps with a batch size of 2 and a full sequence length of 2048.
The adapted finetuning script can be found [here](./train.py).

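A minimal generation sketch, assuming the `transformers` library and that this checkpoint is published under the model-index name above (`Vipitis/santacoder-finetuned-the-stack-glsl` is an assumed repo id); SantaCoder-based checkpoints need `trust_remote_code=True`:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id, taken from the model-index name above.
checkpoint = "Vipitis/santacoder-finetuned-the-stack-glsl"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, trust_remote_code=True)

# The widget prompt from the metadata: the Shadertoy entry-point signature.
prompt = "void mainImage( out vec4 fragColor, in vec2 fragCoord )"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)  # greedy decoding, as in the evaluation
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
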
### Finetuning parameters
```sh
python3 train.py --model_path "bigcode/santacoder" \
        --dataset_name "Vipitis/Shadertoys-fine" \
        --data_column "code" \
        --split "train" \
        --seq_length 2048 \
        --max_steps 1000 \
        --batch_size 2 \
        --gradient_accumulation_steps 4 \
        --learning_rate 5e-5 \
        --num_warmup_steps 100 \
        --eval_freq 100 \
        --save_freq 100 \
        --log_freq 1 \
        --output_dir "checkpoint_dir" \
        --no_fp16
```
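
With these settings the effective batch is 2 × 4 = 8 sequences of 2048 tokens per optimizer step, so the 1000 steps correspond to roughly 8 × 2048 × 1000 ≈ 16.4M training tokens (assuming every step consumes a full batch).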

The main purpose of this model is to explore whether finetuning improves performance on [ShaderEval](https://huggingface.co/spaces/Vipitis/ShaderEval); with greedy decoding, this model reached 0.567 on 300 samples and 0.59749 on all samples.

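In spirit, the exact-match scores above compare generated code against the reference for the `return_completion` task. A minimal sketch with the `evaluate` library (the actual scoring is done by the ShaderEval harness; the GLSL strings here are purely illustrative):

```python
import evaluate

# Exact string match between generated and reference completions.
exact_match = evaluate.load("exact_match")
predictions = ["return vec4(col, 1.0);"]  # illustrative model completion
references = ["return vec4(col, 1.0);"]   # illustrative ground-truth return statement
print(exact_match.compute(predictions=predictions, references=references))
# {'exact_match': 1.0}
```
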
### Disclaimer

While the train/test split is held out, there is a lot of data contamination, so the model's results on this simple benchmark can't be fully trusted.
Better tasks for the benchmark will be developed and tested against these models.

The license is carried over from the base model; however, the training data has an undefined license. Check the details in [Shadertoys](https://huggingface.co/datasets/Vipitis/Shadertoys).