ufal
/

byt5-small-geccc-mate

@@ -1,11 +1,13 @@
 ---
 language: cs
 license: cc-by-nc-sa-4.0
 tags:
 - Czech
 - GEC
 - GECCC dataset
-base_model: google/byt5-small
 ---
 # Model Card for byt5-small-geccc-mate
@@ -18,20 +20,20 @@ the MATE method and the [GECCC dataset](https://hdl.handle.net/11234/1-4861).
 ## Model Description
-- **Developed by:** [Seznam.cz](https://seznam.cz) and [Charles University, MFF, ÚFAL](https://ufal.mff.cuni.cz/)
-- **Language(s) (NLP):** Czech
-- **Model type:** character-based encoder-decoder Transformer model
-- **Finetuned from model:** `google/byt5-small`
-- **Finetuned on:**
-  - first synthetic errors generated by the MATE method (see [the paper](https://arxiv.org/abs/2506.22402))
-  - then the [GECCC dataset](https://hdl.handle.net/11234/1-4861)
-- **License:** CC BY-NC-SA 4.0
 ## Model Sources
-- **Repository:** https://github.com/ufal/tsd2025-gec
-- **Paper:** [Refining Czech GEC: Insights from a Multi-Experiment Approach](https://arxiv.org/abs/2506.22402)
-- **Dataset:** [GECCC dataset](https://hdl.handle.net/11234/1-4861)
 ## Evaluation
@@ -69,8 +71,8 @@ print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
 ```
 @InProceedings{10.1007/978-3-032-02551-7_7,
-  author="Pechman, Petr and Straka, Milan and Strakov{\'a}, Jana and N{\'a}plava, Jakub",
-  editor="Ek{\v{s}}tein, Kamil and Konop{\'i}k, Miloslav and Pra{\v{z}}{\'a}k, Ond{\v{r}}ej and P{\'a}rtl, Franti{\v{s}}ek",
   title="Refining Czech GEC: Insights from a Multi-experiment Approach",
   booktitle="Text, Speech, and Dialogue",
   year="2026",
@@ -80,4 +82,4 @@ print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
   isbn="978-3-032-02551-7",
   doi="10.1007/978-3-032-02551-7_7"
 }
-```

 ---
+base_model: google/byt5-small
 language: cs
 license: cc-by-nc-sa-4.0
 tags:
 - Czech
 - GEC
 - GECCC dataset
+pipeline_tag: text-generation
+library_name: transformers
 ---
 # Model Card for byt5-small-geccc-mate
 ## Model Description
+-   **Developed by:** [Seznam.cz](https://seznam.cz) and [Charles University, MFF, ÚFAL](https://ufal.mff.cuni.cz/)
+-   **Language(s) (NLP):** Czech
+-   **Model type:** character-based encoder-decoder Transformer model
+-   **Finetuned from model:** `google/byt5-small`
+-   **Finetuned on:**
+    -   first synthetic errors generated by the MATE method (see [the paper](https://arxiv.org/abs/2506.22402))
+    -   then the [GECCC dataset](https://hdl.handle.net/11234/1-4861)
+-   **License:** CC BY-NC-SA 4.0
 ## Model Sources
+-   **Repository:** https://github.com/ufal/tsd2025-gec
+-   **Paper:** [Refining Czech GEC: Insights from a Multi-Experiment Approach](https://arxiv.org/abs/2506.22402)
+-   **Dataset:** [GECCC dataset](https://hdl.handle.net/11234/1-4861)
 ## Evaluation
 ```
 @InProceedings{10.1007/978-3-032-02551-7_7,
+  author="Pechman, Petr and Straka, Milan and Strakov{\'a}, Jana and Náplava, Jakub",
+  editor="Ek{\v{s}}tein, Kamil and Konopík, Miloslav and Pražák, Ondřej and Pártl, František",
   title="Refining Czech GEC: Insights from a Multi-experiment Approach",
   booktitle="Text, Speech, and Dialogue",
   year="2026",
   isbn="978-3-032-02551-7",
   doi="10.1007/978-3-032-02551-7_7"
 }
+```