license: cc0-1.0 | |
datasets: | |
- code_search_net | |
library_name: transformers | |
tags: | |
- text-generation | |
- code | |
- python | |
This is an adapted tokenizer from GPT2 that can recognize tokens to do with Python coding. It is part of the [huggingfaceNLP course exercise](https://huggingface.co/learn/nlp-course/chapter6/2). It uses the method `train_new_from_iterator()` |