Installation ================================================ PyTorch-Transformers is tested on Python 2.7 and 3.5+ (examples are tested only on python 3.5+) and PyTorch 1.1.0 With pip ^^^^^^^^ PyTorch Transformers can be installed using pip as follows: .. code-block:: bash pip install pytorch-transformers From source ^^^^^^^^^^^ To install from source, clone the repository and install with: .. code-block:: bash git clone https://github.com/huggingface/pytorch-transformers.git cd pytorch-transformers pip install [--editable] . Tests ^^^^^ An extensive test suite is included to test the library behavior and several examples. Library tests can be found in the `tests folder `_ and examples tests in the `examples folder `_. Tests can be run using `pytest` (install pytest if needed with `pip install pytest`). Run all the tests from the root of the cloned repository with the commands: .. code-block:: bash python -m pytest -sv ./pytorch_transformers/tests/ python -m pytest -sv ./examples/ OpenAI GPT original tokenization workflow ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ If you want to reproduce the original tokenization process of the ``OpenAI GPT`` paper, you will need to install ``ftfy`` (use version 4.4.3 if you are using Python 2) and ``SpaCy`` : .. code-block:: bash pip install spacy ftfy==4.4.3 python -m spacy download en If you don't install ``ftfy`` and ``SpaCy``\ , the ``OpenAI GPT`` tokenizer will default to tokenize using BERT's ``BasicTokenizer`` followed by Byte-Pair Encoding (which should be fine for most usage, don't worry). Do you want to run a Transformer model on a mobile device? ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ You should check out our `swift-coreml-transformers `_ repo. It contains an example of a conversion script from a Pytorch trained Transformer model (here, ``GPT-2``) to a CoreML model that runs on iOS devices. It also contains an implementation of BERT for Question answering. At some point in the future, you'll be able to seamlessly move from pre-training or fine-tuning models in PyTorch to productizing them in CoreML, or prototype a model or an app in CoreML then research its hyperparameters or architecture from PyTorch. Super exciting!