Roleplay LoRA trained on llama-7b in 4-bit mode for 3 epochs, using the https://github.com/teknium1/GPTeacher/tree/main/Roleplay dataset. Training in 4-bit is very fast: the full run took about half an hour on a single RTX 3090. Evaluation against the dataset gave a perplexity in the 3.x range.
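
For reference, here is a minimal sketch of loading the adapter for 4-bit inference with transformers + peft + bitsandbytes. The base model ID and adapter path below are placeholders, not confirmed by this repo; swap in whichever llama-7b checkpoint and adapter location you actually use.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

base_model_id = "huggyllama/llama-7b"  # assumption: any llama-7b checkpoint works
adapter_path = "./"                    # assumption: LoRA weights live in this repo

# Load the base model in 4-bit, matching the precision used during training.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
)
tokenizer = AutoTokenizer.from_pretrained(base_model_id)
model = AutoModelForCausalLM.from_pretrained(
    base_model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# Apply the roleplay LoRA on top of the quantized base model.
model = PeftModel.from_pretrained(model, adapter_path)

prompt = "You are a grizzled sea captain. Describe your ship."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```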