---
language: en
tags:
- shakespeare
- gpt2
- text-generation
- english
license: mit
datasets:
- shakespeare
---

# Shakespeare GPT-2

A GPT-2 model fine-tuned on Shakespeare's complete works to generate Shakespeare-style text.

## Model Description

This model is a fine-tuned version of GPT-2 (124M parameters) trained on Shakespeare's complete works. It can generate text in Shakespeare's distinctive style, including dialogue, soliloquies, and dramatic prose.

### Model Architecture

- Base Model: GPT-2 (124M parameters)
- Layers: 12
- Attention Heads: 12
- Embedding Dimension: 768
- Context Length: 1024 tokens
- Total Parameters: ~124M
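
As a sanity check, the ~124M figure can be reproduced from the architecture numbers above. This is a rough sketch assuming the standard GPT-2 vocabulary of 50,257 BPE tokens and tied input/output embeddings:

```python
# Rough parameter count for GPT-2 small, from the architecture above.
# Assumes vocab_size=50257 (standard GPT-2 BPE) and tied embeddings.
n_layer, n_embd, n_ctx, vocab = 12, 768, 1024, 50257

embeddings = vocab * n_embd + n_ctx * n_embd       # token + positional
per_layer = (
    3 * n_embd * n_embd + 3 * n_embd               # Q, K, V projections
    + n_embd * n_embd + n_embd                     # attention output
    + n_embd * 4 * n_embd + 4 * n_embd             # MLP up-projection
    + 4 * n_embd * n_embd + n_embd                 # MLP down-projection
    + 4 * n_embd                                   # two LayerNorms
)
total = embeddings + n_layer * per_layer + 2 * n_embd  # + final LayerNorm
print(f"{total:,}")  # 124,439,808, i.e. ~124M
```

This matches the commonly cited parameter count for GPT-2 small.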

### Training Details

- Dataset: Complete works of Shakespeare
- Training Steps: 100,000
- Batch Size: 4
- Sequence Length: 32
- Learning Rate: 3e-4
- Optimizer: AdamW
- Device: MPS/CUDA/CPU
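
The hyperparameters above correspond to a conventional language-model fine-tuning loop. The sketch below is illustrative, not the repository's actual training script: `get_batch` is a hypothetical helper, and the model is assumed to return an object with a `.logits` tensor (as `GPT2LMHeadModel` does).

```python
import torch
import torch.nn.functional as F

def train(model, get_batch, steps=100_000, batch_size=4, seq_len=32, lr=3e-4):
    """Illustrative fine-tuning loop using the hyperparameters listed above."""
    # Pick MPS, CUDA, or CPU, matching the "Device" entry above.
    device = (
        "mps" if torch.backends.mps.is_available()
        else "cuda" if torch.cuda.is_available()
        else "cpu"
    )
    model.to(device).train()
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    for _ in range(steps):
        # get_batch is assumed to yield (input, target) token-ID tensors
        # of shape (batch_size, seq_len).
        x, y = get_batch(batch_size, seq_len)
        logits = model(x.to(device)).logits            # (B, T, vocab)
        loss = F.cross_entropy(
            logits.view(-1, logits.size(-1)), y.to(device).view(-1)
        )
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    return loss.item()
```
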

## Intended Use

This model is intended for:

- Generating Shakespeare-style text
- Creative writing assistance
- Educational purposes in literature
- Entertainment and artistic projects

## Limitations

- May generate text that mimics but does not perfectly replicate Shakespeare's style
- Limited by the training data to Shakespeare's vocabulary and themes
- Can produce anachronistic or inconsistent content
- Maximum context length of 1024 tokens

## Training Data

The model was trained on Shakespeare's complete works, including:

- All plays (comedies, tragedies, histories)
- Sonnets and poems
- Total training tokens: [Insert number of tokens]

## Performance

The model achieves:

- Training Loss: [Insert final training loss]
- Best Loss: [Insert best loss achieved]

## Example Usage

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load model and tokenizer
model_name = "your-username/shakespeare-gpt"
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)

# Generate text
prompt = "To be, or not to be,"
input_ids = tokenizer.encode(prompt, return_tensors="pt")
output = model.generate(
    input_ids,
    max_length=500,
    temperature=0.8,
    top_k=40,
    do_sample=True,
    pad_token_id=tokenizer.eos_token_id,
)
generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
print(generated_text)
```

## Sample Outputs

Prompt: "To be, or not to be,"

Output: [Insert sample generation]

Prompt: "Friends, Romans, countrymen,"

Output: [Insert sample generation]