|  | --- | 
					
						
						|  | base_model: mistralai/Mistral-7B-Instruct-v0.3 | 
					
						
						|  | datasets: | 
					
						
						|  | - GaetanMichelet/chat-60_ft_task-1 | 
					
						
						|  | library_name: peft | 
					
						
						|  | license: apache-2.0 | 
					
						
						|  | tags: | 
					
						
						|  | - alignment-handbook | 
					
						
						|  | - trl | 
					
						
						|  | - sft | 
					
						
						|  | - generated_from_trainer | 
					
						
						|  | model-index: | 
					
						
						|  | - name: Mistral-7B_task-1_60-samples_config-2 | 
					
						
						|  | results: [] | 
					
						
						|  | --- | 
					
						
						|  |  | 
					
						
						|  | <!-- This model card has been generated automatically according to the information the Trainer had access to. You | 
					
						
						|  | should probably proofread and complete it, then remove this comment. --> | 
					
						
						|  |  | 
					
						
						|  | # Mistral-7B_task-1_60-samples_config-2 | 
					
						
						|  |  | 
					
						
						|  | This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) on the GaetanMichelet/chat-60_ft_task-1 dataset. | 
					
						
						|  | It achieves the following results on the evaluation set: | 
					
						
						|  | - Loss: 1.1342 | 
					
						
						|  |  | 
					
						
						|  | ## Model description | 
					
						
						|  |  | 
					
						
						|  | More information needed | 
					
						
						|  |  | 
					
						
						|  | ## Intended uses & limitations | 
					
						
						|  |  | 
					
						
						|  | More information needed | 
					
						
						|  |  | 
					
						
						|  | ## Training and evaluation data | 
					
						
						|  |  | 
					
						
						|  | More information needed | 
					
						
						|  |  | 
					
						
						|  | ## Training procedure | 
					
						
						|  |  | 
					
						
						|  | ### Training hyperparameters | 
					
						
						|  |  | 
					
						
						|  | The following hyperparameters were used during training: | 
					
						
						|  | - learning_rate: 0.0001 | 
					
						
						|  | - train_batch_size: 1 | 
					
						
						|  | - eval_batch_size: 1 | 
					
						
						|  | - seed: 42 | 
					
						
						|  | - distributed_type: multi-GPU | 
					
						
						|  | - gradient_accumulation_steps: 16 | 
					
						
						|  | - total_train_batch_size: 16 | 
					
						
						|  | - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 | 
					
						
						|  | - lr_scheduler_type: cosine | 
					
						
						|  | - lr_scheduler_warmup_ratio: 0.1 | 
					
						
						|  | - num_epochs: 50 | 
					
						
						|  |  | 
					
						
						|  | ### Training results | 
					
						
						|  |  | 
					
						
						|  | | Training Loss | Epoch   | Step | Validation Loss | | 
					
						
						|  | |:-------------:|:-------:|:----:|:---------------:| | 
					
						
						|  | | 2.4067        | 0.6957  | 2    | 2.1215          | | 
					
						
						|  | | 2.2389        | 1.7391  | 5    | 1.6696          | | 
					
						
						|  | | 1.4281        | 2.7826  | 8    | 1.3318          | | 
					
						
						|  | | 1.1834        | 3.8261  | 11   | 1.2627          | | 
					
						
						|  | | 1.0416        | 4.8696  | 14   | 1.1743          | | 
					
						
						|  | | 0.9159        | 5.9130  | 17   | 1.1342          | | 
					
						
						|  | | 0.6449        | 6.9565  | 20   | 1.2086          | | 
					
						
						|  | | 0.5118        | 8.0     | 23   | 1.4014          | | 
					
						
						|  | | 0.3817        | 8.6957  | 25   | 1.5883          | | 
					
						
						|  | | 0.1478        | 9.7391  | 28   | 1.8900          | | 
					
						
						|  | | 0.0974        | 10.7826 | 31   | 2.1700          | | 
					
						
						|  | | 0.0395        | 11.8261 | 34   | 2.3129          | | 
					
						
						|  | | 0.0325        | 12.8696 | 37   | 2.3484          | | 
					
						
						|  |  | 
					
						
						|  |  | 
					
						
						|  | ### Framework versions | 
					
						
						|  |  | 
					
						
						|  | - PEFT 0.12.0 | 
					
						
						|  | - Transformers 4.44.0 | 
					
						
						|  | - Pytorch 2.1.2+cu121 | 
					
						
						|  | - Datasets 2.20.0 | 
					
						
						|  | - Tokenizers 0.19.1 |