t5-v1_1-large-gramatika-final-e8-b16
This model is a fine-tuned version of google/t5-v1_1-large on an unspecified dataset (see Training and evaluation data below). It achieves the following results on the evaluation set; a minimal inference sketch follows the metrics:
- Loss: 0.1652
- Rouge1: 43.5873
- Rouge2: 34.5612
- RougeL: 43.3549
- RougeLsum: 43.3701
- Gen Len: 18.9261
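Since the card does not yet document usage, the following is a minimal sketch of loading the checkpoint for inference with the Transformers API. The repository id is taken from the model name in the title and may need to be replaced with the actual hub path; the expected input format (for example, any task prefix) is not documented here and is an assumption.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Assumed repository id (from the title); replace with the actual hub path.
model_id = "t5-v1_1-large-gramatika-final-e8-b16"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# Placeholder input; the expected prompt/prefix format is not documented.
inputs = tokenizer("your input sentence here", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```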
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training (a hedged configuration sketch follows the list):
- learning_rate: 0.001
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adafactor
- lr_scheduler_type: linear
- num_epochs: 8
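For illustration only, the hyperparameters above could be expressed as a `Seq2SeqTrainingArguments` configuration roughly like the one below. The `output_dir` value and the 300-step evaluation interval (inferred from the results table) are assumptions; the original training script is not included in this card.

```python
from transformers import Seq2SeqTrainingArguments

# Sketch of a configuration matching the listed hyperparameters
# (transformers 4.30-era API); not the card author's actual script.
training_args = Seq2SeqTrainingArguments(
    output_dir="t5-v1_1-large-gramatika-final-e8-b16",  # assumed
    learning_rate=1e-3,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    seed=42,
    optim="adafactor",
    lr_scheduler_type="linear",
    num_train_epochs=8,
    predict_with_generate=True,
    evaluation_strategy="steps",
    eval_steps=300,  # inferred from the evaluation steps in the table below
)
```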
Training results
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | RougeL | RougeLsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| 1.021 | 0.37 | 300 | 0.3972 | 34.0058 | 21.9508 | 32.8363 | 32.8413 | 18.7689 |
| 0.4571 | 0.73 | 600 | 0.3349 | 33.8082 | 23.1941 | 33.2259 | 33.2092 | 18.9198 |
| 0.3649 | 1.1 | 900 | 0.2819 | 37.177 | 26.2961 | 36.1081 | 36.1 | 18.9251 |
| 0.2812 | 1.46 | 1200 | 0.2162 | 41.57 | 31.5746 | 41.2196 | 41.2329 | 18.9303 |
| 0.2348 | 1.83 | 1500 | 0.1934 | 42.498 | 32.9335 | 42.2362 | 42.2357 | 18.9313 |
| 0.1883 | 2.2 | 1800 | 0.1900 | 42.8109 | 33.4358 | 42.5931 | 42.6018 | 18.9308 |
| 0.1607 | 2.56 | 2100 | 0.1790 | 42.8463 | 33.5315 | 42.6271 | 42.6405 | 18.9329 |
| 0.1553 | 2.93 | 2400 | 0.1764 | 43.0997 | 34.0805 | 42.9099 | 42.9121 | 18.9313 |
| 0.1143 | 3.29 | 2700 | 0.1735 | 43.2635 | 34.1534 | 43.0452 | 43.0553 | 18.9334 |
| 0.1078 | 3.66 | 3000 | 0.1652 | 43.5873 | 34.5612 | 43.3549 | 43.3701 | 18.9261 |
| 0.1029 | 4.02 | 3300 | 0.1729 | 43.7576 | 35.0653 | 43.5811 | 43.5982 | 18.9292 |
| 0.0669 | 4.39 | 3600 | 0.1773 | 43.6613 | 34.8016 | 43.4742 | 43.4793 | 18.9266 |
| 0.0671 | 4.76 | 3900 | 0.1710 | 43.92 | 35.1941 | 43.7078 | 43.7307 | 18.9313 |
| 0.0584 | 5.12 | 4200 | 0.1883 | 44.0979 | 35.4031 | 43.9062 | 43.9378 | 18.9334 |
| 0.0408 | 5.49 | 4500 | 0.1946 | 44.1764 | 35.6678 | 44.0201 | 44.0305 | 18.9319 |
| 0.0405 | 5.85 | 4800 | 0.1901 | 44.2228 | 35.7124 | 44.0157 | 44.0352 | 18.9319 |
| 0.0311 | 6.22 | 5100 | 0.2098 | 44.4711 | 36.1284 | 44.3122 | 44.3301 | 18.9251 |
| 0.0239 | 6.59 | 5400 | 0.2087 | 44.4937 | 36.2131 | 44.3726 | 44.3922 | 18.9271 |
| 0.0241 | 6.95 | 5700 | 0.2111 | 44.48 | 36.0938 | 44.3196 | 44.3404 | 18.9256 |
| 0.0162 | 7.32 | 6000 | 0.2235 | 44.4869 | 36.1867 | 44.3554 | 44.3615 | 18.9251 |
| 0.0147 | 7.68 | 6300 | 0.2281 | 44.618 | 36.3734 | 44.5021 | 44.5151 | 18.9240 |
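The ROUGE scores above appear to be reported on a 0–100 scale. For reference, a minimal sketch of computing comparable ROUGE metrics with the `evaluate` library (an assumption; the card does not include the actual evaluation script or data):

```python
import evaluate

# Hypothetical predictions/references; the actual evaluation data is not documented here.
predictions = ["she goes to school every day"]
references = ["she goes to school every day"]

rouge = evaluate.load("rouge")
scores = rouge.compute(predictions=predictions, references=references, use_stemmer=True)

# evaluate returns fractions in [0, 1]; scale by 100 to match the table above.
print({k: round(v * 100, 4) for k, v in scores.items()})
```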
Framework versions
- Transformers 4.30.1
- Pytorch 1.11.0a0+b6df043
- Datasets 2.12.0
- Tokenizers 0.13.3