GPT2-NECK-SWEEP_20240119-073516-4cDCc_dataset_name-rand_hidden_idxs-12_hidden_lb-0_neck_cls-mlp_pretrained-1_token_lb-0_epoch=00-val_self_loss=8.54.ckpt