Module molcrawl.evaluation.gpt2.protein_classification_data_preparation