Module molcrawl.rnaformer.main

RNAformer Training Script

Transformer model training script specialized for RNA sequences (gene expression data). Based on Geneformer architecture and optimized for learning RNA transcriptome data.

Features: - Custom tokenization for gene expression data - Cell type specific learning - Support for long contexts (1024 tokens) - Efficient batch processing and memory management