Arc Virtual Cell Model: State

State is a multi-scale AI model that uniquely operates on sets of cells to capture population-level and cell-level perturbation effects using a modern transformer architecture. The system combines two key components: the State Embedding (SE) model, which creates representations of individual cells, and the State Transition (ST) model, which models perturbation effects across cell populations. SE is trained on 167 million cells of observational data, which are measurements of how cells behave without intervention, while ST is trained on over 100 million cells of perturbation data, or how these cells respond to genetic changes or small molecules.