
BINSEQ: A family of high-performance binary formats for nucleotide sequences
Modern genomics produces billions of sequencing records per run, which are typically stored as gzip-compressed FASTQ files. Although this format is widely used, it is poorly suited to high-throughput processing: records have variable length, and a standard gzip stream must be decompressed sequentially, which makes it hard to divide the work across threads. Here, we present BINSEQ, a family of simple binary formats that enable high-throughput parallel processing of sequencing data.
