NAF
: FASTA Compression Benchmark
Helicobacter
genomes
Test dataset: 1,464 genomes of
Helicobacter
species (
list
), stored in a single file.
Originally from
GenBank
, obtained via
GenomeSync
(on January 14, 2019)
Size: 2,497,823,276 bytes
Number of sequences: 98,154
Method:
Benchmark setup
Compared compressors:
FASTA compressors
Best setting of each compressor
Ratio vs Speed
Each line connects consecutive settings of one compressor.
Ratio vs Speed (log scale)
Size vs Time (log scale)