Description

Ensure BGZF compression, index, and produce a sequence size summary and a sequence dictionary for each FASTA file.

Input

name
description
pattern

ch_fasta

FASTA file, compressed or not.
Structure: [ val(meta), path(fasta) ]

*.{fa,fa.gz,fa.bz2,fa.xz,fasta,fasta.gz,fasta.bz2,fasta.xz}

Output

name
description
pattern

fasta_fai_gzi_dict

Channel containing:

  • BGZF-compressed FASTA
  • FASTA index (.fai)
  • BGZF index (.gzi)
  • sequence size summary (.sizes)
  • sequence dictionary (.dict)
    Structure:
    [ val(meta), path(fasta_gz), path(fai), path(gzi), path(sizes), path(dict) ]
*