Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Below is an example that demonstrates this requirement and process by correctly splitting the file Example.fasta.

Example.fasta

>m140415_143853_42175_c100635972550000001823121909121417_s1_p0/553/3100_11230
>m140415_143853_42175_c324508543089230982134098587348034_s1_p0/553/103_725
>m140415_143853_42175_c324508543089230982134098587348034_s1_p0/553/973_13390
>m140415_143853_42175_c100635972550000001823121909121417_s1_p0/553/15030_17394

In the 4 headers, there are two unique 1-6 field sets:

>m140415_143853_42175_c100635972550000001823121909121417_s1_p0
>m140415_143853_42175_c324508543089230982134098587348034_s1_p0

All subreads corresponding to these headers need to be in their own files, so Example.fasta would be split accordingly:

m14041514385342175c100635972550000001823121909121417s1_p0.fasta

>m140415_143853_42175_c100635972550000001823121909121417_s1_p0/553/3100_11230 >m140415_143853_42175_c100635972550000001823121909121417_s1_p0/553/15030_17394

m14041514385342175c324508543089230982134098587348034s1_p0.fasta

>m140415_143853_42175_c324508543089230982134098587348034_s1_p0/553/103_725 >m140415_143853_42175_c324508543089230982134098587348034_s1_p0/553/973_13390

FALCON-formatter takes FASTA/Q files or folders of files as input, converts the FASTQ to FASTA and writes each read to a file corresponding to fields 1 through 6.

...