GSP
Quick Navigator

Search Site

Unix VPS
A - Starter
B - Basic
C - Preferred
D - Commercial
MPS - Dedicated
Previous VPSs
* Sign Up! *

Support
Contact Us
Online Help
Handbooks
Domain Status
Man Pages

FAQ
Virtual Servers
Pricing
Billing
Technical

Network
Facilities
Connectivity
Topology Map

Miscellaneous
Server Agreement
Year 2038
Credits
 

USA Flag

 

 

Man Pages
blt fastq-derep.sh(1) FreeBSD General Commands Manual blt fastq-derep.sh(1)

blt fastq-derep.sh file.fastq|fq[.xz|.bz2|.gz]

filename    FASTQ file optionally compressed with xz, bzip2, or gzip

blt fastq-derep.sh removes replicates from a fastq file using fastq2tsv to reformat to tab-separated data for easier sorting, then using Unix sort and an awk script to remove adjacent entries with the same sequence (column 2 of the TSV). Per latest benchmarks, seqkit rmdup --by-sequence and our own C version, blt fastq-derep, are about 3x as fast. However, blt fastq-derep.sh does not require the entire file in memory as it uses the Unix sort command, which automatically breaks large files into chunks for later merging.

blt fastq-derep.sh file.fastq.xz

blt-fastx2tsv(1), blt-fastx-derep(1)

J. Bacon

Search for    or go to Top of page |  Section 1 |  Main Index

Powered by GSP Visit the GSP FreeBSD Man Page Interface.
Output converted with ManDoc.