This package includes BBMap, a short read aligner, as well as various other bioinformatic tools. It is written in pure Java, can run on any platform, and has no dependencies other than Java being installed (compiled for Java 6 and higher). All tools are efficient and multithreaded.

BBMap: Short read aligner for DNA and RNA-seq data. Capable of handling arbitrarily large genomes with millions of scaffolds. Handles Illumina, PacBio, 454, and other reads; very high sensitivity and tolerant of errors and numerous large indels. Very fast.

BBNorm: Kmer-based error-correction and normalization tool.

Dedupe: Simplifies assemblies by removing duplicate or contained subsequences that share a target percent identity.

Reformat: Reformats reads between fasta/fastq/scarf/fasta+qual/sam, interleaved/paired, and ASCII-33/64, at over 500 MB/s.

BBDuk: Filters, trims, or masks reads with kmer matches to an artifact/contaminant file.

...and more!

Features

  • Pure Java, runs on any platform; already compiled, just unzip and run.
  • Fast, efficient, and multithreaded.
  • Usage information displayed when running a shellscript with no parameters.
  • Highest sensitivity of any short-read aligner.
  • Easy to install - just unzip/untar.
  • Easy to use. Example: bbmap.sh ref=ecoli.fa in=reads.fq out=mapped.sam
  • Handles all common formats: fasta, fastq, sam, scarf, fasta+qual, ASCII-33, ASCII-64, gzip.
  • Used by the Joint Genome Institute.
  • BB stands for Bestus Bioinformaticus.
  • There is a handy FAQ here: http://seqanswers.com/forums/showthread.php?t=58221

Project Samples

Project Activity

See All Activity >

Follow BBMap

BBMap Web Site

Other Useful Business Software
Open source. Easy to use. Proven. Complete. Icon
Open source. Easy to use. Proven. Complete.

End to end big data that enables you to spend less time formatting data and more time analyzing it.

Discover HPCC Systems - the truly open source big data solution that allows you to quickly process, analyze and understand large data sets, even data stored in massive, mixed-schema data lakes. Designed by data scientists, HPCC systems is a complete integrated solution from data ingestion and data processing to data delivery. The free online introductory courses and a robust developer community allow you to get started quickly.
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • I was so tired of trimmomatic not doing what it was supposed to with the adapters, I found that I didn't have enough control with the exact settings for the adapter trimming. I downloaded Bbmap to use Bbduk and had it up and running in about 1 h of playing with the thresholds. Not only it did exactly what I needed ( I needed to cut an adaptors with the ability to specify if it was from the left or the right), but it was easy an intuitive to use.
    Reply from BBMap
    Posted 2022-08-30
    Thanks for giving me my first review! I always like to hear that my tools do exactly what you need, and are intuitive and easy to use. Or, of course, if they aren't, so I can fix them. But I put a lot of effort into making them easy to use, so I'm glad it worked exactly as I intended. Well... sorry about the 1 hour of playing with thresholds, but yep, it can take that to optimize things for a custom case.
Read more reviews >

Additional Project Details

Operating Systems

Solaris, BSD, Windows

Languages

English

Intended Audience

Science/Research

User Interface

Console/Terminal

Programming Language

Java

Related Categories

Java Bio-Informatics Software, Java Medical Software

Registered

2014-02-19