Output¶
Full listing of expected output files¶
MetAMOS generates an interactive web page once a run successfully completes:
http://www.cbcb.umd.edu/~sergek/imetamos/gageb/Postprocess/out/html/summary.html
This includes summary statistics and taxonomic information based on Krona [1]. The easiest way to interact with the results is through the web interface. The web interface has been tested in several browsers. The currently known issues are:
Browser | Version | Issues |
---|---|---|
Chrome | 33.0.1750.152 | None |
Safari | 6.1.2 | None |
Firefox | 28 | QUAST reports do not show for Validate |
IE | 9 | Not Tested |
The Postprocess/out directory contains the results of the analysis. By default, metAMOS uses the prefix “proba” (Galician for test). Thus, files will have the name “proba”.*.
abundance.krona.html¶
Krona [1] plot of abundances using the tool selected for abundance (MetaPhyler [2] by default)
annotate.krona.html¶
Krona [1] plot of abundances using the tool selected for classification (Kraken [3] by default)
asm.scores¶
Validation scores for each assembly/kmer combination run. Header contains information on scores generated
best.asm¶
The name of the assembly/kmer combination that was selected as the best
<taxonomy>.classified¶
Subdirectory containing each level of the selected taxonomy (class by default) and the contigs/reads/orfs belonging to each
<taxonomy>.original.annots¶
Tab-delimited taxonomic level assignments for each contig/unassembled read. Class IDs correspond to NCBI taxonomy IDs.
<taxonomy>.original.reads.annots¶
Tab-delimited taxonomic level assignments as above, where contigs are replaced with their constituent sequences.
<taxonomy>.propagated.annots¶
Tab-delimited file as above after assembly graph-based propagation of assignments to contigs.
<taxonomy>.propagated.reads.annots¶
Tab-delimited file as above after propagation and having contigs replaced with their constituent reads.
html (directory)¶
HTML output from the pipeline. summary.html contains an interactive results view.
proba.bnk¶
AMOS bank format of the assembly that can be visualized using Hawkeye.
proba.classify.txt¶
The raw output of the abundances using the tool selected for abundance estimations (MetaPhyler [2] by default)
proba.ctg.cnt¶
The number of sequences mapped to each assembly contig
proba.ctg.cvg¶
The coverage of each assembly contig
proba.ctg.fa¶
The assembled contigs
proba.hits¶
The raw output of the contig/unassembled reads classifications using the selected tool (Kraken [3]) by default.
proba.lib1.contig.reads¶
The per-library assignment of sequences to contigs
proba.lib1.unaligned.fasta¶
The per-library unassembled sequences
proba.scf.fa¶
The assembled scaffolds
proba.motifs.fa¶
The motifs within scaffolds identified by Bambus 2
proba.orf.faa¶
The protein sequences of identified open reading frames (ORFs) in the assembly and unassembled reads
proba.orf.fna¶
The fasta sequences of identified open reading frames (ORFs) in the assembly and unassembled reads
proba.scf.orf.faa¶
The protein sequences of identified open reading frames (ORFs) in the scaffolds
proba.scf.orf.fna¶
The protein sequences of identified open reading frames (ORFs) in the scaffolds
ref.fasta¶
The recruited reference genome used for validation (iMetAMOS only)
ref.name¶
The name of the recruited reference genome (iMetAMOS only)
Additional details for each step are available under <STEP NAME>/out. This includes the raw output (as well as any intermediate files) of any tools run during that step. For example, Annotate/out/proba.prokka includes the full Prokka annotation output. Assemble/out/abyss*/ contains the intermediate files output by ABySS. Additionally, since MetAMOS stores all of its results in an AMOS bank, the assemblies can be visualized with Hawkeye.
[1] Ondov BD, Bergman NH, Phillippy AM.. Interactive metagenomic visualization in a Web browser. BMC Bioinformatics. 2011 Sep 30;12:385. PMID: 21961884
[2] Liu B, Gibbons T, Ghodsi M, Treangen T, Pop M. Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences. BMC Genomics. 2011;12 Suppl 2:S4. Epub 2011 Jul 27.
[3] Wood DE, Salzberg SL: Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biology 2014, 15:R46.