パッケージプログラム一覧(基礎生物学)

module 経由で使えるもの

事前に設定ファイルを読み込む必要があります

source /apl/bio/etc/bio.sh
Analysis TypeApplication nameDescriptioninstalled ver.Official URL
Homology searchblast+ Sequence Similarity Search2.13.0https://blast.ncbi.nlm.nih.gov/Blast.cgi
blat BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of length 25 bases or more37https://genome.ucsc.edu/goldenpath/help/blatSpec.html
Diamond DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data.2.0.15https://github.com/bbuchfink/diamond
fasta Sequence Similarity Search36.3.8g 
HH-suiteThe HH-suite is an open-source software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs).3.3.0_SSE2https://github.com/soedinglab/hh-suite
MMseq2 ultra fast and sensitive sequence search and clustering suite14-7e284https://github.com/soedinglab/MMseqs2
vsearch 

VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing metagenomic nucleotide sequence data. An alternative to the USEARCH 

2.29.0https://github.com/torognes/vsearch
NGS analysisbamtools Bamtools is a toolkit for analyzing and managing BAM files2.5.2https://github.com/pezmaster31/bamtools
BEDtools Bedtools utilities are a tools for a wide-range of genomics analysis tasks2.27.1https://bedtools.readthedocs.io/en/latest/
Bowtie Bowtie is an ultrafast, memory-efficient short read aligner 1.2.3https://bowtie-bio.sourceforge.net/index.shtml
Bowtie2 Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.2.5.3https://bowtie-bio.sourceforge.net/bowtie2/index.shtml

bwa

bwa-mem

Burrows-Wheeler Aligner (BWA) is an efficient program that aligns relatively short nucleotide sequences against a long reference sequence

0.17.7

2.2.1

https://github.com/lh3/bwa
Cufflinks Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples https://github.com/cole-trapnell-lab/cufflinks
fastQCA quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
fastpA quality control tool for high throughput sequence data.0.23.4https://github.com/OpenGene/fastp
GATK The GATK is the industry standard for identifying SNPs and indels in germline DNA and RNAseq data.4.3.0.0https://github.com/broadinstitute/gatk
hisat2 HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA). 2.2.1https://daehwankimlab.github.io/hisat2/
kallistokallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads.0.46.2https://github.com/pachterlab/kallisto
MACS2 Model-based Analysis of ChIP-Seq on short reads sequencers such as Genome Analyzer (Illumina / Solexa)2.1.2https://github.com/macs3-project/MACS
RSEM RNA-Seq by Expectation-Maximization1.3.3https://github.com/deweylab/RSEM
Salmon Salmon is a tool for quantifying the expression of transcripts using RNA-seq data1.10.0https://combine-lab.github.io/salmon/
samtools SAM Tools provide various utilities for manipulating alignments in the SAM format1.19.2https://www.htslib.org/
soap Short Oligonucleotide Analysis Package2.21https://github.com/ShujiaHuang/SOAPaligner
SRAtoolkitThe SRA Toolkit and SDK from NCBI is a collection of tools and libraries for using data in the INSDC Sequence Read Archives.3.0.0https://github.com/ncbi/sra-tools
STAR Spliced Transcripts Alignment to a Reference2.7.9ghttps://github.com/alexdobin/STAR
Stringtie StringTie is a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts.2.2.1https://ccb.jhu.edu/software/stringtie/
Tophat TopHat is a fast splice junction mapper for RNA-Seq reads2.1.1https://ccb.jhu.edu/software/tophat/index.shtml
Genome (transcript)  AssemblerABySSAssembly By Short Sequences - a de novo, parallel, paired-end sequence assembler2.3.4https://github.com/bcgsc/abyss
Allpaths-LGThe new short read genome assembler.52488https://www.rcac.purdue.edu/software/allpathslg
canu Canu is a fork of the Celera Assembler designed for high-noise single-molecule sequencing (such as the PacBio RSII or Oxford Nanopore MinION).2.2https://github.com/marbl/canu
hifiasm Hifiasm is a fast haplotype-resolved de novo assembler for PacBio HiFi reads.0.18.5https://github.com/chhylp123/hifiasm
MaSuRCAMaSuRCA (Maryland Super-Read Celera Assembler) genome assembly software4.0.7https://github.com/alekseyzimin/masurca
NECATNECAT is an error correction and de-novo assembly tool for Nanopore long noisy reads.0.0.1https://github.com/xiaochuanle/NECAT
SPAdes SPAdes St. Petersburg genome assembler: SPAdes  is an assembly toolkit containing various assembly pipelines. 3.15.3https://github.com/ablab/spades
Trinityrnaseq Novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data2.15.1https://github.com/trinityrnaseq/trinityrnaseq
velvet Sequence assembler for very short reads1.2.10https://github.com/dzerbino/velvet
soap denovoShort Oligonucleotide Analysis Package genome assembler2.04https://github.com/ShujiaHuang/SOAPaligner
wgssame as Celera Assembler: Whole genome assembler8.2https://github.com/alekseyzimin/wgs
Pairwise Alignmentlastz A tool for (1) aligning two DNA sequences, and (2) inferring appropriate scoring parameters automatically1.04https://github.com/lastz/lastz
MUMmer MUMmer is a system for rapidly aligning entire genomes, whether in complete or draft form.4.0.0https://github.com/mummer4/mummer
Multiple Alignmentclustal Omega Fast, accurate, scalable multiple sequence alignment for proteins1.2.4http://www.clustal.org/omega/
clustalw Multiple Sequence Alignment1.83http://www.clustal.org/clustal2/
clustalw2 Multiple Sequence Alignment2.1http://www.clustal.org/clustal2/
FAMSAProgressive algorithm for large-scale multiple sequence alignments.1.6.2https://github.com/refresh-bio/FAMSA
Gblocks Gblocks eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences0.91bhttps://www.biologiaevolutiva.org/jcastresana/Gblocks.html
MAFFTMAFFT is a multiple sequence alignment program7.505https://mafft.cbrc.jp/alignment/software/
muscle Multiple Sequence Alignment faster and more accurate than clustalw5.1https://www.drive5.com/muscle/
t_coffee Multiple sequence alignment package12.00.7https://tcoffee.org/Projects/tcoffee/index.html
Datbase searchdbgetDBGET is an integrated database retrieval system for major biological databases https://www.kanehisa.jp/ja/about_dbget_ja.html
Sequence AssemblerCAP3 Multiple Sequence Alignment122107https://faculty.sites.iastate.edu/xqhuang/cap3-and-pcap-sequence-and-genome-assembly-programs
consedAssembly Editor29.0http://bozeman.mbt.washington.edu/consed/consed.html
PhrapPhrap is a program for assembling shotgun DNA sequence data

1.090518

http://www.phrap.org/phredphrapconsed.html
Phred The phred software reads DNA sequencing trace files, calls bases, and assigns a quality value to each called base

071220

http://www.phrap.org/phredphrapconsed.html
TGICL Multiple Sequence Alignment (for huge data set)2.1it's gone : http://compbio.dfci.harvard.edu/tgi/software/
Gene predictionBRAKERBRAKER is a program that predicts genes in eukaryotic genomic sequences https://github.com/Gaius-Augustus/BRAKER
Augustus AUGUSTUS is a program that predicts genes in eukaryotic genomic sequences3.5.0https://github.com/Gaius-Augustus/Augustus
Genemark A family of gene prediction programs4.69https://genemark.bme.gatech.edu/
genscan Gene prediction https://www.genes.mit.edu/GENSCAN.html

glimmer

glimmerhmm

Glimmer is a system for finding genes in microbial DNA

30.02

3.0.4

https://ccb.jhu.edu/software/glimmerhmm/
TSEBRATSEBRA is a combiner tool that selects transcripts from gene predictions based on the support by extrisic evidence in form of introns and start/stop codons https://github.com/Gaius-Augustus/TSEBRA
Motif searchHMMERBiosequence analysis using profile HMM3.4http://hmmer.org/
Interproscan A tool that combines different protein signature recognition methods into one resource5.42-78.0https://www.ebi.ac.uk/interpro/
meme Multiple Em for Motif Elicitation5.4.1https://meme-suite.org/meme/index.html
phylogenetic tree analysismrbayesMrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models

3.2.7

3.2.7.mpi

https://nbisweden.github.io/MrBayes/
njplot NJplot is a tree drawing program2.4https://doua.prabi.fr/software/njplot
paup Tools for inferring and interpreting phylogenetic trees4b10https://paup.phylosolutions.com/
Phylip A package of programs for inferring phylogenies (evolutionary trees)3.697https://phylipweb.github.io/phylip/
PhyMLPhyML is a phylogeny software based on the maximum-likelihood principle3.3https://github.com/stephaneguindon/phyml
RAxML (raxmlHPC)RAxML - Randomized Axelerated Maximum Likelihood8.2.13https://github.com/stamatak/standard-RAxML/
tree-puzzle Program to reconstruct phylogenetic trees from molecular sequence data by maximum likelihood5.3.rc16http://www.tree-puzzle.de/
Single cell analysisCellRangerA set of analysis pipelines that perform sample demultiplexing, barcode processing, single cell 3' and 5' gene counting, V(D)J transcript sequence assembly and annotation, and Feature Barcode analysis from single cell data.8.0.1https://www.10xgenomics.com/jp/support/software/cell-ranger/latest
MARVELMARVEL is an R package developed for alternative splicing analysis at single-cell resolution.1.0.0https://github.com/wenweixiong/MARVEL
Repeat MaskingRepeatMaskerRepeatMasker is a program that screens DNA sequences for interspersed repeats and low complexity DNA sequences4.1.7p1https://www.repeatmasker.org/
RepeatModelerRepeatModeler is a de novo transposable element (TE) family identification and modeling package.2.0.4https://www.repeatmasker.org/RepeatModeler/
tRNA searchtRNAscan-SEtRNAscan-SE pioneers the large-scale use of covariance models to annotate tRNA genes in genomes2.0.5https://trna.ucsc.edu/tRNAscan-SE/

apptainer で利用するためのコンテナ.sif ファイルが用意されているもの

  • 作業ディレクトリにシンボリックリンクを作ってご利用ください
  • パス:
/apl/bio/container/
  Version.sif fileremarksOfficial URL
AGATAGAT has the power to check, fix, pad missing information (features/attributes) of any kind of GTF and GFF to create complete, sorted and standardised gff3 format.

1.4.1

AGAT/1.4.1/agat_1.4.1--pl5321hdfd78af_0.sif

 https://github.com/NBISweden/AGAT
BRAKERBRAKER is a program that predicts genes in eukaryotic genomic sequences3.0.2BRAKER/3.0.2/braker3.sif https://github.com/Gaius-Augustus/BRAKER
BUSCOAssessing genome assembly and annotation completeness with single-copy orthologs5.8.0BUSCO/busco580.sifRequired Options : --offline https://busco.ezlab.org/
DeepConsensusDeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.1.2.0DeepConsensus/deepconsensus.sif https://github.com/google/deepconsensus
DeepTMHMMDeepTMHMM is currently the most complete and best-performing method for the prediction of the topology of both alpha-helical and beta-barrel transmembrane proteins.1.0.42DeepTMHMM/deeptmhmm_edit_g.sif https://dtu.biolib.com/DeepTMHMM
EpiTypingEpiTyping is a tool for detecting imprinting and X-chromosome inactivation status from RNA-seq1EpiTyping/epityping.sif https://github.com/Gal-Keshet/EpiTyping
GALBAGALBA uses the protein sequences of several (few) or one closely related species to generate a training gene set for AUGUSTUS with either miniprot or GenomeThreader. 1.0.7GALBA/galba107_aug35.sifwith Augustus 3.5.0https://github.com/Gaius-Augustus/GALBA
GATKThe GATK is the industry standard for identifying SNPs and indels in germline DNA and RNAseq data.4.0.1GATK/gatk-4.sif https://github.com/broadinstitute/gatk
ipyradAn interactive assembly and analysis toolkit for restriction-site associated DNA (RAD-seq) and related data types. 0.9.81ipyrad/0.9.81/ipyrad_0.9.81--pyh5e36f6f_0  https://ipyrad.readthedocs.io/en/master/