Salmon
De acordo com o página do Salmon, “Salmon is a tool for wicked-fast transcript quantification from RNA-seq data. It requires a set of target transcripts (either from a reference or de-novo assembly) to quantify. All you need to run Salmon is a FASTA file containing your reference transcripts and a (set of) FASTA/FASTQ file(s) containing your reads. Optionally, Salmon can make use of pre-computed alignments (in the form of a SAM/BAM file) to the transcripts rather than the raw reads. The mapping-based mode of Salmon runs in two phases; indexing and quantification. The indexing step is independent of the reads, and only need to be run one for a particular set of reference transcripts. The quantification step, obviously, is specific to the set of RNA-seq reads and is thus run more frequently”.
Versões Disponíveis
salmon/0.14.0
Submissão de Jobs
Crie um arquivo chamado, por exemplo, submit_jobs.sh.
#!/bin/bash
#SBATCH -t 20:00:00
module load salmon/0.14.0
export INPUT="transcripts.fasta reads_1.fastq reads_2.fastq run.sh"
export OUTPUT="transcripts_index transcripts_quant"
job-nanny ./run.sh
em que o script run.sh tem o seguinte conteúdo:
#!/bin/bash
echo ""
echo "===========[[ Preparing transcriptome indices ]]========================="
echo ""
salmon index -t transcripts.fasta -i transcripts_index -k 31
echo ""
echo "===========[[ Quantifying in mapping-based mode - 1 ]]==================="
echo ""
salmon quant -i transcripts_index -l A -1 reads_1.fastq -2 reads_2.fastq --validateMappings -o transcripts_quant
echo ""
echo "===========[[ Quantifying in mapping-based mode - 2 ]]==================="
echo ""
salmon quant -i transcripts_index -l A -r reads_1.fastq --validateMappings -o transcripts_quant
Antes de submeter o job, deve-se tornar o script executável pelo comando
chmod +x run.sh
Para submeter o processo, basta usar o comando:
sbatch submit_jobs.sh
Referências
Para informações adicionais sobre o software, consulte a documentação do Salmon.