-
Notifications
You must be signed in to change notification settings - Fork 2
Quick start
To run VirGenA with test data download and unzip release files.
on Windows:
You can set number of threads in config_test_win.xml by changing value of ThreadNumber element.
Using Windows command promt change dir to unzipped folder and type:
java -cp ./VirGenA.jar RefBasedAssembler config_test_win.xml
on Linux:
You can set number of threads in config_test_linux.xml by changing value of ThreadNumber element.
Change permissions of ./tools/usearch8.1.1861_i86linux32 to make it executable. After that using shell change dir to unzipped folder and type:
java -cp ./VirGenA.jar RefBasedAssembler config_test_linux.xml
Tets data is an artificial mixture containing 100000 HIV reads of three different subtypes (01_AE, B and C) in equal proportions. VirGenA should detect these components and assemble genome-length consensus sequences for all components.
Results will be stored in ./res/ folder. Expected output is:
- Files (fasta) with assemblies of three mixture components named after the selected references: 01_AE.TH.90.CM240.U54771_assembly.fasta, B.FR.83.HXB2_LAI_IIIB_BRU.K03455_assembly.fasta, C.BW.96.96BW0502.AF110967_assembly.fasta
- Sorted bam files with read alignments and corresponding index files (bai): 'reference_name'_mapped_reads.bam and 'reference_name'_mapped_reads.bai
- Log file.
- Download https://github.com/nsoranzo/sspace_basic
- Download https://github.com/gFedonin/VirGenA/blob/master/bin/GenerateTabFile.class and place it in the release directory.
- Run from the release directory: java -cp ./VirGenA.jar:. GenerateTabFile -in ./res/ -out ./res/sspace/ -c ./config_test_linux.xml
- Run SSPACE: perl pathToSSPACE/SSPACE_Basic_v2.0.pl -l ./res/sspace/res.lib -s ./res/sspace/res_contigs.fasta -x 0
Expected output:
All SSPACE output will be in the current directory, including standard_output.final.scaffolds.fasta, which contains resulting scaffolds. In this particular case SSPACE does not join contigs to scaffolds, printing all 4 initial contigs with new names.