SV_quiz_students_1.1

SV classification (aka SV quiz)

You can find all the files necessary to answer the questions in the folder “SV_quiz”. Within this folder you will find a subfolder corresponding to each of the questions. The tools we suggest to use are all freely available (IGV, MAFFT or LAST online) but Censor/Repbase which has a limited free use. Nonetheless, the limits of Censor should allow all of you to answer these questions. If you have any problem with Censor please ask me (Vale) or Alex 🙂

Visualise BAM files

IGV: Amazon instance or download on your computer (igvteam/igv)
To access the IGV viewer through the Amazon instance, go to Guacamole: “http://ec2-XX-XXX-XXX-XXX.compute-1.amazonaws.com:8080/guacamole/”, again replacing “XX-XXX-XXX-XXX” with your Amazon instance IP address. Go to the desktop environment and open the terminal inside of the ubuntu gui of guacamole and run this code:”./software/IGV_2.13.0/igv.sh”

If you want to do this exercise on your local computer you can download the quiz material like this:
scp [email protected]:workshop_materials/a13_structural_variation/SV_quiz/Data/SV_quiz_data.zip . replacing the “XX-XXX-XXX-XXX” with your Amazon instance IP address.

Make dotplots

MAFFT: https://mafft.cbrc.jp/alignment/server/index.html

LAST: http://lastweb.cbrc.jp

Repeat database
Censor (sequence homology to Repbase repeat database): https://www.girinst.org/censor/index.php

Q1: What is it?

Hint: Is there any signature of incongruent read mapping along the sequence of interest?

  • Open the sequence and respective BAM files in IGV.

Q2: What is it?

Hint: Are there more reads piling up than expected along the sequence of interest?

  • Open the sequence and respective BAM files in IGV.

Q3: What is it?

Hint: Do samples differ?

  • Open the sequence and respective BAM files in IGV.

Q4: What is it?

Hint: Puff?

  • Open the sequence and respective BAM files in IGV.

Q5: What is it?

Hint: Jump?

  • Open the sequence and respective BAM files in IGV.
  • Check the sequence with MAFFT/LAST and Censor.

Q6: What is it?

Hint: What happens in sample2 with respect to the reference and sample1?

  • Open the sequence and respective BAM files in IGV.
  • Check the sequence with MAFFT/LASTZ and Censor.

Q7: What is it?

Hint: Another jump?

  • Open the sequence and respective BAM files in IGV.
  • Check the sequence with MAFFT/LASTZ and Censor.

Answers

What is it? Q1 Q2 Q3 Q4 Q5 Q6 Q7
Deletion
Inversion
LTR insertion
Duplication
DNA transposon
soloLTR
Copy number variation
SNP
Artefact

Hint: There is only one answer per sequence.