Hi! I would like some advice, please. I have to do this work in Bioconductor, with the edgeR package.
I have to do a differential expression analysis: compare the expression of two samples of the Aedes aegypti mosquito (my teacher provided us with two fastq files). So far, I have used the Burrows-Wheeler Aligner (BWA) to build an index from a reference genome (also Aedes aegypti, downloaded from www.vectorbase.org) and aligned both fastq files with the "sampe" method, which gave me a SAM file. It looks like this:
According to the edgeR manual, I need a count table that lists the genes and their read counts, but I don't know how to generate such a table.
I tried the Rsubread package, but its output looks very different from the table used in the example:
I asked the teacher, but he said the problem is that I need a tool that creates count tables for miRNA or ncRNA, and I can't find a suitable one. Could you help me?
I also read that it can be done in Excel. Could you point me to a guide?
Try using STAR for your alignment! For the count tables, Rsubread is perfect for the job; just take a look at the manual, it's pretty easy! I'll leave my code here, maybe it works for you...
Just keep in mind that this part is tailored to my project (sickle cell disease); you need to read the manual and see what you need to change for your own case.
edgeR is also pretty easy to use, and the manual contains everything you need for your job! Just read it carefully.
Hope it helps!
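For anyone unsure what the "count table" mentioned above actually is, here is a minimal Python sketch of the idea. It is not the code from this answer and not how Rsubread works internally. The gene coordinates, sample names, and SAM records are all made up for illustration, and a read is assigned to a gene only by its leftmost mapped position, which is a simplifying assumption; real counting tools consider the whole alignment, strand, and read pairs.

```python
# Hypothetical annotation: gene id -> (chromosome, start, end), 1-based inclusive.
GENES = {
    "geneA": ("chr1", 100, 500),
    "geneB": ("chr1", 800, 1200),
}

# Made-up SAM records (tab-separated, 11 mandatory fields) for two samples.
SAMPLES = {
    "sample1": [
        "@SQ\tSN:chr1\tLN:2000",                      # header line, skipped
        "r1\t0\tchr1\t150\t60\t50M\t*\t0\t0\t*\t*",   # falls in geneA
        "r2\t0\tchr1\t900\t60\t50M\t*\t0\t0\t*\t*",   # falls in geneB
        "r3\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*",           # unmapped (flag 0x4)
    ],
    "sample2": [
        "r1\t0\tchr1\t200\t60\t50M\t*\t0\t0\t*\t*",   # falls in geneA
        "r2\t16\tchr1\t300\t60\t50M\t*\t0\t0\t*\t*",  # falls in geneA
        "r3\t0\tchr1\t600\t60\t50M\t*\t0\t0\t*\t*",   # between genes, not counted
    ],
}


def count_reads(sam_lines, genes):
    """Tally mapped reads per gene, using each read's leftmost position."""
    counts = {gene: 0 for gene in genes}
    for line in sam_lines:
        if line.startswith("@"):          # skip SAM header lines
            continue
        fields = line.rstrip("\n").split("\t")
        flag, chrom, pos = int(fields[1]), fields[2], int(fields[3])
        if flag & 0x4:                    # skip unmapped reads
            continue
        for gene, (g_chrom, start, end) in genes.items():
            if chrom == g_chrom and start <= pos <= end:
                counts[gene] += 1
    return counts


def count_table(samples, genes):
    """Build a gene-by-sample table: one row per gene, one column per sample."""
    per_sample = {name: count_reads(lines, genes) for name, lines in samples.items()}
    return {gene: [per_sample[name][gene] for name in samples] for gene in genes}


if __name__ == "__main__":
    table = count_table(SAMPLES, GENES)
    print("gene\t" + "\t".join(SAMPLES))
    for gene, row in table.items():
        print(gene + "\t" + "\t".join(str(n) for n in row))
```

The result has one row per gene and one column per sample, which is the general shape of the table the edgeR manual asks for. For the actual assignment, the counting would be done with a dedicated tool and a real annotation file rather than a toy script like this.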
That is a good catch.
If you are aligning RNA-seq reads to a genome reference, you should use a splice-aware program like TopHat2, STAR, HISAT2, etc.
If you are aligning to a transcriptome reference, then you could use Bowtie2 or BWA (or quantify reads without aligning, with a program like Salmon or Kallisto).
That said, my understanding is that this is for a homework assignment. It is not appropriate for us to complete the homework assignment for you, since you need the experience of doing this on your own to set appropriate expectations in the future.
One caveat could be if the teacher was wrong, or another could be that they were not providing enough support for the class. The former could require assistance from the community (but is hopefully rare). The latter is probably an issue that would need to be brought up with the graduate school (either in terms of the teacher or you picking a program that is the best fit for yourself).
If the teacher can't answer the question, is there a teaching assistant?
I agree with you... I hope the OP realizes that my code will NOT work as-is for what they intend, but it will force them to at least read the documentation and dig deeper into the subject!