I'm using Tophat 2.1.0 tophat-fusion-post to search for gene fusion. The tutor told me to download human_genetic, other_genetic, and nt* data from https://ftp.ncbi.nlm.nih.gov/blast/db/ . However, I did not see the first two in it. Which files I need to download in current version of Tophat?
This is because NCBI changed versions of the blast database to v.5 a few years back. tophat (being an old program) was using the older v.4 blast databases. You can still find archival versions of v.4 databases from https://ftp.ncbi.nih.gov/blast/db/v4/ . Note: These are from the year 2020 and are not being updated. If you are OK with that download all of your databases from link above. You can't mix and match v.4 and 5 databases.
If you are looking for fusions you would be best served by looking for an alternate program instead of tophat..
I run the program again and I received following errors:
BLAST Database error: No alias or index file found for nucleotide database [blast/human_genomic] in search path [/data/lastry::]
BLAST Database error: No alias or index file found for nucleotide database [blast/nt] in search path [/data/lastry::]
I then received a list of fusions but the quantity exceeded expectations:
It seems that I can run this without blast direction existing, but alternative splicing cannot be found this way.