I have two genome assemblies created using different platforms for same species. I am doing some post analysis to check the performance of assemblers. For this, I have the following:
- Contig files (multi-fasta) for reference (910 MB) and query (510 MB)
- Blast hits (tabular format) for reference vs query.
Here, I would like to make a circular track for Reference genome and see how the hits of query assembly distribute. Please suggest the possible methods to do it. I looked for UCSC genome browser, where it displays alignment for pre-defined set of organism. My organism is not in their list.
Kindly help.
Thanks in Advance.
Circos might be an option or D-genies
D-genies crashes while I upload the data in .gz format.
that's a pitty.
I'm not on the development team myself ;-) but I know some of them, I'll pass on the message. But don't let that hold you back to get in touch with them (== send bug report) yourself.
Thanks @lieven.sterck. I did send them an email 3 weeks back. But did not get any response !. So moved on to explore other options.
There is a bigger problem here. Most of these programs crash due to large file size. i.e File 1 has 44k contigs (510MB) and File 2 has 5900 contigs (910 MB). To make it easier, I tried to merged respective files into single long sequence. But still, the programs crashes ! Any solutions to handle these big data ?
Are you sure the problem is not the hardware you have access to?
Sure ! I am doing it in my linux workstation with 150 GB RAM and 40 Cores.