I have a list of circRNAs identified by circRNA identification tool find_circ. How can I classify these circRNAs as exonic, intronic or intergenic? Is there any tools or script? some lines of output from find_circ are given below:
# chrom start end name n_reads strand n_uniq best_qual_A best_qual_B
chr4 166006737 166024248 Sy5y_D0_circ_000001 2 - 1 5 40
chr7 101950003 101952188 Sy5y_D0_circ_000002 1 + 1 5 5
chr5 619104 620376 Sy5y_D0_circ_000003 2 + 2 5 40
Thanks in advance.
Thank you for your response. Running the above command I got the following type of results.
But I want the results as follows:
Any suggestions will be appreciated.
I see, please try this, instead:
If you want to tidy this output, then pipe it into the
cut
BASH command.This will only return all UTR, CDS, and exons, though, because that is what is included in the GENCODE GTF files. However, it contains all currently known non-coding RNA species. If you want introns and intergenic regions, then I suggest different options:
An issue that you face with these regions is that they overlap both introns and exons concurrently, i.e., they are very large circicular RNAs.
Kevin