Hi folks!
I have two GFF3 files, one with annotated genes. Another with predicted transposons.
I want to find the numerical distribution of the distances between transposons and genes.
I already found the distance amongst just transposons, and amongst just genes - using the 'spacing' sub-command of bedtools 2.2.4.0, however 'spacing' cannot compare two GFF files.
For comparing spacing of features in 2 separate GFFs, bedtools 2.24.0 has the reldist sub-command, but this does not yield absolute distance in base pairs, but only relative distance distribution, so its not so useful to me...
So I am not sure if there is an off the shelf option in bedtools that can help me answer this question. Would any of you have a simple solution to my problem? It does not have to be using bedtools... Thank you!
Thank you dariober.
At the http://bedtools.readthedocs.org/en/latest/content/tools/closest.html, there are several options, and I wonder if I should be using:
and
Does that seem right to you?
I ask because I wonder if I should instead use other options or additional options, but for my question, I suspect
-s -d
and-S -d
should be appropriate. Do you agree? Thanks again!I think you have to figure this out yourself depending on the question you are asking. All these options are indeed quite confusing (not because the tool is badly written but because the biology is complex!).