Is there a standard file format for anchored contigs?
5.9 years ago
brogroh • 0

Hi,

Given a genome with many unordered contigs and some external information that can be used to anchor them to chromosomes/linkage groups, is there a standard file format for specifying the linkage relationships between contigs? Downstream analyses, such as window-based calculations of popgen summary statistics, will rely on this order. For example, I can map the set of linkage markers to the reference using a short-read aligner and determine that a certain set of contigs belongs to linkage group X, in a particular order. Should this simply be represented in a FASTA file with linkage information encoded in the header?

Thanks!

genome assembly linkage map anchor contigs contigs • 1.3k views

One commonly used method is to link contigs into scaffolds by an arbitrary number of Ns - for example, 10 Ns: ACGTNNNNNNNNNNACGT.
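
As a rough illustration of the N-joining idea, here is a minimal Python sketch. The function name `join_contigs`, the `(name, strand)` ordering format, and the default gap of 10 Ns are all assumptions made for this example, not part of any standard or tool:

```python
def join_contigs(contigs, order, gap_len=10):
    """Join contigs into a single scaffold, separated by runs of N.

    contigs: dict mapping contig name -> sequence string
    order:   list of (name, strand) tuples; strand is '+' or '-'
    gap_len: arbitrary number of Ns placed between adjacent contigs
    (All names and the input layout here are hypothetical.)
    """
    # Translation table used to complement bases before reversing.
    comp = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    parts = []
    for name, strand in order:
        seq = contigs[name]
        if strand == "-":
            # Reverse-complement contigs anchored on the minus strand.
            seq = seq.translate(comp)[::-1]
        parts.append(seq)
    return ("N" * gap_len).join(parts)


contigs = {"contig_a": "ACGT", "contig_b": "ACGT"}
scaffold = join_contigs(contigs, [("contig_a", "+"), ("contig_b", "+")])
print(scaffold)
```

Note that the gap length carries no information about the real distance between contigs, so it is worth recording the contig order and orientation separately as well.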

5.9 years ago
Malcolm.Cook ★ 1.5k

The documentation for ALLMAPS: robust scaffold ordering based on multiple maps (github: ALLMAPS) includes "ALLMAPS: How to use different types of genomic maps", which does a good job of outlining the various formats found in the wild. The project also provides tools for inter-conversion and many other useful functions.
