You can find the MAF format description from UCSC here. This is a fairly standard file format and is also used by Ensembl.
There's a section there that says 'lines starting with s' which is your sequences within your alignment block. If you want to define your genomic coordinates you will need to do a conversion as detailed here using the source and start coordinates which is the 2nd and 3rd column respectively:
src -- The name of one of the source sequences for the alignment. For sequences that are resident in a browser assembly, the form 'database.chromosome' allows automatic creation of links to other assemblies. Non-browser sequences are typically reference by the species name alone.
start -- The start of the aligning region in the source sequence. This is a zero-based number. If the strand field is "-" then this is the start relative to the reverse-complemented source sequence (see Coordinate Transforms).
So I think this this example: s droPer1 0 8480 + 8480, it is chromosome 1, same start coordinate as the source (dm3.chr2L, 12727116) and the length of the alignment is 8,480bp.
The Ensembl MAF readme has a bit more information on calculating the coordinates, but it seems we store a bit more information in the 2nd column with regard to the coordinates.
No worries. Yes I think that it is correct. The MAF file stores information on alignment blocks, regions of the genomes that are similar. Whichever pipeline was used to create the multiple alignment will have searched across the genome for similar regions. There's no guarantee that the most similar region/block is on the same chromosome in all species.
There's a section in the Ensembl comparative genomics paper that explains the methodology behind multiple alignments if you're interested.
I'm not sure... it may refer to a super contig (long stretch of sequence not yet assembled into chromosomes). Or a part of an alternate assembly (i.e. not part of the main chromosome level assembly, a haplotypic region). I don't know what 'DroPer1' is so I can't really comment further.
In case anyone else will also encounter this problem:
mafFrag doesn't provide regions of genomes in the alignment.
to get them, you will have to use mafsInRegion tool:
$ mafsInRegion
mafsInRegion - Extract MAFS in a genomic region
usage:
mafsInRegion regions.bed out.maf|outDir in.maf(s)
options:
-outDir - output separate files named by bed name field to outDir
-keepInitialGaps - keep alignment columns at the beginning and of a block that are gapped in all species
Also an additional question:
Is there a way to visualize maf file? i've tried IGV browser, but it seems to fail importing the alignment.
There's another post on this topic: MAF multiple alignment file viewer