Download cds region coordinates
1
0
Entering edit mode
8.1 years ago

Hello everyone, I want to download only cds region coordinates from given input file in genbank format through some script or bioperl modules. Can anyone suggest me a module of bioperl or biopython which will do the above mentioned job. And if there are no such modules available, how should i approach this problem.

bioperl biopython exon coding regions • 2.1k views
ADD COMMENT
2
Entering edit mode
8.1 years ago

using a simple xslt stylesheet ?

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="1.0">
<xsl:output method="text"/>

<xsl:template match="/">
<xsl:apply-templates select="//GBFeature[GBFeature_key='CDS']/GBFeature_intervals/GBInterval"/>
</xsl:template>


<xsl:template match="GBInterval">
<xsl:value-of select="GBInterval_from"/>
<xsl:text> </xsl:text>
<xsl:value-of select="GBInterval_to"/>
<xsl:text> </xsl:text>
<xsl:apply-templates select="../../GBFeature_quals/GBQualifier"/>
<xsl:text>
</xsl:text>
</xsl:template>

<xsl:template match="GBQualifier">
<xsl:value-of select="GBQualifier_name"/>
<xsl:text>:</xsl:text>
<xsl:value-of select="GBQualifier_value"/>
<xsl:text> </xsl:text>
</xsl:template>

</xsl:stylesheet>

e.g:

curl -s "http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=nucleotide&id=U00096&retmode=xml" |\
xsltproc  --novalid transform.xsl  - | cut -c 1-100 | tail

4633745 4633233 gene:yjjX locus_tag:b4394 gene_synonym:ECK4386; JW5801 codon_start:1 transl_table:11
4633797 4634444 gene:ytjC locus_tag:b4395 gene_synonym:ECK4387; gpmB; JW4358 function:enzyme; Not cl
4635310 4634441 gene:rob locus_tag:b4396 gene_synonym:cbpB; ECK4388; JW4359; robA function:factor; D
4635521 4635994 gene:creA locus_tag:b4397 gene_synonym:ECK4389; JW4360; yjjD codon_start:1 transl_ta
4636007 4636696 gene:creB locus_tag:b4398 gene_synonym:ECK4390; JW4361; ORF2; yjjE function:regulato
4636696 4638120 gene:creC locus_tag:b4399 gene_synonym:ECK4391; JW4362; phoM function:enzyme; Global
4638178 4639530 gene:creD locus_tag:b4400 gene_synonym:cet; ECK4392; JW4363; refII function:putative
4640306 4639590 gene:arcA locus_tag:b4401 gene_synonym:cpxC; dye; ECK4393; fexA; JW4364; msp; seg; s
4640402 4640542 gene:yjjY locus_tag:b4402 gene_synonym:ECK4394; JW4365 codon_start:1 transl_table:11
4640942 4641628 gene:yjtD locus_tag:b4403 gene_synonym:ECK4395; JW4366; lasT GO_process:GO:0009451 -
ADD COMMENT
0
Entering edit mode

Thank you for answering my query. I got the desired result.

ADD REPLY

Login before adding your answer.

Traffic: 2609 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6