What Is A .Ptt File And How It Is Created?
3
0
Entering edit mode
12.9 years ago
Arun • 0

what is a .ptt file and how it is created?

blast • 25k views
ADD COMMENT
4
Entering edit mode
ADD REPLY
2
Entering edit mode

Yes, I wrote that module. There are still some tools that want a PTT file. They aren't much use for unfinished genomes, because they can only refer to ONE sequence at a time, so if you have 200 contigs, you need 200 PTT files....

ADD REPLY
0
Entering edit mode

I was just having this problem, and ended up using the perl script I got from here. gb2ptt Just note that you should run the script as "perl gb2ptt.pl --infile filename.gbk" instead of what they said.

ADD REPLY
6
Entering edit mode
12.9 years ago

A .ptt file is an NCBI Protein Table file, which is a tab delimited file containing a list of all the proteins for their genomes (ftp://ftp.ncbi.nih.gov/genomes/). It corresponds with the CDS annotations from the GenBank file and can be created by parsing the GenBank files and writing the appropriate output.

The columns are:

Location    Strand    Length    PID    Gene    Synonym    Code    COG    Product

For example ftp://ftp.ncbi.nih.gov/genomes/Fungi/Aspergillus_niger_CBS_513_88_uid19263/NC_007445.ptt. These correspond to the feature start and end coordinates, the strand (+/-), the length of the amino acid (minus stop codon), and items analogous to various other GenBank format fields.

You can use BioPerl, as aaron suggests, but it appears as though the BioPython functionality has been dropped? It shouldn't be hard to write your own parser to be honest, although it doesn't store any sequence information!

ADD COMMENT
1
Entering edit mode
9.9 years ago
a.a.kechin ▴ 20

I've found useful and simple tool for this purpose https://lfz.corefacility.ca/gbk2ptt/

ADD COMMENT
0
Entering edit mode

Using https://lfz.corefacility.ca/gbk2ptt/ I converted a gbk file to .ptt .......which gave a result of

Legionella pneumophila subsp. pneumophila str. Philadelphia 1 chromosome, complete genome. - 0..3397754
2021 proteins

While later in NCBI I got

Legionella pneumophila subsp. pneumophila str. Philadelphia 1 chromosome, complete genome - 1..3397754
2943 proteins

Can you explain

ADD REPLY
0
Entering edit mode

The tool seems to parse the first record, but discards the rest.

ADD REPLY
0
Entering edit mode

This link is not working.

ADD REPLY
1
Entering edit mode
5.9 years ago
Mel161803 ▴ 20

https://github.com/sgivan/gb2ptt/blob/master/README.md

Just note that you should run the script as "perl gb2ptt.pl --infile filename.gbk" instead of what they said.

ADD COMMENT
0
Entering edit mode

Or you can use this one: https://github.com/ajvilleg/gbk2ptt

ADD REPLY

Login before adding your answer.

Traffic: 1838 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6