I am trying to find the list of STRING IDs (e.g. 9606.ENSP00000293677) and their corresponding entry IDs (e.g. CASPE_HUMAN) as a downloadable file. After asking the help desk, they said I need the information found in this Uniprot ID mapping file: https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/by_organism/HUMAN_9606_idmapping.dat.gz After downloading a .dat file, I was trying to convert it into a pandas df and I expect to see more than 15 columns for various identifiers about a given entry on Uniprot. When I tried running these lines of code, it produces a df with three columns and the last method I tried just gave me an array of numbers.
df3 = pd.read_csv('HUMAN_9606_idmapping.dat', sep ='\t', nrows=10)
df3 = pd.read_csv(StringIO('HUMAN_9606_idmapping.dat'),
sep="\t",
index_col=0, #convert first column to datetimeindex
header=None) #none header
df3 = np.genfromtxt('HUMAN_9606_idmapping.dat', unpack = True, delimiter='\t')
If anyone can help me, I would be very appreciative!