Pacbio: Extract h5 file and Filter sequences and compress it to h5 file.
1
0
Entering edit mode
9.2 years ago
clear.choi ▴ 30

I am trying to extract h5 file and filter sequence by myself (Another algorithm) and compress it to h5 file.

Anyone has best pratice to do extract and compress h5 file.

Thank you!

PacBio h5 Pacbio SmartAnalysis hdf5 • 3.3k views
ADD COMMENT
0
Entering edit mode

Could you precise what you're trying to do ? To read and write hdf5 files, you need access to the hdf5 library. There are wrappers for it in various programming languages (R, python, perl...) or maybe you're looking for this or something similar (see the pacbio page here) ?

ADD REPLY
0
Entering edit mode

Thank you for your information! I want to extract sequences and trimming those unexpected sequence and compress again. So main purpose try to trim unexpected sequence

ADD REPLY
0
Entering edit mode

Then I think you should be able to do this with the pbh5tools (link in my comment above).

ADD REPLY
0
Entering edit mode

Thank you for your reply, but it looks like able to exract but it's not able to compress data. Dose it?

ADD REPLY
0
Entering edit mode

Indeed, it seems pbh5tools is mostly aimed at reading PacBio's hdf5 files. However, in your case, it could be that cmph5tools.py from pbhtools may be what you're looking for, in particular the select option: select: Create a new file from a cmp.h5 file by specifying which reads to include.

Otherwise, you'll need to know how the files are structured then use you favorite language's hdf5 API to write them. But do you really need the data in hdf5? Your downstream application may also accept other formats as input.

ADD REPLY
0
Entering edit mode
9.2 years ago
mjhsieh • 0

pbh5tools is what you need.

ADD COMMENT

Login before adding your answer.

Traffic: 2910 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6