Hi all
I am trying to run conifer using the demo data set which comes bundled with it. Following are the commands I ran:
python conifer.py analyze --probes conifer_v0.2.2/sampledata/probes.txt --rpkm_dir conifer_v0.2.2/sampledata/RPKM_data/ --output /conifer_v0.2.2/sampledata/test_Run_analysis.hdf5 --svd 6 --write_svals /conifer_v0.2.2/sampledata/test_Run_singular_values.txt
Error:
[INIT] Finished reading RPKM files. Total number of samples in experiment: 26 (0 failed to read properly)
[INIT] Attempting to process chromosomes: chr1, chr2, chr3, chr4, chr5, chr6, chr7, chr8, chr9, chr10, chr11, chr12, chr13, chr14, chr15, chr16, chr17, chr18, chr19, chr20, chr21, chr22, chrX, chrY
[RUNNING: chr1] Now on: chr1
[RUNNING: chr1] Found 19822 probes; probeID range is [0-19822]
[RUNNING: chr1] Calculating median RPKM
[RUNNING: chr1] Masking 412 probes with median RPKM < 1.000000
Traceback (most recent call last):
File "conifer_v0.2.2/conifer.py", line 682, in <module>
args.func(args)
File "/conifer_v0.2.2/conifer.py", line 157, in CF_analyze
probeIDs = np.array(map(operator.itemgetter("probeID"),chr_probes))[probe_mask]
IndexError: boolean index did not match indexed array along dimension 0; dimension is 19822 but corresponding boolean dimension is 19821
Closing remaining open files:/conifer_v0.2.2/sampledata/test_Run_analysis.hdf5...done
A google search with the error pointed to NumPy version compatibility problem, however I have the latest version, please see the output of pip freeze
below:
$pip freeze
adium-theme-ubuntu==0.3.4
cycler==0.10.0
decorator==4.0.11
functools32==3.2.3.post2
matplotlib==2.0.2
netifaces==0.10.4
networkx==1.11
numexpr==2.6.2
numpy==1.13.1
pygobject==3.20.0
pyparsing==2.2.0
pysam==0.8.3
python-dateutil==2.6.0
pytz==2017.2
scour==0.32
six==1.10.0
subprocess32==3.2.7
tables==3.2.3.1
unity-lens-photos==1.0
I have the exact same problem with the sample data, changing the 142 line in conifer.py did not helped. The error is thrown again. here is the output
Hello!
I tried your method and it worked for all the chromosomes except 'chr Y' in the sample data set but that is understandable.
Note: I had to change the conifer.py and conifer_functions.py making it compatible with PyTables 3.x. Also, I didn't do any changes with the versions of other tools to use CoNIFER. I hope this helps.