I have been trying to run OMA standalone on a few genomes downloaded from the OMA browser plus a couple of my own custom genomes. All the steps of OMA standalone, including the All-vs-All computations and the pairwise orthologs, finished successfully. However, I consistently hit the same error in the final step, which generates and stores the HOGs for these genomes.
Here is the error message, which I get even when running OMA on different machines:
********************************************************************************
Storing results
Traceback (most recent call last):
  File "/home/orthologs/OMA.2.4.1/bin/..//.venv/bin/warthogs.py", line 128, in <module>
    run_gethogs()
  File "/home/orthologs/OMA.2.4.1/bin/..//.venv/bin/warthogs.py", line 95, in run_gethogs
    Settings.check_consistency_argument()
  File "/home/orthologs/OMA.2.4.1/.venv/lib64/python3.6/site-packages/gethogs/settings.py", line 53, in check_consistency_argument
    cls.inputfile_handler = file_manager.inputfile_handler_factory()
  File "/home/orthologs/OMA.2.4.1/.venv/lib64/python3.6/site-packages/gethogs/file_manager.py", line 97, in inputfile_handler_factory
    return OmaStandaloneFiles(os.path.normpath(join(settings.Settings.pairwise_folder, '..')))
  File "/home/orthologs/OMA.2.4.1/.venv/lib64/python3.6/site-packages/gethogs/file_manager.py", line 44, in __init__
    for row in csv_reader:
_csv.Error: field larger than field limit (131072)
********************************************************************************
An error occured in bottom-up HOG computations:
--- WARTHOGs:
- Start at 13:36 on 2020-12-08
- Orthology relations folder:Output/PairwiseOrthologs (standalone format)
- Method use to merge HOGS: pair
- Output file: Output/HierarchicalGroups.orthoxml
Error, (in GetHOGsBottomUp) failed to compute bottom-up hogs
Can somebody please help me with this error?
Also, is there a way to run only the HOG computation stage of the program, without having to generate all the other output files again?
Hi Adrian,
This finally worked! One thing I should point out, though: I also had to copy the file hog_bottom_up/gethogs/file_manager.py to the folder .venv/lib64/python3.6/site-packages/gethogs/ so that the program uses the modified file.
A couple of days ago, I had tried adding the csv.field_size_limit call to the copy of file_manager.py in the lib64 folder, but that file was overwritten in a subsequent run of OMA. I'm guessing that on a new run of OMA the files in hog_bottom_up/gethogs/ are installed/copied into lib64. However, this did not happen today for some reason, and I got the same "field larger than field limit" error. Only after copying the new file_manager.py to lib64 was I able to finish the HOG computation.
Thanks a lot for all your help! Good luck with the further development of the program. I'm a fan and will be using it for a long time ;)
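For anyone landing here with the same `_csv.Error`, this is roughly what raising the limit with `csv.field_size_limit` looks like. It is only a minimal sketch of the technique, not the actual change made to file_manager.py; the helper name `raise_csv_field_limit` and the in-memory sample data are made up for illustration.

```python
import csv
import io
import sys


def raise_csv_field_limit():
    """Raise the csv module's per-field size limit as far as the platform allows.

    Hypothetical helper for illustration; returns the limit that was set.
    """
    limit = sys.maxsize
    while True:
        try:
            csv.field_size_limit(limit)
            return limit
        except OverflowError:
            # On some platforms the value must fit in a C long, so back off.
            limit //= 2


# A single field longer than the default limit of 131072 characters.
big_field = "x" * 200_000
data = "id,value\n1," + big_field + "\n"

# The limit has to be raised before the reader iterates over the rows.
new_limit = raise_csv_field_limit()
rows = list(csv.reader(io.StringIO(data)))
```

Without the call to `raise_csv_field_limit()`, iterating over the reader fails with the same "field larger than field limit (131072)" message as in the traceback.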
Hi Adrian,
Thank you very much for your help. As Gaurav mentioned, it is necessary to copy the file hog_bottom_up/gethogs/file_manager.py to the folder .venv/lib64/python3.6/site-packages/gethogs/, otherwise the program doesn't use the modified file. I had to do the same thing, so I guess that's an additional confirmation.
As I said above, thanks a lot to you and everyone for the help; it was a good coincidence that this happened to two people at the same time. All the best!
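In case it helps others, the manual copy step can be scripted. The following is a rough sketch, not part of OMA: the function name `install_patched_module` is hypothetical, and where the two paths sit relative to your OMA installation is an assumption you should check on your own machine. It keeps a backup of the file it replaces.

```python
import shutil
from pathlib import Path


def install_patched_module(src_file, dst_dir):
    """Copy a patched module into dst_dir, backing up any file it replaces.

    Hypothetical helper for illustration; returns the path of the installed file.
    """
    src = Path(src_file)
    dst_dir = Path(dst_dir)
    if not src.is_file():
        raise FileNotFoundError(f"patched file not found: {src}")
    if not dst_dir.is_dir():
        raise NotADirectoryError(f"target folder not found: {dst_dir}")

    dst = dst_dir / src.name
    if dst.exists():
        # Keep the previous version next to it, e.g. file_manager.py.bak
        shutil.copy2(dst, dst.with_name(dst.name + ".bak"))
    shutil.copy2(src, dst)
    return dst
```

With the paths from this thread, and run from the OMA installation folder (an assumption), the call would be `install_patched_module("hog_bottom_up/gethogs/file_manager.py", ".venv/lib64/python3.6/site-packages/gethogs")`.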