I did a ChIP-seq analysis with a genome GRCh38.92 from ensembl and also used the corresponding gene anotation file (.gtf) from ensembl. Genome, annotation and promoter sets directly provided by homer (for hg38 version) are retrieved from UCSC, so I should not work with those.
With peak annotation and motif finding using genomic regions I used the .fa and gtf files directly and worked fine:
annotatePeaks.pl $Peak $GENOME -gtf $GTF > $OUT
findMotifsGenome.pl $PEAK $GENOME $OUT -p 4 -size 400
To run findMotifs.pl first had to load the custom genome:
loadGenome.pl -name GRCh38.92 -org human -fasta $GENOME -gtf $GTF
and then generate a promoter set:
loadPromoters.pl -name GRCh38.92.promoter -org human -id ensembl -genome GRCh38.92 -tss /Applications/homer/data/genomes/GRCh38.92/GRCh38.92.tss
When running findMotifs.pl:
findMotifs.pl $GENE GRCh38.92.promoter $OUT -p 4
I got this error:
!!! Something is wrong... are you sure you chose the right length for motif finding?
!!! i.e. also check your sequence file!!!
Use of uninitialized value in numeric gt (>) at /Applications/homer//bin/compareMotifs.pl line 1389.
!!! Filtered out all motifs!!!
Job finished
I changed the chromossome names of .gtf and .fa files from 'Number' to 'chrNumber' and still got the same error.
Could you please tell me what am I doing wrong?
The input files are here: ftp://ftp.ensembl.org/pub/release-92/fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz ftp://ftp.ensembl.org/pub/release-92/gtf/homo_sapiens/Homo_sapiens.GRCh38.92.gtf.gz
Content of $GENE follows:
ENSG00000005486
ENSG00000010818
ENSG00000033122
ENSG00000115902
ENSG00000081277
ENSG00000075213
ENSG00000141429
ENSG00000149256
ENSG00000157077
ENSG00000163681
ENSG00000105662
ENSG00000107731
ENSG00000014257
ENSG00000151388
ENSG00000197386
ENSG00000197442
ENSG00000138119
ENSG00000255282
ENSG00000186094
ENSG00000148219
ENSG00000124802
ENSG00000070476
ENSG00000138311
ENSG00000159261
ENSG00000214900
ENSG00000177483
ENSG00000237128
ENSG00000112378
ENSG00000115648
ENSG00000234323
ENSG00000128512
ENSG00000196950
ENSG00000143294
ENSG00000163449
ENSG00000006652
ENSG00000115415
ENSG00000082781
ENSG00000105993
ENSG00000116285
ENSG00000188257
ENSG00000182185
ENSG00000121879
ENSG00000240925
ENSG00000078269
ENSG00000065809
ENSG00000057935
ENSG00000175110
ENSG00000136147
ENSG00000169567
ENSG00000251432
ENSG00000138759
ENSG00000138639
ENSG00000249738
ENSG00000116698
ENSG00000110801
ENSG00000066084
ENSG00000257322
ENSG00000258386
ENSG00000258902
ENSG00000187446
ENSG00000197943
ENSG00000259986
ENSG00000182149
ENSG00000008283
ENSG00000153944
ENSG00000132141
ENSG00000267551
ENSG00000168675
ENSG00000146963
ENSG00000280551
ENSG00000229140
ENSG00000231426
It was missing -id custom and -gid. It's running now! Thank you!