The first thing I do prior to any of this is, post-alignment, run some QC using the ENCODE tools, in particular the phantompeak tools strand cross-correlation (NSC, RSC, and to some degree QTag) analysis and PCR bottleneck metrics. If you have biological reps then IDR is a very good idea as it checks consistency between replicates. CHANCE is also a highly recommended tool for QC. Note: IDR only works for TF and is not really developed for broad peaks (histone modification), so keep that in mind.
Do these on all your samples, including inputs; you should see no cross-correlation in inputs and decent cross-correlation on your IP. The level of the latter being dependent on whether it is narrow vs. broad-peak; if you are doing TF then use the criteria outlines in the phantompeak tools page. We have seen some cases with inputs where you will see some cross-correlation, possibly indicative of biases in fragmentation (open chromatin fragments more easily than closed, for instance), and these will influence your peak calls. If you have access to a Covaris I highly recommend it as we have found the best overall results from samples prepped from those, but they're quite pricy.
I also recommend if you are performing ChIP-Seq runs that ENCODE has previously performed (same organism, same factor) you can run a comparison against theirs to get an idea of how your samples come out (there is an on-line speadsheet on the ENCODE site with this information).
Very useful: Use IGV and calculate coverage metrics (TDF). Run some preliminary peak calls on your best sample and visually check peaks. Use ENCODE data sets from IGV to compare to if possible!
After all that, if the samples are quite comparable based on IDR and peak metrics then combining them is probably fine for peak-calling, but if they are different then you are bound by your worst sample and thus would likely be introducing noise into the peak calling (e.g. false positives).