Hi,
I downloaded a dataset from CellXGene in hdf5 format and have been trying to use it for further analyses. Are there any standard steps one should take before re-using data like this from a database? For example, should I just remove the original expression matrix from the object (adata.X) and make a new hdf5 file, or can I leave all the metadata in place and use the original hdf5 file? My concern is that some of the extra content in the AnnData object may be interfering with my analysis. Is the data in adata.X the original data, or has it already been manipulated?
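
To make the last question concrete, here is a minimal sketch of one way to test whether a matrix still looks like raw counts. It assumes raw counts are non-negative whole numbers, which is only a heuristic; `looks_like_raw_counts` is a hypothetical helper written for illustration, not part of anndata, and the small arrays stand in for a real expression matrix.

```python
import numpy as np

def looks_like_raw_counts(matrix):
    """Heuristic (an assumption): raw counts are non-negative whole numbers."""
    # SciPy sparse matrices expose tocsr(); only their stored values need checking.
    if hasattr(matrix, "tocsr"):
        values = matrix.tocsr().data
    else:
        values = np.asarray(matrix)
    non_negative = np.all(values >= 0)
    whole_numbers = np.all(values == np.round(values))
    return bool(non_negative and whole_numbers)

# Toy stand-ins for an expression matrix (cells x genes).
raw_like = np.array([[0, 3, 1], [2, 0, 5]], dtype=np.float32)
normalized_like = np.log1p(raw_like)  # log-transformed values are no longer integers

print(looks_like_raw_counts(raw_like))
print(looks_like_raw_counts(normalized_like))
```

With a real file, the same check could presumably be run on `adata.X` after loading with `anndata.read_h5ad(...)`, and on `adata.raw.X` if `adata.raw` is not `None`. A passing check does not prove the data is untouched; it only rules out obvious normalization or log transformation.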