Quantify batch effect

Quantify batch effect#

Understanding the existing batch effect in your data is crucial for achieving a good integration.

Using principle

example_batch_analysis:
  input:
    preprocessing:
      custom_file_name: custom/file/path.zarr # file path
    batch_analysis: preprocessing
  batch_analysis:
    sample: sample # smallest entity of a batch, e.g. <bio_sample>-<pool> if there is no 1-1 matching of sample to pool
    n_permutations: 100 # you have small data, so should be quick
    covariates:
      - # obs columns of potential batch covariates
  preprocessing:
    highly_variable_genes:
      n_top_genes: 2000
      # no batch_key bc we want to maximise batch effect
    assemble: # only pca is needed here
      - pca
    colors: # obs columns that you want in your UMAP
      - cell_type