maxfuse.utils.process_count_data

maxfuse.utils.process_count_data(arr, target_sum=1e4, min_mean=0.0125, max_mean=3, min_disp=0.5, max_value=10)[source]

Process count data according to scanpy pipeline.

Parameters:
  • arr (np.array of shape (n_samples, n_features)) – Data matrix

  • target_sum (float, default=1e4) – Parameter in sc.pp.normalize_total

  • min_mean (float, default=0.0125) – Parameter in sc.pp.higly_variable_genes

  • max_mean (float, default=3) – Parameter in sc.pp.higly_variable_genes

  • min_disp (float, default=0.5) – Parameter in sc.pp.higly_variable_genes

  • max_value (float, default=10) – Parameter in sc.pp.scale

Returns:

An np.array representing the processed version of arr