Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Estimated cell type counts: adjusted for abundance or not? #377

Open
BaharehAjami opened this issue Aug 12, 2024 · 4 comments
Open

Estimated cell type counts: adjusted for abundance or not? #377

BaharehAjami opened this issue Aug 12, 2024 · 4 comments
Labels
question Further information is requested

Comments

@BaharehAjami
Copy link

Hello Cell2location team,

When estimating celltype-specific counts per spot, are these counts normalized for the number of cells in each spot? I am interested in quantifying the expression of cells in each spot as if only one cell of each type (if available) were in said spot. This would make it easier to compare cell type expression between conditions if, say, the abundance of a given cell type were higher in one condition compared to the other, for example. If the counts are unnormalized for abundance, it's clear how this could be a confounding factor. Any advice would be appreciated, and thank you for the great tool.

-Ajami lab team

@BaharehAjami BaharehAjami added the question Further information is requested label Aug 12, 2024
@vitkl
Copy link
Contributor

vitkl commented Aug 21, 2024

Hi @BaharehAjami

You can look at the code. The best choice here is not decided. You don't want to normalise by cell abundance if you don't filter locations to exclude locations with near 0 abundance of a given cell type - normalisation by cell abundance only makes sense after filtering.

@BaharehAjami
Copy link
Author

BaharehAjami commented Aug 21, 2024

Hi Viktl,

Thank you for your respone, I had indeed thought about removing low-abundance cells, or filtering spots based on a gene marker for the cell type of interest. A question, when you say the best choice here is not decided, has this topic come up before/are the devs considering how best to address this? It seems like it would be an important consideration for certain downstream analyses, but perhaps it is too niche.

Also, when it does come to normalizing for abundance, which strategy do you think is best? Simply dividing the expression of each gene from the celltype-specific counts by the abundance values of the given celltype in each spot seems like the most straightforward approach, but perhaps there is a better method?

-Ajami lab team

@vitkl
Copy link
Contributor

vitkl commented Aug 21, 2024 via email

@BaharehAjami
Copy link
Author

BaharehAjami commented Aug 21, 2024

Hi Vitkl,

So, if one were intending to compare celltype specific gene expression across conditions, and one also wanted to normalize for abundance because abundance changes drastically across said conditions, which normalization would you suggest?

Currently, I am considering:

  1. remove all spots with near-zero abundance for a given cell type
  2. in each spot, divide celltype specific counts for each gene by abundances of the given cell type
  3. proceed to comparing across conditions

Does this seem sound to you? I am especially interested in what you would do in step 2. And thank you very much for the advice.

-Ajami lab team

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants