Skip to content

Population Structure #34

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
CarmenRamoser opened this issue May 21, 2025 · 3 comments
Open

Population Structure #34

CarmenRamoser opened this issue May 21, 2025 · 3 comments

Comments

@CarmenRamoser
Copy link

Hi,
I’ve tried Neural Admixture on the UKBiobank, but running it in the unsupervised mode got a lot of structure in the White Europeans instead of the continental ancestries like in your paper. Do you know what could be the issue? Here is what I tried:
neural-admixture train --k 6 --data_path 500k_sampleQC_LDpruned_allchr.bed --save_dir sample500k/outputs --name sample500k --initialization kmeans --num_cpus 24

Image

neural-admixture train --k 6 --epochs 50 --batch_size 400 --learning_rate 1e-5 --data_path 500k_sampleQC_LDpruned_allchr.bed --save_dir sample500k/outputs --name sample500k_Test2 --initialization kmeans --num_cpus 24

Image

The PCAs look fine.
Thanks for your help!

@jcgrenier
Copy link

According to some experiments I've ran myself, the training relies on the PCA, which is by itself prone to biases if an unbalanced dataset is given as an input. I would guess that a solution for this would be to either downsample the european individuals or train on a reference dataset, and infer the UKBiobank afterwards. Would it make sense?

@CarmenRamoser
Copy link
Author

Yes, I will try now to train on a reference, it makes sense that the UKBiobank is just too unbalanced. Was just curious as it seems to have worked in the paper in an unsupervised way but couldn't figure out which parameters were used.

@joansaurina
Copy link
Collaborator

Hey @CarmenRamoser .

A new version was released this week.

Could you retry now?

Thanks!

Joan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants