-
Notifications
You must be signed in to change notification settings - Fork 13
Open
Description
Dear Jana,
I am currently constructing a pangenome using Pangenie, with the Chinese genome (CN) as the reference. My dataset includes three ragweed genomes:
- CN (China, reference genome)
- NA (North America, haploid)
- EU.1 & EU.2 (Europe, diploid, two haplotypes)
Since Pangenie requires haplotype-resolved genomes, I was able to successfully integrate CN and EU into the pangenome. However, I am facing a challenge with the NA genome, as it does not have haplotype information.
I would like to know:
- Is there a recommended way to incorporate a haploid genome (NA) into a Pangenie-based pangenome?
- Would it be valid to pseudo-diploidize the NA genome by duplicating its sequence, or is there a better approach?
- Alternatively, would you recommend aligning NA to the existing pangenome VCF and analyzing presence/absence variations (PAVs) separately?
I appreciate any suggestions you may have on how to best integrate the NA genome while maintaining the integrity of the pangenome model.
Thank you for your time and insights!
Best regards,
Yihang Huang
Metadata
Metadata
Assignees
Labels
No labels