Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pcangsd --admix VS. fastmixture for assignment analyses? #96

Open
KChristie11 opened this issue Feb 13, 2025 · 2 comments
Open

pcangsd --admix VS. fastmixture for assignment analyses? #96

KChristie11 opened this issue Feb 13, 2025 · 2 comments

Comments

@KChristie11
Copy link

Hi Jonas,

I'm wondering about the similarities/differences and pros/cons of using pcangsd --admix vs. fastmixture for assignment/admixture/STRUCTURE analyses on datasets with called genotypes.

Does fastmixture outperform pcangsd, for example in the case we converted to a PLINK output and how the programs perform admixture analysis on the same input dataset?

Thanks for your thoughts!

@Rosemeis
Copy link
Owner

Hi!

If you have called genotypes, then I would definitely use fastmixture! The --admix option in pcangsd will be less accurate, and potentially not even faster even though it is based on a least-square approach. We showed in the fastmixture paper that least-square approaches are far less accurate than likelihood-based approaches.

You could try both options and see if they do not converge to the same solution. :-)

Best,
Jonas

@KChristie11
Copy link
Author

Hi Jonas,

Thanks for the clarifications! I'll try both was to see how pcangsd compares to fastmixture.

Best,
Kyle

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants