Skip to content

Conversation

@ASLeonard
Copy link
Contributor

@ASLeonard ASLeonard commented Nov 12, 2024

Even for "mid-sized" graphs (n~25 haplotypes and 100 Mb), the odgi::gfa_to_handle step could take up something like ~10% of the entire smoothxg walltime. Presumably this would scale poorly with more haplotypes as path information starts to dominate. Serialising the graph is already implemented and most people probably don't keep intermediate files, so this likely has no external change. Worst case a user can run odgi view -g -i <graph.og> to get the gfa version of the intermediate graphs.

Debatably the output could also be .og, since the next step in pggb is to sort the graph with odgi, but that is out of scope for here. This was easy enough to just toggle based on the extension of the output file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant