-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
too many alignments with v0.12.6 #227
Comments
Replace |
And also |
LOL. |
So PanSN-spec mandatory? Why |
I can't see any difference. What am I missing?
|
Might be good to remove -X. It could be conflicting. |
this didn't do the trick |
The input sequences are all just Could you please send an example command line where I can just get the diagonal? |
Hi @subwaystation, we're updating wfmash w/ the improvements from MashMap3. One of these improvements requires that filtering is applied to each pair of assemblies independently. Consider the extreme/contrived scenario where In a similar fashion, MashMap3 also filters secondary mappings based on the best hit. Again, its important that it can do this independently across each target assembly. For example, if our query was a copy of |
Also, the organization of the plotting script doesn't seem to be clustering the mappings very well. Here's what happens if I plot the alignments w/ pafplot. There aren't any contig names unfortunately, but at least you can see that things are lining up a bit more like expected.
|
Thanks @bkille |
That's not too surprising that the results are the same. The |
aah got it! assuming there would be more mappings in the PAF, do you know how this would affect the pangenome graph construction with seqwish? |
or would this be resolved with wfmash during the alignment step? |
@ekg Would the lower triangle mode be sufficient as input for pggb? |
Following question, @bkille I use the sequence without "#" and follow panSN, why the running time is so different? Name:
Name: wfmash -a -s 10000 -p 80 -n 1 -k 19 -H 0.001 -Y # -t 36 --hg-filter-ani-diff 30 Col-CC.panSN.fa.gz Rabacal-1.panSN.fa.gz
2987.04s user 13.63s system 772% cpu 388.25s total 3944900Kb max memory
--
wfmash -a -s 10000 -p 90 -n 1 -k 19 -H 0.001 -Y # -t 36 --hg-filter-ani-diff 30 Col-CC.panSN.fa.gz Rabacal-1.panSN.fa.gz
1897.45s user 13.90s system 1092% cpu 175.01s total 4115444Kb max memory
--
wfmash -a -s 10000 -p 95 -n 1 -k 19 -H 0.001 -Y # -t 36 --hg-filter-ani-diff 30 Col-CC.panSN.fa.gz Rabacal-1.panSN.fa.gz
625.62s user 12.68s system 901% cpu 70.81s total 3093592Kb max memory |
@baozg As far as I figured just right now, you need to add |
But such things are not documented... :P @bkille @AndreaGuarracino |
The changes are at least verbose in the actual pggb.sh script in the pggb repo. Looking through the blame, Caveat emptor when building from latest source rather than pinned releases I guess |
Hi developers :)
running
It seems I am getting too many alignments?
I would have expected to only get the diagonal output in an all-vs-all setting, right? Or am I missing something?
Best,
Simon
The text was updated successfully, but these errors were encountered: