-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unexpected behavior on converting between gfa and xg #4
Comments
Your input is incorrect. There are only 9 It's annoying that we have to keep these two lists in sync. Maybe we can fix that in rGFA. |
I don't think it is incorrect because it obeys the GFA1 spec. According to the spec, the 4th column means overlaps (between nodes on a path). As long as the path is linear, the number of overlaps between nodes is |
I thought this was the cigar between the path step and the node. If so,
this means none of the GFA we have been making is correct.
…On Thu, Aug 22, 2019, 07:01 Toshiyuki Yokoyama ***@***.***> wrote:
I don't think it is incorrect because it obeys the GFA1 spec. According to
the spec, the 4th column means overlaps (between nodes on a path). As long
as the path is linear, the number of overlaps between nodes is len(nodes)
-1. So, it is natural that there are 9 elements. The example at the end
of https://github.com/GFA-spec/GFA-spec/blob/master/GFA1.md is similar.
I hope such kinds of ambiguity can be resolved in rGFA.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#4?email_source=notifications&email_token=AABDQEO34PLTHMTTDITTR33QFYMTVA5CNFSM4H4L77R2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD434KCQ#issuecomment-523748618>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AABDQEOJENTCUKGEGXCJ4K3QFYMTVANCNFSM4H4L77RQ>
.
|
Thanks for pointing this out. I guess the overlaps are being stored in the path because they aren't determined based on the graph topology of an assembly graph. This does mean that all our GFA P lines are broken. But the fact that we weren't using these fields for any purpose also indicates how useless they were for our applications. In graphs with paths, these overlap/cigar descriptions are hugely expensive. I would love to get rid of them or make them optional. Perhaps |
That said, the current setup of the gfakluge parser used by xg should work for the correct format and correctly parses your example. |
Thank you for considering my comment. The reason why we faced this problem is that https://github.com/graph-genome/vgbrowser uses |
I tried
bin/xg
built at commit hash6871a1e011954483e01ace8a517a78ba1a57b7d9
.An input file
test.gfa
is following.I run the following commands on a shell.
After that, I found that the node
15+
on the pathz
was truncated.The text was updated successfully, but these errors were encountered: