-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
B.1.118 gets called as B.1 #49
Comments
There was a comment on the release of 1.23: |
We’re not - we don’t have any A lineages among our control samples, and I‘d understood the instructions in the notes as a workaround until Usher 0.6.3 was available and thus assumed it wasn’t relevant now that version was out. I’ll try and see how it looks with assignment cache mode as soon as I can. |
Perhaps @AngieHinrichs can clarify if this is the problem still? |
The issue seems to persist with
in a fresh conda env with the specs given above results in the following output:
|
Thanks for reporting this. I will fix it in the next release. Due to a recent shuffling around of the order in which mutations are annotated on successive branches, B.1.118 is annotated on a small branch within the larger branch where it should be annotated, with two extra mutations, one of which is absent from most samples. In previous versions, although B.1.118 was annotated on a branch that had the two extra mutations, the extra mutations were placed on a larger branch, and then one was reverted to reference on a sub-branch that covered most of the samples. The effect was that in the previous version, B.1.118 samples without the extra mutation(s) would be placed on the branch where B.1.118 was annotated (with a reversion on the mutation that shouldn't have been in the path in the first place), but now, with an arguably better structure / order of mutations, B.1.118 is annotated on a sub-branch and I need to fix the annotation. |
With version 1.23.1, one of our positive controls which has been consistently called as B.1.118 suddenly gets called as B.1.
We're running pangolin 4.3 in usher placement mode, relevant versions are
Given it's a positive control I should be able to share the sequence if needed, but it looks like this might be a general issue with B.1.118 sequences - UCSC UShER gives the same results for a bunch of B.1.118 genomes from GISAID, while COG-UK (still on 1.22) gives B.1.118 - kudos to Ammar Aziz over on the µbioinfo slack for digging into it.
The text was updated successfully, but these errors were encountered: