Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evidence string generation for 2024.12 release #446

Closed
apriltuesday opened this issue Oct 16, 2024 · 1 comment
Closed

Evidence string generation for 2024.12 release #446

apriltuesday opened this issue Oct 16, 2024 · 1 comment

Comments

@apriltuesday
Copy link
Contributor

Deadline for submission: 23 October

Refer to documentation for full description of steps.

Blocked by #434 and #445

@apriltuesday
Copy link
Contributor Author

There are 460 invalid evidence strings which on manual inspection all seem to use the new clinical significance values (e.g. "likely oncogenic" or "tier ii - potential"). Along with 46 records invalid for having multiple clinical classifications in the new XML structure, this is a marked increase in somatic or oncogenic records we are dropping by not supporting the new XML structure for clinical classification.

I've extracted these from the output in this batch (invalid_evidence.json and multiple_classification.txt in logs directory) and will create an issue to look into it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant