Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duplicates in node list of CHIRO, version 2015-11-23 #208

Open
LucaCappelletti94 opened this issue Jul 20, 2023 · 1 comment
Open

Duplicates in node list of CHIRO, version 2015-11-23 #208

LucaCappelletti94 opened this issue Jul 20, 2023 · 1 comment
Labels
bug Something isn't working

Comments

@LucaCappelletti94
Copy link

Describe the bug

The provided graph name and version contains the following duplicated nodes: ["GO:0003824", "NCBITaxon:2"]

To Reproduce

Investigate node list.

Expected behavior

Node list should not contain duplicates

@LucaCappelletti94 LucaCappelletti94 added the bug Something isn't working label Jul 20, 2023
@LucaCappelletti94 LucaCappelletti94 changed the title CHIRO, version 2015-11-23 Duplicates in node list od CHIRO, version 2015-11-23 Jul 20, 2023
@LucaCappelletti94 LucaCappelletti94 changed the title Duplicates in node list od CHIRO, version 2015-11-23 Duplicates in node list of CHIRO, version 2015-11-23 Jul 20, 2023
@caufieldjh
Copy link
Collaborator

Looks like one of nodes is defined twice in the CHIRO OWL:

$ grep NCBITaxon_2 chiro.owl
    <!-- http://purl.obolibrary.org/NCBITaxon_2 -->
    <owl:Class rdf:about="http://purl.obolibrary.org/NCBITaxon_2"/>
                        <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/obo/NCBITaxon_2"/>
                <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/obo/NCBITaxon_2"/>
                        <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/NCBITaxon_2"/>
                <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/NCBITaxon_2"/>
    <!-- http://purl.obolibrary.org/obo/NCBITaxon_2 -->
    <owl:Class rdf:about="http://purl.obolibrary.org/obo/NCBITaxon_2"/>

Not sure why GO:0003824 appears twice.

Anyway, looks like we need an additional normalization step to handle these cases. I'm surprised it doesn't happen more often.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants