From 50492e284ef5dd8dbae7ac5281d19664235d97d9 Mon Sep 17 00:00:00 2001
From: Carolyn Caron
Date: Wed, 13 Dec 2023 15:26:22 -0600
Subject: [PATCH] Re-organization of germplasm data under curation

---
 _data/toc.yml                                 |  8 ++--
 .../{ => genotypic-data}/genotypic-data.md    |  0
 .../germplasm-accessions-importer.md          | 48 +++++++++++++++++++
 .../germplasm-collections-importer            |  7 +++
 .../germplasm-data/germplasm-crosses-importer |  7 +++
 .../{ => germplasm-data}/germplasm-data.md    |  0
 6 files changed, 66 insertions(+), 4 deletions(-)
 rename _docs/curation/{ => genotypic-data}/genotypic-data.md (100%)
 create mode 100644 _docs/curation/germplasm-data/germplasm-accessions-importer.md
 create mode 100644 _docs/curation/germplasm-data/germplasm-collections-importer
 create mode 100644 _docs/curation/germplasm-data/germplasm-crosses-importer
 rename _docs/curation/{ => germplasm-data}/germplasm-data.md (100%)

diff --git a/_data/toc.yml b/_data/toc.yml
index 3983034..bbca2cc 100644
--- a/_data/toc.yml
+++ b/_data/toc.yml
@@ -3,13 +3,13 @@
 links:
   - title: "Germplasm Data"
     url: "docs/curation/germplasm-data"
-    children:
+    links:
       - title: "Germplasm Accessions"
-        url: "docs/curation/germplasm-data#germplasm-accessions"
+        url: "docs/curation/germplasm-data/germplasm-accessions-importer"
       - title: "Germplasm Crosses"
-        url: "docs/curation/germplasm-data#germplasm-crosses"
+        url: "docs/curation/germplasm-data/germplasm-crosses-importer"
       - title: "Germplasm Collections"
-        url: "docs/curation/germplasm-data#germplasm-collections"
+        url: "docs/curation/germplasm-data/germplasm-collections-importer"
   - title: "Phenotypic Data"
     url: "docs/curation/phenotypic-data"
   - title: "Genotypic Data"
diff --git a/_docs/curation/genotypic-data.md b/_docs/curation/genotypic-data/genotypic-data.md
similarity index 100%
rename from _docs/curation/genotypic-data.md
rename to _docs/curation/genotypic-data/genotypic-data.md
diff --git a/_docs/curation/germplasm-data/germplasm-accessions-importer.md b/_docs/curation/germplasm-data/germplasm-accessions-importer.md
new file mode 100644
index 0000000..dd9f55d
--- /dev/null
+++ b/_docs/curation/germplasm-data/germplasm-accessions-importer.md
@@ -0,0 +1,48 @@
+---
+title: Germplasm Accessions Importer
+tags:
+  - germplasm
+  - importers
+description: This is documentation pertaining to import of germplasm accessions data into the Chado database.
+---
+
+The Germplasm Accessions Importer can be accessed by going to "Administration > Tripal > Data Loaders > Tripal Cultivate: Germplasm Accessions".
+
+### Select Genus
+First, select the genus of the accessions in the file you wish to upload from the dropdown menu. All accessions in one file must belong to the same genus and match this selection. Keeping germplasm from multiple genera in the same file is not recommended in any case, because assigned names can be reused across genera and cause confusion during curation.
+
+### File Format
+The file should be formatted as a TSV or TXT file with up to 12 columns in the following order:
+1. **Germplasm Name**: The name of the germplasm.
+2. **External Database**: The name of the institution that assigned the accession. This name must already be present in the Chado `db` table and match exactly.
+3. **Accession Number**: A unique identifier for the germplasm accession.
+4. **Germplasm Species**: The species of the accession.
+5. **Germplasm Subtaxa**: A subtaxon can be used to store any additional taxonomic identifier. Specify the rank below species, followed by the name. For example, if the full scientific name is `Daucus carota subsp. sativus`, then the value for this column will be `subsp. sativus`. Full and abbreviated versions of rank are supported. The following abbreviations are allowed: subsp. (for subspecies); convar. (for convariety); var. (for variety or varietas); subvar. (for subvariety or subvarietas); f. (for form or forma); subf. (for subform or subforma); Group (for cultivar group).
+6. **Institute Code**: The FAO WIEWS code of the institute where the accession is maintained. An FAO WIEWS Institute Code consists of the 3-letter ISO 3166-1 alpha-3 country code of the country where the institute is located plus a number (e.g. COL001, USA1004).
+7. **Institute Name**: The name of the institute that bred the material.
+8. **Country of Origin Code**: The 3-letter ISO 3166-1 alpha-3 code of the country in which the sample was originally collected, bred or selected. Codes can be [found here](https://en.wikipedia.org/wiki/ISO_3166-1_alpha-3).
+9. **Biological Status of Accession**: The 3-digit code representing the biological status of the accession. The coding scheme for biological status can be used at two different levels of detail: either as a general code (e.g. 100, 200) or a more specific code (e.g. 110, 120).
+    - 100 Wild
+    - 110 Natural
+    - 120 Semi-natural/wild
+    - 130 Semi-natural/sown
+    - 200 Weedy
+    - 300 Traditional cultivar/landrace
+    - 400 Breeding/research material
+    - 410 Breeder’s line
+    - 411 Synthetic population
+    - 412 Hybrid
+    - 413 Founder stock/base population
+    - 414 Inbred line (parent of hybrid cultivar)
+    - 415 Segregating population
+    - 416 Clonal selection
+    - 420 Genetic stock
+    - 421 Mutant (e.g. induced/insertion mutant, tilling population)
+    - 422 Cytogenetic stock (e.g. chromosome addition/substitution, aneuploid, amphiploid)
+    - 423 Other genetic stock (e.g. mapping population)
+    - 500 Advanced or improved cultivar (conventional breeding methods)
+    - 600 GMO (by genetic engineering)
+    - 999 Other
+10. **Breeding Method**: The unique identifier for the breeding method used to create this germplasm.
+11. **Pedigree**: The parental selection history of this germplasm. It is recommended to use the format: `maternal-parent\paternal-parent`
+12. **Synonyms**: A list of alternative names or IDs used to reference this germplasm. Multiple synonyms can be separated using commas or semicolons (e.g. `synonym1,synonym2`).
\ No newline at end of file
diff --git a/_docs/curation/germplasm-data/germplasm-collections-importer b/_docs/curation/germplasm-data/germplasm-collections-importer
new file mode 100644
index 0000000..ce407bf
--- /dev/null
+++ b/_docs/curation/germplasm-data/germplasm-collections-importer
@@ -0,0 +1,7 @@
+---
+title: Germplasm Collections Importer
+tags:
+  - germplasm
+  - importers
+description: This is documentation pertaining to import of germplasm collections data into the Chado database.
+---
\ No newline at end of file
diff --git a/_docs/curation/germplasm-data/germplasm-crosses-importer b/_docs/curation/germplasm-data/germplasm-crosses-importer
new file mode 100644
index 0000000..594edcd
--- /dev/null
+++ b/_docs/curation/germplasm-data/germplasm-crosses-importer
@@ -0,0 +1,7 @@
+---
+title: Germplasm Crosses Importer
+tags:
+  - germplasm
+  - importers
+description: This is documentation pertaining to import of germplasm cross data into the Chado database.
+---
\ No newline at end of file
diff --git a/_docs/curation/germplasm-data.md b/_docs/curation/germplasm-data/germplasm-data.md
similarity index 100%
rename from _docs/curation/germplasm-data.md
rename to _docs/curation/germplasm-data/germplasm-data.md