Skip to content

Commit

Permalink
Re-organization of germplasm data under curation
Browse files Browse the repository at this point in the history
  • Loading branch information
carolyncaron committed Dec 13, 2023
1 parent 8384f1f commit 50492e2
Show file tree
Hide file tree
Showing 6 changed files with 66 additions and 4 deletions.
8 changes: 4 additions & 4 deletions _data/toc.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,13 +3,13 @@
links:
- title: "Germplasm Data"
url: "docs/curation/germplasm-data"
children:
links:
- title: "Germplasm Accessions"
url: "docs/curation/germplasm-data#germplasm-accessions"
url: "docs/curation/germplasm-data/germplasm-accessions-importer"
- title: "Germplasm Crosses"
url: "docs/curation/germplasm-data#germplasm-crosses"
url: "docs/curation/germplasm-data/germplasm-crosses-importer"
- title: "Germplasm Collections"
url: "docs/curation/germplasm-data#germplasm-collections"
url: "docs/curation/germplasm-data/germplasm-collections-importer"
- title: "Phenotypic Data"
url: "docs/curation/phenotypic-data"
- title: "Genotypic Data"
Expand Down
File renamed without changes.
48 changes: 48 additions & 0 deletions _docs/curation/germplasm-data/germplasm-accessions-importer.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,48 @@
---
title: Germplasm Accessions Importer
tags:
- germplasm
- importers
description: This is documentation pertaining to import of germplasm accessions data into the Chado database.
---

The Germplasm Accessions Importer can be accessed by going to "Administration > Tripal > Data Loaders > Tripal Cultivate: Germplasm Accessions"

### Select Genus
First, select the genus of the accessions in the file you wish to upload by selecting it from the dropdown menu. All accessions in one file must belong to the same genus and match this selection. Regardless, it is not recommended to maintain germplasm data across multiple genus within the same file, as assigned names can be re-used across genus and cause confusion during curation.

### File Format
The file should be formatted as a TSV or TXT file with up to 12 columns in the following order:
1. **Germplasm Name**: The name of the germplasm.
2. **External Database**: The name of the institution who assigned the accession. This name must already be present in the chado table `db` and match exactly.
3. **Accession Number**: A unique identifier for the germplasm accession.
4. **Germplasm Species**: The species of the accession.
5. **Germplasm Subtaxa**: A subtaxon can be used to store any additional taxonomic identifier. The rank below species is specified, followed by the name. For example, if the full scientific name is `Daucus carota subsp. sativus`, then the value for this column will be `subsp. sativus`. Full and abbreviated versions of rank are supported. The following abbreviations are allowed: subsp. (for subspecies); convar. (for convariety); var. (for variety or varietas); subvar. (subvariety or subvarietas) f. (for form or forma); subf. (for subform or subforma); Group (for cultivar group).
6. **Institute Code**: The FAO WIEWS code of the institute where the accession is maintained. An FAO WIEWS Institute Code consists of the 3-letter ISO 3166-1 alpha 3 country code of the country where the institute is located plus a number (e.g. COL001, USA1004).
7. **Institute Name**: The name of the Institute that has bred the material.
8. **Country of Origin Code**: A 3-letter ISO 3166-1 code of the country in which the sample was originally collected, bred or selected. Codes can be [found here](https://en.wikipedia.org/wiki/ISO_3166-1_alpha-3).
9. **Biological Status of Accession**: The 3 digit code representing the biological status of the accession. The coding scheme for biological status can be used at two different levels of detail: either as a general code (e.g. 100, 200) or a more specific code (e.g. 110, 120).
- 100 Wild
- 110 Natural
- 120 Semi-natural/wild
- 130 Semi-natural/sown
- 200 Weedy
- 300 Traditional cultivar/landrace
- 400 Breeding/research material
- 410 Breeder’s line
- 411 Synthetic population
- 412 Hybrid
- 413 Founder stock/base population
- 414 Inbred line (parent of hybrid cultivar)
- 415 Segregating population
- 416 Clonal selection
- 420 Genetic stock
- 421 Mutant (e.g. induced/insertion mutant, tilling population)
- 422 Cytogenetic stock (e.g. chromosome addition/substitution, aneuploid, amphiploid)
- 423 Other genetic stock (e.g. mapping population)
- 500 Advanced or improved cultivar (conventional breeding methods)
- 600 GMO (by genetic engineering)
- 999 Other
10. **Breeding Method**: The unique identifier for the breeding method used to create this germplasm.
11. **Pedigree**: The parental selection history of this germplasm. It is recommended to use the format: `maternal-parental\paternal-parent`
12. **Synonyms**: A list of alternative names or IDs used to reference this germplasm. Multiple synonyms can be separated using commas or semicolons (eg. `synonym1,synonym2`).
7 changes: 7 additions & 0 deletions _docs/curation/germplasm-data/germplasm-collections-importer
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
---
title: Germplasm Collections Importer
tags:
- germplasm
- importers
description: This is documentation pertaining to import of germplasm collections data into the Chado database.
---
7 changes: 7 additions & 0 deletions _docs/curation/germplasm-data/germplasm-crosses-importer
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
---
title: Germplasm Crosses Importer
tags:
- germplasm
- importers
description: This is documentation pertaining to import of germplasm cross data into the Chado database.
---
File renamed without changes.

0 comments on commit 50492e2

Please sign in to comment.