title | layout | author | date |
---|---|---|---|
How To Migrate Data Into Arctos |
default_toc |
Teresa J. Mayfield-Meyer |
2024-07-01 |
This How To walks through the steps of data migration. These steps are more completely outlined in the numbered issue templates in the Arctos Data Migration Github repository.
Before any data is entered, it is important that the collection metadata is complete.
- Github Issue Template: 3. Manage Collection
At least one person at any institution will have an operator account created as part of creation of the collection(s). Anyone else who will assist with data migration or data entry will need to follow the same steps to become an Arctos operator.
- Documentation: Agents
- Handbook instructions: How To Create and Manage Your Arctos Team (Users and Operators)
- Github Issue Template: 4. Create an Arctos Team
People and organizations are associated with almost all types of data available in Arctos so it is important to map the names in use in your current data to names available in Arctos. Ensure that all People (collectors, preparators, makers, identifiers, event assignors, donors) in your data file are included as Agents in Arctos and that there are no duplications (same person with name spelled different ways). If you are entering records one at a time, you will probably return to this step every time you enter something.
- Documentation: Agents
- Handbook instructions: How To Create Agents
- Github Issue Template: 5. Agents
Agents are a shared resource in Arctos and care should always be given to editing existing agents and efforts made to not create duplicate agents. At least one person at every institution should have permission to manage agents. This person should read the documentation about Agents above and respond in the appropriate Github issue that they understand that Agents are a shared resource in Arctos and they understand how to manage them. Once this is complete, they will be given permission to manage agents.
ALL Arctos records require an associated accession. If your institution does not currently use accessions, at least one "legacy accession" will need to be created to facilitate entry of catalog records into Arctos.
- Documentation: Accessions
- Handbook instructions: How To Create an Accession
- Github Issue Template: 6. Accessions
At least one person at every institution should have manage transactions access, this person should read the documentation above and and respond in the appropriate Github issue that they understand that Transactions are an important and useful tool for demonstrating collection building and use. Once this is complete, they will be given permission to manage transactions.
Identifications must match at least one Arctos Taxon Name exactly for every object record. Please note that taxon names are managed separately from classifications associated with those names. At this stage, the only concern is the name used in the identification of a catalog record.
- Documentation: Identifications
- Handbook instructions: How to Create Taxa and How to Edit Taxa.
- Github Issue Template: 7. Identifications (Taxon Names)
At least one person at every institution should have manage taxonomy access, this person should read the documentation above and and respond in the appropriate Github issue that they understand that Taxonomy is a community resource and managing it means a commitment to the community that they will do their best to reduce the possibility of misspelled taxon names and to only edit classifications with information that they are certain make the classification more accurate. Once this is complete, they will be given permission to manage taxonomy.
Higher Geography values (Country, State, County) in your data file must exist in Arctos before data can be bulkloaded.
- Documentation: Higher Geography
- Github Issue Template: 8. Higher Geography
Every object record may be assigned a locality (e.g. place of collection). Localities include coordinates used for mapping.
- Documentation: Localities
- Handbook instructions: How To Create a Locality
- Github Issue Template: 9. Locality
Localities may be treated in several different ways.
- If you have many specimens from a single specific locality, you may want to create that locality, name it, and use the locality name in your bulkload file.
- You can also bulkload your data with the localities you have entered
Localities that are exactly alike should load as a single locality shared by many specimens, but just one little difference (a capital letter or period, for example) will create two localities when you think there is only one. These can be merged later, if you find them.
At least one person at every institution should have manage locality access, this person should read the documentation above and and respond in the appropriate Github issue that they understand that localities are a community resource and managing them means a commitment to the community that they will do their best to reduce the possibility of misspelled place names and to only edit localities with information that they are certain make the location more accurate. Once this is complete, they will be given permission to manage locality.
At this point, records can be loaded to Arctos and the remaining data can be added to them using component loader tools. This can simplify the process of data migration and allow for more detailed review of components such as attributes, parts, collectors, preparators, and identifiers.
- Github Issue Template: 11. Bulkload Legacy Data
People who participate in the Collection Object: Agent Roles related to a catalog record must be Agents. If a person in a catalog record agent role is not well-known enough to create an agent, their role can be recorded using the verbatim agent atttribute with the role listed in the attribute method.
- Documentation: Attributes
- Handbook instructions: How to Create and Edit Catalog Record Attributes
If not loaded with the rest of the catalog record, agent roles may be loaded with the Agent Role: Bulkload tool.
Part names (object descriptions) must match to the available Part: Names in Arctos. Part names must be selected for use by each individual collection. For instructions on how to select the parts you want to use - see How To Video. If you need to add a part name that is not included in the code table, you will need to request it and provide a definition for the part.
- Documentation: Parts
- Handbook instructions: How To Create and Edit Parts
- Github Issue Template: 10. Parts
Parts may be associated with many part attributes and the attribute type must match a term in Part: Attribute Types in Arctos. It is preferable to use part attributes rather than placing this information in part remark so that the data is more discoverable.
If not loaded with the rest of the catalog record, parts may be loaded with the Part+Attribute Bulkload tool.
Attributes must match to the available Collection Object Attribute: Types in Arctos. Attributes must be selected for use by each individual collection. For instructions on how to select the attributes you want to use - see How To Video. If you need to add an attribute, you will need to request it and possibly provide a controlled vocabulary for the attribute value (for example, sex is the attribute, with possible values of male, female, etc.).
- Documentation: Attributes
- Handbook instructions: How to Create and Edit Catalog Record Attributes
If not loaded with the rest of the catalog record, attributes may be loaded with the Attribute: Bulkload tool.
Every catalog record in Arctos has a catalog number, but often objects have been shared, transferred, or given identifiers by individuals other than the institution that holds them now. Every catalog record in Arctos can have multiple identifiers other than the Arctos catalog number. In addition to identifiers assigned to the cataloged item(s), identifiers may be used to document relationships to other catalog records or data.
- Documentation: Other Identifying Numbers (Other IDs) and Relationships
- Handbook instructions: How To Record Prior Inclusion in Another Collection
If not loaded with the rest of the catalog record, identifiers may be loaded with the Identifier: Bulkload tool.
If you see something that needs to be edited in this document, you can create an issue using the link under the search widget at the top left side of this page, or you can edit directly here.