batch ingest should not duplicate objects #202

Open
ebenenglish opened this issue Nov 7, 2019 · 0 comments
Comments

@ebenenglish
Collaborator

This is a placeholder ticket; more detail to come.

Re-running a batch ingest (for example, after an error or failure the first time around) should not re-create objects that already exist.

There is some existing functionality for preventing duplicate NewspaperTitle objects from being created (see #175), but nothing currently exists for Container, Issue, Page, Article, etc.

To make this work, we'll need to identify some metadata attributes (or combination of attributes) that can be used to create a unique value for the above objects so that the batch ingest script can test if they already exist.

This is essentially just a typical Rails find_or_create pattern.
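
As a rough illustration of the idea, here is a minimal plain-Ruby sketch of a find-or-create check keyed on a combination of attributes. Everything in it is an assumption made for illustration: the `ObjectStore` class, the `unique_key` helper, and the choice of `lccn`, `publication_date`, and `edition` as identifying attributes are hypothetical. The attributes that actually identify a Container, Issue, Page, or Article still need to be decided, and a real ingest would query the repository rather than an in-memory hash.

```ruby
require "digest"

# Hypothetical in-memory stand-in for the repository. A real batch ingest
# would look objects up in the application's datastore instead.
class ObjectStore
  attr_reader :created_count

  def initialize
    @objects = {}
    @created_count = 0
  end

  # Build a deterministic key from the object type plus its identifying
  # attributes. Attributes are sorted by name and values are normalized so
  # that key order, case, and stray whitespace do not produce a new key.
  def self.unique_key(type, attrs)
    normalized = attrs.sort_by { |name, _| name.to_s }
                      .map { |name, value| "#{name}=#{value.to_s.strip.downcase}" }
    Digest::SHA256.hexdigest(([type.to_s] + normalized).join("|"))
  end

  # Return the existing object for this key, or create and store a new one.
  def find_or_create(type, attrs)
    key = self.class.unique_key(type, attrs)
    @objects.fetch(key) do
      @created_count += 1
      @objects[key] = { type: type, attrs: attrs, key: key }
    end
  end
end

# Assumed identifying attributes for an Issue (placeholder values).
store = ObjectStore.new
issue_attrs = { lccn: "sn00000000", publication_date: "1919-11-07", edition: 1 }

first  = store.find_or_create(:issue, issue_attrs)
second = store.find_or_create(:issue, issue_attrs) # simulates re-running the ingest
```

With this approach the second call returns the object created by the first, so re-running the ingest over the same input does not add anything. Including the object type in the key keeps, say, an Issue and a Page with the same attribute values from colliding.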
