Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using kedro for a data dictionary #1234

Closed
eepgwde opened this issue Feb 10, 2022 · 2 comments
Closed

Using kedro for a data dictionary #1234

eepgwde opened this issue Feb 10, 2022 · 2 comments
Labels
Community Issue/PR opened by the open-source community Issue: Feature Request New feature or improvement to existing feature

Comments

@eepgwde
Copy link

eepgwde commented Feb 10, 2022

Description

Is your feature request related to a problem? A clear and concise description of what the problem is: "I'm always frustrated when ..."

Most Data Warehouses have a painstakingly generated and curated data dictionary. Every field of every table is politely described, its usages are located and it is possible to edit descriptions and add caveats.

Context

Why is this change important to you? How would you use it? How can it benefit other users?

Kedro has a very good catalog for tables and pipelines. But nothing for columns or schema or metadata.

Possible Implementation

(Optional) Suggest an idea for implementing the addition or change.

Would it be possible to add a "metadata" pipeline that can go through all the tables that currently exist and catalog their columns.

Possible Alternatives

(Optional) Describe any alternative solutions or features you've considered.

Altova and other companies do have software solutions for databases and spreadsheets. These work by using some XML schema extraction and building a nested DOM.

@eepgwde eepgwde added the Issue: Feature Request New feature or improvement to existing feature label Feb 10, 2022
@datajoely
Copy link
Contributor

Hi @eepgwde I would absolutely love this - it's come up before and I would love to introduce this eventually or perhaps see an open source kedro-data-docs plugin emerge like we've seen for MLFlow, Dolt and Neptune :)

Would you mind adding your thoughts to #1076 since it's actually talking about this very problem 🚀

datajoely pushed a commit that referenced this issue Feb 11, 2022
[AUTO-MERGE] Merge master into develop via merge-master-to-develop
@merelcht merelcht added the Community Issue/PR opened by the open-source community label Mar 7, 2022
@merelcht
Copy link
Member

merelcht commented Apr 4, 2022

Closing this ticket now in favour of continuing the discussion in #1076

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Community Issue/PR opened by the open-source community Issue: Feature Request New feature or improvement to existing feature
Projects
None yet
Development

No branches or pull requests

3 participants