Skip to content

Issues: IBM/data-prep-kit

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug] The build of fasttext==0.9.2 requires GCC v11 bug Something isn't working
#932 opened Jan 9, 2025 by burn2l
1 of 2 tasks
[Bug] Cleanup Makefiles, Dockerfiles and other assets used for CI/CD bug Something isn't working
#930 opened Jan 9, 2025 by touma-I
1 of 2 tasks
[Bug] Dependency conflict with requests>=2.2.3 bug Something isn't working
#925 opened Jan 8, 2025 by touma-I
1 of 2 tasks
[Feature] New transform to annotate with readability scores enhancement New feature or request
#923 opened Jan 8, 2025 by Harmedox
1 of 2 tasks
[Feature] New transform to collect token stats enhancement New feature or request
#922 opened Jan 8, 2025 by Harmedox
1 of 2 tasks
[Feature] New transform to remove repeating text sequences from documents enhancement New feature or request
#921 opened Jan 8, 2025 by Harmedox
1 of 2 tasks
[Bug] web2parquet is not a conforming transform implementation bug Something isn't working
#920 opened Jan 7, 2025 by daw3rd
1 of 2 tasks
[Bug] path issues when running superworkflow pipeline sample for kfp v2 bug Something isn't working
#909 opened Jan 2, 2025 by juancappi
2 tasks done
[Feature] Html2ParquetTransform support output_format_value json enhancement New feature or request
#908 opened Jan 2, 2025 by 1337stn
2 tasks done
[Bug] Multiple broken paths after folder relocation in kfp v2 documentation bug Something isn't working
#906 opened Jan 2, 2025 by juancappi
2 tasks done
[Feature] Create a LamaIndex2parquet ingector enhancement New feature or request
#905 opened Jan 2, 2025 by roytman
1 of 2 tasks
[Bug] Rootdir not found in util package for Document Quality Module bug Something isn't working
#903 opened Dec 30, 2024 by azka2001
2 tasks done
[Bug] bug Something isn't working
#902 opened Dec 25, 2024 by usmanxia
1 of 2 tasks
Added Focus on Spark-based scaling of transforms (medium priority) enhancement New feature or request
#884 opened Dec 16, 2024 by shahrokhDaijavad
1 of 2 tasks
Memory Consumption and Batch Processing in DPK (Medium Priority) enhancement New feature or request
#883 opened Dec 16, 2024 by shahrokhDaijavad
1 of 2 tasks
[Feature] Web2Parquet should expose all options supported by dpk_connector enhancement New feature or request
#876 opened Dec 13, 2024 by sujee
2 tasks done
[Bug] pip install data-prep-toolkit-transforms[all]==0.2.2 gets error bug Something isn't working
#873 opened Dec 12, 2024 by daw3rd
1 of 2 tasks
Simplify DPK Readme
#872 opened Dec 12, 2024 by agoyal26
[Feature] a transform to perform file level de-dupe (exact) enhancement New feature or request
#870 opened Dec 12, 2024 by sujee
2 tasks done
[Bug] ededup removes all samples if the document_id is an int bug Something isn't working
#868 opened Dec 10, 2024 by burn2l
2 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.