-
Notifications
You must be signed in to change notification settings - Fork 148
Issues: IBM/data-prep-kit
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] The build of fasttext==0.9.2 requires GCC v11
bug
Something isn't working
#932
opened Jan 9, 2025 by
burn2l
1 of 2 tasks
[Feature] Grow core library and transforms to enable easy launching of a transform in a runtime from a .py file
enhancement
New feature or request
#931
opened Jan 9, 2025 by
daw3rd
2 tasks done
[Bug] Cleanup Makefiles, Dockerfiles and other assets used for CI/CD
bug
Something isn't working
#930
opened Jan 9, 2025 by
touma-I
1 of 2 tasks
[Bug] Dependency conflict with requests>=2.2.3
bug
Something isn't working
#925
opened Jan 8, 2025 by
touma-I
1 of 2 tasks
[Feature] New transform to annotate with any classifier model with multi-classifier support
enhancement
New feature or request
#924
opened Jan 8, 2025 by
Harmedox
1 of 2 tasks
[Feature] New transform to annotate with readability scores
enhancement
New feature or request
#923
opened Jan 8, 2025 by
Harmedox
1 of 2 tasks
[Feature] New transform to collect token stats
enhancement
New feature or request
#922
opened Jan 8, 2025 by
Harmedox
1 of 2 tasks
[Feature] New transform to remove repeating text sequences from documents
enhancement
New feature or request
#921
opened Jan 8, 2025 by
Harmedox
1 of 2 tasks
[Bug] web2parquet is not a conforming transform implementation
bug
Something isn't working
#920
opened Jan 7, 2025 by
daw3rd
1 of 2 tasks
[Bug] path issues when running superworkflow pipeline sample for kfp v2
bug
Something isn't working
#909
opened Jan 2, 2025 by
juancappi
2 tasks done
[Feature] Html2ParquetTransform support output_format_value json
enhancement
New feature or request
#908
opened Jan 2, 2025 by
1337stn
2 tasks done
[Bug] Multiple broken paths after folder relocation in kfp v2 documentation
bug
Something isn't working
#906
opened Jan 2, 2025 by
juancappi
2 tasks done
[Feature] Create a LamaIndex2parquet ingector
enhancement
New feature or request
#905
opened Jan 2, 2025 by
roytman
1 of 2 tasks
[Bug] Rootdir not found in util package for Document Quality Module
bug
Something isn't working
#903
opened Dec 30, 2024 by
azka2001
2 tasks done
[Bug] cannot import name 'FdedupRayTransformConfiguration' from 'fdedup_transform_ray'
bug
Something isn't working
#898
opened Dec 20, 2024 by
MFahadShahid
1 of 2 tasks
Added Focus on Spark-based scaling of transforms (medium priority)
enhancement
New feature or request
#884
opened Dec 16, 2024 by
shahrokhDaijavad
1 of 2 tasks
Memory Consumption and Batch Processing in DPK (Medium Priority)
enhancement
New feature or request
#883
opened Dec 16, 2024 by
shahrokhDaijavad
1 of 2 tasks
[Feature] Web2Parquet should expose all options supported by dpk_connector
enhancement
New feature or request
#876
opened Dec 13, 2024 by
sujee
2 tasks done
[Bug] pip install data-prep-toolkit-transforms[all]==0.2.2 gets error
bug
Something isn't working
#873
opened Dec 12, 2024 by
daw3rd
1 of 2 tasks
[Feature] a transform to perform file level de-dupe (exact)
enhancement
New feature or request
#870
opened Dec 12, 2024 by
sujee
2 tasks done
[Bug] ededup removes all samples if the document_id is an int
bug
Something isn't working
#868
opened Dec 10, 2024 by
burn2l
2 tasks done
Making sure that we run the Jupyter lab inside venv, when running notebooks locally
enhancement
New feature or request
#856
opened Dec 4, 2024 by
shahrokhDaijavad
2 tasks done
Link to the three example notebooks of fdedup in the README file
enhancement
New feature or request
#848
opened Dec 2, 2024 by
shahrokhDaijavad
1 of 2 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.