We've significantly enhanced our data extraction capabilities with several key updates:
- Extract now returns a lot more data due to a new re-ranker system
- Improved infrastructure reliability
- Migrated from Cheerio to a high-performance Rust-based parser for faster and more memory-efficient parsing
- Enhanced crawl cancellation functionality for better control over running jobs
What's Changed
- Added "today" to extract prompts by @rafaelsideguide in #1084
- docs: update cancel crawl response by @ftonato in #1087
- port most of cheerio stuff to rust by @mogery in #1089
- Re-ranker changes by @nickscamara in #1090
- Rerank with lower threshold + back to map if length = 0 by @rafaelsideguide in #1086
Full Changelog: v1.4.0...1.4.1