A Streamlit-based web application that uses Ollama to detect plagiarism in text documents and to compare multiple files for similarity.
- Direct text input for plagiarism checking
- Sentence-by-sentence analysis
- Real-time progress tracking
- Google search integration for source detection
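The sentence-by-sentence analysis with progress tracking could look roughly like the sketch below. It is only an illustration: `split_sentences` is a naive regex stand-in for the tokenizer the app gets from `nltk`, and `check_fn` is a hypothetical callback standing in for whatever per-sentence check (search lookup, model call) the app actually performs.

```python
import re
from typing import Callable


def split_sentences(text: str) -> list:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace.

    Assumption: a stand-in for a real sentence tokenizer such as nltk's.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def check_sentences(text: str, check_fn: Callable[[str], bool]) -> list:
    """Run check_fn on each sentence and record the progress fraction."""
    sentences = split_sentences(text)
    results = []
    for i, sentence in enumerate(sentences, start=1):
        results.append({
            "sentence": sentence,
            "flagged": check_fn(sentence),       # hypothetical per-sentence check
            "progress": i / len(sentences),      # 0.0-1.0, suitable for a progress bar
        })
    return results
```

The `progress` value is a fraction between 0 and 1, so it can be passed to a progress widget as each sentence completes.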
- Multiple file format support:
  - PDF (.pdf)
  - Word (.docx)
  - Text (.txt)
- File preview functionality
- Multi-file comparison
- Similarity scoring (0-100%)
- Detailed analysis for each comparison
- Source URL tracking
- Customizable similarity threshold
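As a rough illustration of multi-file comparison with a 0-100% score and a configurable threshold, here is a minimal sketch. It assumes a simple character-level measure from the standard library's `difflib`; the app itself may compute similarity differently (for example through Ollama), and the function names and the default threshold of 50 are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similarity_percent(a: str, b: str) -> float:
    """Return a 0-100 similarity score for two strings (difflib ratio, scaled)."""
    return round(SequenceMatcher(None, a, b).ratio() * 100, 1)


def compare_files(texts: dict, threshold: float = 50.0) -> list:
    """Score every pair of files and flag pairs at or above the threshold.

    texts maps a file name to its extracted text (assumed already loaded).
    """
    rows = []
    for (name_a, text_a), (name_b, text_b) in combinations(texts.items(), 2):
        score = similarity_percent(text_a, text_b)
        rows.append({
            "file_a": name_a,
            "file_b": name_b,
            "similarity": score,
            "flagged": score >= threshold,
        })
    return rows
```

Each row holds one pairwise comparison, which maps naturally onto a results table or a bar chart of scores.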
- Interactive charts:
  - Bar charts
  - Scatter plots
- Progress indicators
- Metric displays
- Export options:
  - PDF reports
  - Excel spreadsheets
- Customizable report formats
- Timestamp-based file naming
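Timestamp-based file naming can be sketched as follows. The `report_filename` helper, the `prefix` value, and the `YYYYMMDD_HHMMSS` pattern are assumptions for illustration; the app's actual naming scheme may differ.

```python
from datetime import datetime
from typing import Optional


def report_filename(prefix: str, extension: str, now: Optional[datetime] = None) -> str:
    """Build a name such as '<prefix>_<YYYYMMDD_HHMMSS>.<extension>'.

    now is injectable so the result is reproducible; it defaults to the current time.
    """
    stamp = (now or datetime.now()).strftime("%Y%m%d_%H%M%S")
    return f"{prefix}_{stamp}.{extension.lstrip('.')}"
```

Because the timestamp is part of the name, successive exports do not overwrite each other as long as they are at least one second apart.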
- File Processing
- API Integration
- UI Components
- Dark theme
- Responsive design
- Three main tabs:
  - Text Input
  - File Check
  - File Comparison
- Overview metrics
- Interactive charts
- Detailed results table
- streamlit: Web interface
- pandas: Data handling
- nltk: Text processing
- beautifulsoup4: Web scraping
- plotly: Data visualization
- fpdf: PDF generation
- xlsxwriter: Excel report generation