Topic Modelling for the CMP617 class (NLP and IR). This project is about topic modelling using a Amazon Products Review Dataset.
This project was used to a class homework at the CMP617 class, the scripts are simple and very straight forward to run.
This project was developed to explore a BERTopic usage in comparison with LDA. The dataset used was the Amazon Products Review Dataset, which can be found at the data
folder. The article and the presentation are also at the apresentação e artigo
folder.
├── data
├── ... # Some raw data samples
|── apresentação e artigo
├── ... # Presentation and Article
main_research.ipynb # Main research notebook
Feel free to contribute to this project by forking it and submitting a pull request. If you have any questions, please contact me at [email protected]