Prediction of mosquito species and population age structure using mid-infrared spectroscopy and supervised machine learning
Mario González-Jiménez1*, Simon A. Babayan2*, Pegah Khazaeli1, Margaret Doyle2, Finlay Walton1, Elliott Reedy1, Thomas Glew1, Mafalda Viana2, Lisa Ranford-Cartwright2, Abdoulaye Niang4, Doreen J. Siria3, Fredros O. Okumu2,3, Abdoulaye Diabaté4, Heather M. Ferguson2, Francesco Baldini2, and Klaas Wynne1
1 School of Chemistry, University of Glasgow, Glasgow G12 8QQ, UK.
2 Institute of Biodiversity Animal Health and Comparative Medicine, University of Glasgow, Glasgow G12 8QQ, UK.
3 Environmental Health & Ecological Sciences Department, Ifakara Health Institute, Off Mlabani Passage, PO Box 53, Ifakara, Tanzania
4 Department of Medical Biology and Public Health, Institut de Recherche en Science de la Santé (IRSS), Bobo-Dioulasso, Burkina Faso
* These authors contributed equally to this work.
This repository contains code used to process wave number readings, to generate models for the classification of mosquito species (see Fig 2), their ages (Fig 3), and population age structure (Fig 4).
Loco mosquito.ipynb
contains code for processing outputs of the mid-infrared spectrometer into datasets suitable for machine learningOpWT_classification.py
contains a summary of the code used for training classification models for both species and age class prediction.OpWT_population_structure.py
contains code used to compare true and predicted age structures of a simulated population. Predictions were made using the most accurate model trained inOpWT_classification.py