To train machine-learning classifiers and evaluate the effectiveness of the different approaches, we manually create a benchmark, in which emails are classified at character granularity.
Given the time and effort needed to create such a benchmark, we humbly think it is a valuable contribution to the community. With the help of this benchmark, other researchers can reproduce our experiments and devise new classification methods, which can be immediately compared to ours.
For this reason, we make our benchmark publicly available. You can download the dataset from the GitHub repository, or download the full database dump. If you need another format (e.g., JSON) to better suite your needs, please contact us.