Roadmap

This project is still in an exploratory stage. Here is my current plan:

v0.1 - MVP

Enable iterative refinement
Add support for unsupervised losses (?)
Publish pre-trained models for general categories (e.g. Humor, Drama) as well as a large unsupervised model trained on the entire dataset.
Add a basic CLI:
- fastexcerpt train --dataset ao3.jsonl.bz2 --model model.bin
- fastexcerpt excerpts --model model.bin --path_to_file example.txt
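
The two commands above could be wired up roughly as follows. This is only a sketch using Python's standard `argparse`, assuming the subcommand and flag names from the list; `build_parser` is a hypothetical helper, not existing fastexcerpt code, and the training and excerpting logic itself is left out.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser for the planned `fastexcerpt` CLI (not yet implemented)."""
    parser = argparse.ArgumentParser(prog="fastexcerpt")
    # One subcommand per planned action; `command` records which one was chosen.
    subparsers = parser.add_subparsers(dest="command", required=True)

    # fastexcerpt train --dataset ao3.jsonl.bz2 --model model.bin
    train = subparsers.add_parser("train", help="Train a model on a dataset.")
    train.add_argument("--dataset", required=True, help="Path to the training data.")
    train.add_argument("--model", required=True, help="Where to write the trained model.")

    # fastexcerpt excerpts --model model.bin --path_to_file example.txt
    excerpts = subparsers.add_parser("excerpts", help="Extract excerpts from a text file.")
    excerpts.add_argument("--model", required=True, help="Path to a trained model.")
    excerpts.add_argument("--path_to_file", required=True, help="Text file to excerpt.")

    return parser


# Parse an explicit argument list so the sketch runs without a real command line.
args = build_parser().parse_args(
    ["train", "--dataset", "ao3.jsonl.bz2", "--model", "model.bin"]
)
```

Using subcommands keeps `train` and `excerpts` independent, so each can grow its own options later without affecting the other.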