About the data
Tang and Song poems are from github.com/chinese-poetry. Their repository is distributed under an MIT license.
The pre-Tang poetry has been checked out from a database; a number of formatting errors and idiosyncrasies will affect the search results.