This is a data set about the Hathitrust collection. It features part-of-speech tagged term token counts, header/footer identification, marginal character counts, and more.
A "national search engine for economic development subsidies and other forms of government financial assistance to business" that draws on information from numerous state government programs.