Start Pages for WE1S Collections

WE1S studies a corpus of journalistic media and other documents related to the humanities that it harvested for text analysis (but does not store or make available as readable text due to copyright constraints).* This corpus is organized as approximately 30 “collections” (combinations of different kinds of sources and year ranges) to facilitate exploring different research questions. Collections are represented and made available as word frequency, topic modeling, and other data generated from analyzing the original texts. "Start pages" describe each collection and provide links to the datasets, topic models, and their visualizations. [Start pages on this page are under construction.]


____________________ * WE1S makes available only derived-data, “non-consumptive use” word frequency, topic model, and other datasets along with their visualizations. Datasets cannot be used to access, read, or reconstruct the original texts.