Corpus Builder for Scholarly Works
OpenMinTeD has set up a mechanism that provides access to scholarly and scientific content from a wide range of sources (publishers, repositories, journals, etc.) and lets users search that content and select the documents they want to mine. Selection works through either a faceted search or a Google-like free-text query, both based on the harmonised metadata descriptions of the documents (e.g. publication year, keywords, domain). The selected documents together form a collection, or "corpus".
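The selection step described above can be illustrated with a minimal sketch in Python. This is not the OpenMinTeD implementation or its API: the `Document` record, its field names, the sample catalogue, and the `build_corpus` helper are all hypothetical, and the free-text match is a naive substring test standing in for a real search engine.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    """Hypothetical harmonised metadata record (field names are assumptions)."""
    identifier: str
    title: str
    year: int
    domain: str
    keywords: tuple = ()


def matches_facets(doc, facets):
    """Return True if the document matches every requested facet value."""
    for name, wanted in facets.items():
        value = getattr(doc, name)
        if isinstance(value, tuple):
            # Multi-valued field (e.g. keywords): the wanted value must be present.
            if wanted not in value:
                return False
        elif value != wanted:
            return False
    return True


def matches_query(doc, query):
    """Naive free-text match: every query term occurs in the title or keywords."""
    text = " ".join((doc.title, *doc.keywords)).lower()
    return all(term in text for term in query.lower().split())


def build_corpus(documents, facets=None, query=""):
    """Return the identifiers of the documents selected by facets and query."""
    facets = facets or {}
    return [
        doc.identifier
        for doc in documents
        if matches_facets(doc, facets) and matches_query(doc, query)
    ]


# Made-up sample records, for illustration only.
catalogue = [
    Document("doc-1", "Text mining for biomedical literature", 2016,
             "life sciences", ("text mining", "biomedicine")),
    Document("doc-2", "Survey methods in social research", 2016,
             "social sciences", ("surveys",)),
    Document("doc-3", "Mining agricultural reports", 2014,
             "agriculture", ("text mining",)),
]

# Combine a facet (publication year) with a free-text query.
corpus = build_corpus(catalogue, facets={"year": 2016}, query="mining")
print(corpus)
```

In this sketch the corpus is just a list of document identifiers. Facets and the free-text query are combined with a logical AND, which is one possible design choice rather than a description of how the platform behaves.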