Please enter your username below and press the send button.A password reset link will be sent to you.
If you are unable to access the email address originally associated with your Delicious account, we recommend creating a new account.
This link recently saved by vancestevens on March 08, 2012
This link recently saved by vancestevens on January 13, 2011
Despite drawbacks, the World Wide Web is a mine of language data of unprecedented richness and ease of access.
It is also the only viable source of "disposable" corpora built ad hoc for a specific purpose. These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
While it is possible to construct a web-based corpus through manual queries and downloads, this process is extremely time-consuming.
The perl scripts included in the BootCaT toolkit implement an iterative procedure to bootstrap specialized corpora and terms from the web, requiring only a list of "seeds" (terms that are expected to be typical of the domain of interest) as input.