This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects the most recent achievements with respect to the automatic build-up of large lexical resources. It addresses researchers that already perform text mining, and those who want to enrich their battery of methods. Selected articles can be used to support graduate-level teaching.
The book is suitable for all readers that completed undergraduate studies of computational linguistics, quantitative linguistics, computer science and computational humanities. It assumes basic knowledge of computer science and corpus processing as well as of statistics.
Foreword.- PART I. Text Mining Techniques and Methodologies.-? Thomas Eckart, Dirk Goldhahn, and Uwe Quasthoff: Building large resources for text mining.- Hristo Tanev: Learning Textologies: Networks of Linked Word Clusters.- Zornitsa Kozareva: Simple, Fast and Accurate Taxonomy Learning.- Patrick Oesterling, Christian Heine, Gunther H. Weber and Gerik Scheuermann: A Topology-Based Approach to Visualize the Thematic Composition of Document Collections.- Alexander Mehler, Tim vor der Br?ck, R?diger Gleim and Tim Geelhaar: Towards a Network Model of the Coreness of Texts; An Experiment in Classifying Latin Texts using the TTLab Latin Tagger.- PART II. Text Mining Applications. Stefan Bordag and Christian H?nig and Christian Beutenm?ller: A strl£)