Übung zum Hauptseminar unter "Zusatzlehrveranstaltungen"
Research tasks in the eHumanities typically require the analysis of natural language text or combinations of text plus images and other semiotic modes of representation. In the past 30 odd years the large-scale introduction of the personal computer to the humanities workspace has opened up the possibility of harnessing considerable computing power by means of bespoke analysis tools. What is more, through the advent of the internet, electronic publishing and digitization, the humanities have realized and incorporated new possibilities for data analysis and dissemination. The evolution of the fields of research subsumed under the term eHumanities has brought with it vast advances in the methods and tools employed in textual and multisemiotic analysis. One specific issue in this context is the question of the joint use of data from different research perspectives such as different fields within the philologies, e.g. the joint use of the same text repositories by computer philology and corpus and computational linguistics.
In this seminar, we are going to explore approaches to digital analysis in the eHumanities in terms of their suitability for different research questions and tasks. The focus will be on research in linguistics and related philological fields. In the seminar, we are going to look into the development of the eHumanities as a field and the state of the play today. We are also going to look into specific issues such as standards of data encoding (e.g. XML / Text Encoding Initiative P5), long-term data preservation, and the interoperability between tools for specific research tasks and especially research infrastructures (e.g. TextGrid).
The seminar is interdisciplinary from the outset and aims to include the humanities (linguistics, literary and textual studies, and historical sciences) and the computer and information sciences (computer science, computational linguistics etc.).
Students are going to get the opportunity to explore and critically discuss many aspects of the research questions, tasks and processing scenarios currently encountered in the eHumanities. The aim of the seminar is to raise awareness of the diverse research traditions underlying the disciplines involved in the eHumanities in the light of the consequences for planning and setting up research scenarios.
The seminar is accompanied by weekly practical exercises in which students can explore eHumanities research scenarios based on commonly used tools. These comprise stand-alone tools (e.g. the Stanford NLP Tools), tools in linguistic frameworks and processing pipelines (e.g. GATE) and research infrastructures (e.g. TextGrid, eHumanities Desktop) as well as different solutions for querying, analyzing and visualizing data.
LinguisticsWeb.org – a web of information for students and researchers in linguistics: http://www.linguisticsweb.org
H. Cunningham, et al. Text Processing with GATE (Version 6). University of Sheffield Department of Computer Science. 15 April 2011.
McEnery, Tony, Andrew Wilson. 2001. Corpus linguistics. An introduction. 2nd edition. Edinburgh: Edinburgh University Press.
McEnery, A., Z. Xiao & Y. Tono. 2005. Corpus-Based Language Studies : An Advanced Resource Book. London : Routledge.
Empfehlungen des Wissenschaftsrats zu Forschungsinfrastrukturen in den Geistes- und Sozialwissenschaften: http://www.wissenschaftsrat.de/download/archiv/10465-11.pdf
Neuroth, Heike, Fotis Jannidis, Andrea Rapp, Felix Lohmeier: Virtuelle Forschungsumgebungen für e-Humanities. Maßnahmen zur optimalen Unterstützung von Forschungsprozessen in den Geisteswissenschaften. In: Bibliothek. Forschung und Praxis 2009/2, S. 161-169, www.bibliothek-saur.de/2009_2/part2.pdf
The Stanford NLP Group - software: http://nlp.stanford.edu/software/
Lehrveranstaltungskonzept 1 | Sabine Bartsch 1 | 2