1 Paul Bennett, Martin Durrell, Silke Scheible, Jason Whitt The GerManC Project A Representative Corpus of Early Modern German (1650-1800)

Slides:



Advertisements
Ähnliche Präsentationen
Separable Verbs There is a group of verbs in German called separable verbs = trennbare Verben.
Advertisements

die Zeiten (The Tenses) das Aktiv (Active Voice)
You need to use your mouse to see this presentation © Heidi Behrens.
You need to use your mouse to see this presentation © Heidi Behrens.
Passiv What are these sentences expressing?
1 Paul Bennett, Martin Durrell, Silke Scheible, Jason Whitt The GerManC Project A Representative Corpus of Early Modern German ( )
Der formelle Imperativ – the Imperative
Relativpronomen Der Mann ist mein Onkel. --Wir haben den Mann im Theater gesehen. You can express related ideas in separate sentences, or you can.
 Every part in a sentence has a grammatical function. Some common functions are: - Subject - Verb - Direct object / accusative object - Indirect object.
September 29th 2008 Dr. Bernhard Schmidt Lehrstuhl für Allgemeine Pädagogik und Bildungsforschung der LMU Perception of Age, Expectations of Retirement.
Konjugier,,sein”! ichwir du ihr er, sie,essie, Sie.
Networking on local area knowledge of territory-continuous presence in community (family-centre – people centre – key locations)
Review of Verb Tense & Expressing Opinions
Stephanie Müller, Rechtswissenschaftliches Institut, Universität Zürich, Rämistrasse 74/17, 8001 Zürich, Criminal liability.
Deutsch 1 G Stunde. Dienstag, der 13. November 2012 Deutsch 1, G Stunde Heute ist ein G- Tag Unit: Family & home Familie & Zuhause Question: Who / How.
Literary Machines, zusammengestellt für ::COLLABOR:: von H. Mittendorfer Literary MACHINES 1980 bis 1987, by Theodor Holm NELSON ISBN
Akkusativ Präpositionen
Arbeiten in einem agilen Team mit VS & TFS 11
3rd Review, Vienna, 16th of April 1999 SIT-MOON ESPRIT Project Nr Siemens AG Österreich Robotiker Technische Universität Wien Politecnico di Milano.
Collaborative Research Online: Knowledge management pilot project on Haskala Dr. Rachel Heuberger, Judaica Abteilung, Universitätsbibliothek Frankfurt.
What is a “CASE”? in English: pronouns, certain interrogatives
The Subjunctive What? -The subjunctive is used to express hypothetical situations. -A ‘mood’ used to express these situations Eg. If he came, we would.
What is a “CASE”? in English: pronouns, certain interrogatives
Museumsinsel Museum Island (German: Museumsinsel) is the name of the northern half of an island in the Spree river in the central Mitte district of Berlin,
Es gibt there is (singular) or there are (plural)
Present and past tense German 2. Basic present tense.
type / function / form type of words:
Schreiben Sie fünf Sätze aus diesen Elementen. [Beispiel
COMMANDS imperative There are three command forms: formal familiar singular familiar plural.
COMMANDS imperative 1. you (formal): Sie 2. you (familiar plural): ihr
The Workers‘ Freedom The debate about industrial democracy in Germany and Sweden, Klaus Neumann presentation held at the.
1/15 Thursday, 21 June 2007 Werner Sudendorf, Jürgen Keiper Deutsche Kinemathek – Museum für Film und Fernsehen Werner Sudendorf, Jürgen Keiper Reconstructing.
Kapitel 4: Mein Tag Sprache.
EUROPÄISCHE GEMEINSCHAFT Europäischer Sozialfonds EUROPÄISCHE GEMEINSCHAFT Europäischer Fonds für Regionale Entwicklung Workpackage 5 – guidelines Tasks.
Imperfekt (Simple Past) Irregular or strong verbs
Kapitel 2 Grammar INDEX 1.Subjects & Verbs 2.Conjugation of Verbs 3.Subject Verb Agreement 4.Person and Number 5.Present Tense 6.Word Order: Position of.
Kapitel 7 Grammar INDEX 1.Comparison 2.Adjectives 3.Adjective Endings Following Ein-Words.
Deutsch 1 G Stunde. Montag, der 3. Dezember 2012 Deutsch 1, G Stunde Heute ist ein E - Tag Unit: Family & home Familie & Zuhause Goal: to talk about,
EUROPÄISCHE GEMEINSCHAFT Europäischer Sozialfonds EUROPÄISCHE GEMEINSCHAFT Europäischer Fonds für Regionale Entwicklung Workpackage 5 – guidelines Tasks.
Caroline Euringer Hamburg University LEO.-App: Mobile phone application for self-testing in reading and writing Peer Learning Activity on the use of digital.
On the case of German has 4 cases NOMINATIVE ACCUSATIVE GENITIVE DATIVE.
Bayreuth Festspielhaus The Bayreuth Festspielhaus or Bayreuth Festival Theatre, is an opera house north of Bayreuth, Germany, dedicated solely to the performance.
G Stunde DEUTSCH 1.  Unit: Family & homeFamilie & Zuhause  Objectives:  Phrases about date, weather and time-telling  Family and family relations.
B LOCKED DAY 1 OBJECTIVES: To consolidate vocabulary and structures within the theme of DIE UMWELT To further practise the techniques used in the prose.
Sven Koerber-Abe, 2016 Grammatik: Artikel (Zusammenfassung) Grammatik: Artikel (Zusammenfassung)
© Boardworks Ltd of 8 © Boardworks Ltd of 8 This icon indicates that the slide contains activities created in Flash. These activities are not.
DAS VIERTE DEUTSCHE KASUS Genitiv. Kasus ● What is a case? A case shows the grammatical function of a word. ● There are four cases in German. Up to now.
Karl der Große January 28 - Feast Day of St. Karl der Große (Charlemagne) (ca ) Karl der Große or Charlemagne was born near Aachen in about 742.
Sentence Structure Questions
Agenda Eröffnung und Begrüßung durch Mag.a Elisabeth Rosenberger
Dom zu Lübeck The Lübeck Cathedral (German: Dom zu Lübeck, or colloquially Lübecker Dom) is a large brick Lutheran cathedral in Lübeck, Germany and part.
Freizeit Thema 5 Kapitel 1 (1)
you: ihr ( familiar plural ) you: du ( familiar singular)
Sentence Structure Connectives
Simple Past The Narrative Past.
Bell Work What countries border Germany?
Process and Impact of Re-Inspection in NRW
Safe but attractive. Bike accessories
Synonyms are two or more words belonging to the same part of speech and possessing one or more identical or nearly identical denotational meanings, interchangeable.
Students have revised SEIN and HABEN for homework
The new online recognition process
Uranus. Uranus is the seventh in terms of distance from the Sun, the third in diameter and the fourth in mass of the planet of the Solar System. It was.
type / function / form type of words:
Official Statistics Web Cartography in Germany − Regional Statistics, Federal and European Elections, Future Activities − Joint Working Party meeting.
INDICATIVE ROADMAP CO-CREATION OUTREACH TRAINING MID 2020
Ich - Projekt Due Monday, September 19..
School supplies.
- moodle – a internet based learning platform
 Präsentation transkript:

1 Paul Bennett, Martin Durrell, Silke Scheible, Jason Whitt The GerManC Project A Representative Corpus of Early Modern German ( )

2 Representative historical corpus of German Aim Facilitation of comparative studies of the development and standardisation of English and German in 17th and 18th centuries Resource needed ARCHER-corpus (also Helsinki Corpus) Model

Representativeness 1. Not complete texts, but extracts of approximately 2000 words (cf. Brown corpora and ARCHER) 2. Nine genres a. Dramas b. Newspapers c. Letters d. Sermons e. Narrative prose f. Journals g. Scholarly texts (humanities) h. Scholarly texts (science & medicine) i. Legal texts 3

Representativeness 3. Periods (cf. Bonn corpus of ENHG) Regions a. North German b. West Central German c. East Central German d. West Upper German (incl.Swiss) e. East Upper German (incl. Austrian) 5. Three extracts of ≥2000 words per genre/period/region = approx. 900,000 words 4

Pilot Project: GerManC One year grant from ESRC: [March March 2007] Team: Paul Bennett, Martin Durrell, Astrid Ensslin Aim: testing corpus design and aims with a single genre, and evaluating and developing a set of analytical tools Newspapers were selected as genre for the pilot 5

Extended GerManC Pilot project completed March Newspaper corpus lodged with Oxford Text Archive (and available on project website) Application for funding the extended corpus approved early 2008, with equal funding from ESRC and AHRC Original design maintained, eight further genres to be added Team: Paul Bennett, Martin Durrell, Silke Scheible, Jason Whitt Work started in September

Digitization Scanning black letter (Fraktur) texts with OCR proved impractical and prone to error All texts keyed in twice and the results compared electronically (“double-keying“) to eliminate mistakes Texts keyed into XML Editor and marked- up according to TEI 5 Lite guidelines Only texts with 2000 words of (more or less) continuous German prose were selected 7

Der in seiner Freyheit vergnügte ALCBIADES (Drama: North German, 1700) 8

Development of tools A program for tokenization A program to recognize orthographic variants A lemmatization program with the ultimate aim of lemmatizing the whole corpus The development of a POS-tagger (on the basis of the Stuttgart-Tübingen Tagset, and based on the TreeTagger) with a view to tagging the complete corpus Developing a program to enable more detailed morphosyntactic tagging of the whole corpus If possible within the time constraints, developing a parser (possibly on the basis of the parser used in York for Old English) and parsing the complete corpus on this basis. 9

10 Case Study I: Changing norms weak adjective inflection “Innerhalb der nach grammatischem Bestimmungswort zu erwartenden indet. Flexion des Nom./Akk.Pl. aller Genera (die klugen Frauen) kommt es zu allen Zeiten des Fnhd. zu einer zwischen -(e) und -(e)n schwankenden Formbildung” Gramm. d. Fnhd. VI, 174

Findings: weak adjective inflection 1 (newspapers) process of standardization weak adjective inflection (Durrell et al. 2008) in nom./acc. pl., e.g. : die gute[n] Kinder (die Gute[n]) e-en-e-en-e-en North German20 (6)6 (5)633 (16)132 (14) West Central45 (18)4 (3)18 (4)10 (5)328 (6) East Central7 (2)18 (11)718 (3)231 (5) West Upper25 (7)6 (3)16 (3)6 (2)16 (3)16 (8) East Upper38 (22)14 (11)24 (3)11 (8)334 (5) Total135 (55)48 (33)71 (10)78 (34)25 (3)141 (38)

12 Genre-dependent variant selection “Die Entwicklung vom späten 16. Jh. bis zur Mitte des 18. Jhs. erweist die Durchsetzung [von -en] als die Verallgemeinerung eines in erster Linie omd. Usus. Die [...] stilschichtliche Distribution bestätigt die Einschätzung bei Hemmer [...], daß -n über literarische Sprachvorbilder übernommen worden ist.” (Gramm d. Fnhd. 176)

Findings: weak adjective inflection 2 (literary genres) Preliminary examples from ‘drama’ and ‘narrative prose’ in new extended corpus e-en-e-en-e-en North German219 (2)7 (2)23 (7)025 (7) West Central219 (7)6 (1)14 (5)68 (1) East Central224 (3)1 (1)23 (4)017 (3) West Upper5003 (1) 21 (5) East Upper211 (2)5 (1)3 (1) 14 (2) Total1373 (14)19 (5)78 (34)12 (2)85 (18)

14 Case Study II: Morphological simplification zween/zwo/zwei-zwey “Bei den Grammatikern ist bis in die 2. Hälfte des 18. Jh. hinein die Genusdifferenzierung aufrechterhalten” (Schottel, Bödiker, Gottsched) “Erst Adelung (a. 1782) gibt ausschließlich die Form zwey für alle Genera”. (Gramm. d. Fnhd. VII, 539)

15 Case Study II: Morphological simplification zween/zwo/zwei-zwey “Am frühesten ist das Neutrum als Einheitsform festgeworden im Niederdeutschen [1303]. Im Ostmitteldeutschen (Obersächsischen und Schlesischen) herrscht es seit der Mitte des 17. Jhs. und drang von dort auch in die Literatursprache” (Schirmunski, Deutsche Mundartkunde, 474).

zweenzwozweizweenzwozweizweenzwozwei North German1288 West Central German East Central German West Upper German 123 East Upper German ‘zwei’ in newspaper corpus

zweenzwozweizweenzwozweizweenzwozwei North German West Central German East Central German West Upper German East Upper German ‘zwei’ in extended corpus to date

Historical/cultural findings Media history Ensslin (2009), ‘”Im Unterhause groß Getöse”: representations of 18th century British parliamentary democracy in Early Modern German newspaper discourse’ The representation of a parliamentary monarchy in 17th & 18th century Germany, with predominantly absolute rulers - but responding to increased interest in Britain ruled by the Hannoverians Initially straightforward factual presentations, concise and apparently objective, though with (intentional?) emphasis on the leading role of the king Later clear tendency towards stigmatization of the raucous ‘debates’ in the House of Commons, with a much more subjective style of presentation and often sensationalist tone 18

Other investigations The development of the würde + Infinitive construction (Smirnova 2006; Durrell 2007) Das Doppelperfekt (Topalović 2007) Evidentiality and text type (Whitt 2008) The general notion of text type/genre/register as it relates to historical corpora 19

Thank you Contacts: Web page: 20

Project publications Martin Durrell, Astrid Ensslin and Paul Bennett, "The GerManC project", In: Sprache und Datenverarbeitung 31 (2007), Martin Durrell, Astrid Ensslin und Paul Bennett, "Zur Standardisierung der Adjektivflexion im Deutschen im 18. Jahrhundert". In: W. Czachur and M. Czyzewska (eds.), Vom Wort zum Text. Studien zur deutschen Sprache und Kultur. Festschrift für Professor Józef Wiktorowicz zum 65. Geburtstag. Warszawa, Instytut Germanistyki Uniwersitetu Warszawskiego, 2008, pp Martin Durrell, Astrid Ensslin und Paul Bennett, "Zeitungen und Sprachausgleich im 17. und 18. Jahrhundert.“ In: Zeitschrift für deutsche Philologie 127 (2008), Sonderheft, pp Ensslin, Astrid (2008), '"Im Unterhause abscheulich groß Getöse". Representations of 18th century British parliamentary democracy in early modern German newspaper discourse and their treatment of borrowings from English'. In: Pfalzgraf, F. & Rash, F. (eds.), "Anglo- German Literary Relations". Bern, etc.: Lang, pp

Other references Durrell, Martin "'Deutsch ist eine würde-lose Sprache'. On the history of a failed prescription". In: Stephan Elspaß, Nils Langer, Joachim Scharloth & Wim Vandenbussche (eds.), Germanic Language Histories 'from Below' ( ). (Studia Linguistica Germanica 86). Berlin & New York: de Gruyter, pp Smirnova, Elena Die Entwicklung der Konstruktion würde + Infinitiv im Deutschen: Eine funktional-semantische Analyse unter besonderer Berücksichtigung sprachhistorischer Aspekte. Berlin & New York: de Gruyter. Topalović, Elvira. To appear. "Perfekt II und Plusquamperfekt II. Zur historischen Kontinuität doppelter Perfektbildungen im Deutschen". In: Claudine Moulin, Fausto Ravida & Nikolaus Ruge (eds.), Sprache in der Stadt. Akten der 25. Tagung des Internationalen Arbeitskreises Historische Stadtsprachenforschung. Luxemburg, Oktober Heidelberg: Winter. Whitt, Richard J Evidentiality and Perception Verbs in English and German: A Corpus-Based Analysis from the Early Modern Period to the Present. Ph.D. Dissertation: The University of California, Berkeley. 22