Hans Uszkoreit Universität des Saarlandes und

Slides:



Advertisements
Ähnliche Präsentationen
Cadastre for the 21st Century – The German Way
Advertisements

PRESENTATION HEADLINE
Vernetzung von Repositorien : DRIVER Guidelines Dr Dale Peters, SUB Goettingen 4. Helmholtz Open Access Workshop Potsdam, 17 Juni 2008.
Finding the Pattern You Need: The Design Pattern Intent Ontology
E-Solutions mySchoeller.com for Felix Schoeller Imaging
H - A - M - L - E - IC T Teachers Acting Patterns while Teaching with New Media in the Subjects German, Mathematics and Computer Science Prof. S. Blömeke,
Managing the Transition from School-to-Work Empirical Findings from a Mentoring Programme in Germany Prof. i.V. Dr. Martin Lang.
R. Zankl – Ch. Oelschlegel – M. Schüler – M. Karg – H. Obermayer R. Gottanka – F. Rösch – P. Keidler – A. Spangler th Expert Meeting Business.
Windows Vista für Entwickler
Herzlich Willkommen zum Informations-Forum: SAP Interoperabilität
Die ZBW ist Mitglied der Leibniz-Gemeinschaft Copyright © ZBW 2010 Seite 1 Potenziale semantischer Technologien für die Bibliothek der Zukunft Klaus Tochtermann.
The difference between kein and nicht.
Steinbeis Forschungsinstitut für solare und zukunftsfähige thermische Energiesysteme Nobelstr. 15 D Stuttgart WP 4 Developing SEC.
Dr. M. Schlottke Common Description of Web Services Source: P. Gerbert, 2002 Web Services allow to establish B2B e-commerce on the fly Web Services allow.
Three minutes presentation I ArbeitsschritteW Seminar I-Prax: Inhaltserschließung visueller Medien, Spree WS 2010/2011 Giving directions.
POST MARKET CLINICAL FOLLOW UP
Introduction to the topic. Goals: Improving the students essay style in general Finding special words and expressions that can be used in essay writing.
Deutsche Gesellschaft für Technische Zusammenarbeit GmbH Integrated Experts as interface between technical cooperation and the private sector – An Example.
virtPresenter „lecture recording framework“
KAE praxis verstehen chancen erkennen zukunft gestalten understanding reality facing challenges creating future Laboratory of Integrated Energy Systems.
Question words and word order By the end of this lesson you will have revised question words By the end of this lesson you will be able to use question.
Laurie Clarcq The purpose of language, used in communication, is to create a picture in the mind and/or the heart of another.
Case Study Session in 9th GCSM: NEGA-Resources-Approach
Machen Sie sich schlau am Beispiel Schizophrenie.
die Zeiten (The Tenses) das Aktiv (Active Voice)
Institut AIFB, Universität Karlsruhe (TH) Forschungsuniversität gegründet 1825 Towards Automatic Composition of Processes based on Semantic.
Sanjay Patil Standards Architect – SAP AG April 2008
| DC-IAP/SVC3 | © Bosch Rexroth Pneumatics GmbH This document, as well as the data, specifications and other information set forth in.
Das Perfekt (Present Perfect Tense). Think of 5 things you did in your holidays but think of sentences in the PRESENT TENSE. 1.Am Montag schlafe ich viel.
Dienstag, den Hausaufgabe für Mittwoch den IA 10.4 and printout LB 10.4 A-C Guten Tag!
VOCABULARY SOCCER QUESTIONS PRESENT CONTINOUS WORDLIST – CELL PHONE Unit 2 - Trainer.
Morphology and Syntax More on sentence structure.
Centre for Public Administration Research E-Government for European Cities Thomas Prorok
BAS5SE | Fachhochschule Hagenberg | Daniel Khan | S SPR5 MVC Plugin Development SPR6P.
Z Corp Customer Examples
Christoph Durt: Wittgenstein on the possibility of philosophy: The importance of an intercultural approach
The free XML Editor for Windows COOKTOP Semistrukturierte Daten 1 Vortrag Semistrukturierte Daten 1 COOKTOP The free XML-Editor for Windows
Your name Bedeutung von Internet- Technologien Gruppe 1 Andreas Feuerstein Philipp Hochratner Christian Weinzinger.
Neno Loje Berater & MVP für Visual Studio ALM und TFS (ehemals VSTS) Hochqualitative Produkte mit Visual Studio & TFS 2010.
3/28/2017 8:11 PM Visual Studio Tools für Office { Rapid Application Development für Office } Jens Häupel Platform Strategy Manager Microsoft Deutschland.
You need to use your mouse to see this presentation © Heidi Behrens.
Department of Computer Science Homepage HTML Preprocessor Perl Database Revision Control System © 1998, Leonhard Jaschke, Institut für Wissenschaftliches.
INTAKT- Interkulturelle Berufsfelderkundungen als ausbildungsbezogene Lerneinheiten in berufsqualifizierenden Auslandspraktika DE/10/LLP-LdV/TOI/
States in the development of a new service During the development a service will pass through a chain of stages determining the service.
Wikipedia – Idee und Geschichte Zur Medienkultur des Netz-Wissens Sommersemester 2007 Christian Kern.
DEUTSCHLAND UND DIE MEDIEN
Guten Morgen!.
Institut für Öffentliche Dienstleistungen und Tourismus Informal learning for regional development Manfred Walser Towards a Knowledge Society: Is Knowledge.
Verben Wiederholung Deutsch III Notizen.
The word,,aber in German is most often used as a coordinating conjunction. Ich wollte nach Bremen fahren aber Mein Auto ist kaputt. Ich mag English aber.
Einführung Bild und Erkenntnis Einige Probleme Fazit Eberhard Karls Universität Tübingen Philosophische Fakultät Institut für Medienwissenschaft Epistemic.
4th Symposium on Lidar Atmospheric Applications
Ein Projekt des Technischen Jugendfreizeit- und Bildungsvereins (tjfbv) e.V. kommunizieren.de Blended Learning for people with disabilities.
Premium AEROTEC S.R.L., Brasov Plant Noul spatiu de joaca tematic al Parcului Central Brasov construit de Premium AEROTEC cu sustinerea Primariei Brasov.
The most obvious or direct use of auch is to mean also. Ich möchte auch Gitarre lernen. Auch ich möchte Gitarre lernen. I would like to learn Guitar. Someone.
1 von 10 ViS:AT Abteilung IT/3, IT – Systeme für Unterrichtszwecke ViS:AT Österreichische Bildung auf Europaniveau BM:UKK Apple.
Heute ist Montag, der 17. September 2012: Lernziel: 1.Diskussion: Sprechen 2.Wie kommst du zur Schule? 3.Leseverständnis (Reading Comprehension) 4.Quiz.
3rd Review, Vienna, 16th of April 1999 SIT-MOON ESPRIT Project Nr Siemens AG Österreich Robotiker Technische Universität Wien Politecnico di Milano.
Can you tell me about your school?
Adjectiv Endungen Lite: Adjective following articles and pre-ceeding nouns. Colors and Clothes.
Sentence Structure Subject and verb are always together. Subject and verb are always together. Subject and verb must agree Subject and verb must agree.
1 Stevens Direct Scaling Methods and the Uniqueness Problem: Empirical Evaluation of an Axiom fundamental to Interval Scale Level.
Selectivity in the German Mobility Panel Tobias Kuhnimhof Institute for Transport Studies, University of Karlsruhe Paris, May 20th, 2005.
KGE Kommunalgrund GmbH Entwicklungsträger der Landeshauptstadt Magdeburg Presentation for the REDIS Project at the Interim Conference.
How to use and facilitate an OptionFinder Audience Response System.
Technische Universität München 1 CADUI' June FUNDP Namur G B I The FUSE-System: an Integrated User Interface Design Environment Frank Lonczewski.
Institut für Nachrichtentechnik U. Reimers Technische Universität Braunschweig The MultiMedia Home Platform (MHP): Hype or Reality ?
© Handwerkskammer für München und Oberbayern, Max-Joseph-Straße 4, München Dietmar Schneider Foreign Trade Department of the Chamber of Trade and.
Literary Machines, zusammengestellt für ::COLLABOR:: von H. Mittendorfer Literary MACHINES 1980 bis 1987, by Theodor Holm NELSON ISBN
Computer Services Business challenge
 Präsentation transkript:

Vorlesung Einführung in die Computerlinguistik Teil 2 Sprachtechnologie Hans Uszkoreit Universität des Saarlandes und Deutsches Forschungszentrum für Künstliche Intelligenz

Überblick Aufgaben und Probleme der Sprachtechnologie Gesprochene Sprache Texttechnologien Ausblick

WAS IST SPRACHTECHNOLOGIE Eigentlich eine Klasse von Technologien in der Informations- technologie, die Wissen über die Struktur des menschlichen Sprachen verwenden, um die maschinelle Verarbeitung der Sprache zu ermöglichen bzw. zu verbessern. Beispiel: Microsoft Word verarbeitet zwar Sprache, enthält aber im Kern nur sehr wenig Sprachtechnologie. Sprachtechnologie steckt aber in der Erkennung von Satzgrenzen für die Formatierung, in der automatischen Silbentrennung, in der Rechtschreibkontrolle und in der Grammatikkontrolle. Nach Meinung der führenden Experten in der Computerindustrie ist die Sprachtechnologie eine Schlüsseltechnologie für den weiteren Fortschritt in der Computertechnik.

Aber der Computer beherrscht die menschliche Sprache nicht! Akzeptanz Das Hauptproblem für die Akzeptanz des Computers ist das Sprachproblem Der Standardanwender beherrscht keine Computersprachen. Der Standardanwender mag Computersprachen nicht. Der Standardanwender will auch keine Computersprachen lernen. Die Sprache, die der Mensch bestens beherrscht, ist seine Muttersprache. Das natürlichste Medium für die unmittelbare Übermittlung von Information ist die gesprochene Sprache. Die wichtigste Klasse von Daten sind Texte. Der Standardanwender verwendet die Maschine zur Produktion von Texten in menschlicher Sprache. Computer tun sich schwer in der Verarbeitung und Verwaltung von Texten Aber der Computer beherrscht die menschliche Sprache nicht!

Textverstehen phonetische Verarbeitung orthographische Verarbeitung akustische Form geschriebene Form phonetische Verarbeitung orthographische Verarbeitung phonetische o. graphemische Repräsentation morphonologische Verarbeitung morphonologische Repräsentation syntaktische Verarbeitung syntaktische Repräsentation semantische Verarbeitung semantische Repräsentation pragmatische Verarbeitung - Wissensverarbeitung Repräsentation der vollen Bedeutung

Diktat phonetische Verarbeitung orthographische Verarbeitung akustische Form geschriebene Form phonetische Verarbeitung orthographische Verarbeitung phonetische o. graphemische Repräsentation morphonologische Verarbeitung morphonologische Repräsentation das Boot auf dem Main oder daß bot auf dem mein syntaktische Verarbeitung syntaktische Repräsentation semantische Verarbeitung semantische Repräsentation pragmatische Verarbeitung - Wissensverarbeitung Repräsentation der vollen Bedeutung

MaschinelleÜbersetzung akustische Form geschriebene Form phonetische Verarbeitung orthographische Verarbeitung phonetische o. graphemische Repräsentation morphonologische Verarbeitung morphonologische Repräsentation syntaktische Verarbeitung syntaktische Repräsentation semantische Verarbeitung semantische Repräsentation pragmatische Verarbeitung - Wissensverarbeitung Repräsentation der vollen Bedeutung

Performanzkriterien efficiency geringer Zeit- und Speicherbedarf accuracy Fähigkeit, linguistisch korrekte Lösungen zu liefern robustness Fähigkeit, mit allen möglichen Eingaben fertigzuwerden coverage größtmögliche Abdeckung der Sprache specificity Fähigkeit, die richtige Analyse zu selegieren

Dimensionen des Problems Das Problem der Sprachbeherrschung ist zu komplex Worterkennung Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang Es gibt viele Anwendungen, die nur begrenzte Sprachbeherrschung benötigen!

Grenzen der Technologie Der Computer wird so bald nicht sprechen und schreiben wie wir KEINE SPRACHBEHERRSCHUNG OHNE ALLGEMEINES WISSEN Wörterbücher und Grammatiken können wir formalisieren Bei der Semantik wird es schon schwerer Dialog kann nur recht einfach modelliert werden Begrenztes Domänenwissen ist machbar Beim Allgemeinwissen und großem Fachwissen hört es auf Wir müssen das Problem begrenzen, um zu vernünftigen Anwendungen zu gelangen

Weder Sprachverstehen noch Sprachproduktion sind bisher gelöst. STAND DER KUNST Weder Sprachverstehen noch Sprachproduktion sind bisher gelöst. Aber: Wir besitzen heute sogenannte flache Verfahren, die zwar kein Verstehen ermöglichen, aber für viele Anwendungen oft völlig ausreichen. flache Ansätze (effizient und robust) statistische Methoden, Mustergrammatiken tiefe Ansätze (präzise und korrekt) linguistische Prinzipien, Constraints oder komplexe Regelwerke

Diese Anwendung ist die bisher meistverkaufte Sprachanwendung Lexikon Rechtschreibkontrolle Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang Diese Anwendung ist die bisher meistverkaufte Sprachanwendung

Hier beginnt das Geschäft erst gerade GRAMMATIKKONTROLLE Lexikon Grammatikkontrolle Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang Hier beginnt das Geschäft erst gerade

Der Bedarf kommt mit der akustischen Spracherkennung EINFACHE ABFRAGESYSTEME Worterkennung Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang Der Bedarf kommt mit der akustischen Spracherkennung

Sprachtiefe Gegenstandsbereich(e) Sprachumfang

Language Technologies Speech Technologies speaker recognition language verification command recognition speech-to-text speech translation spoken dialogue systems text-to-speech concept-to-speech Text Technologies indexing summarization categorization information extraction spell checking grammar checking text translation abstracting written dialogue systems report generation text generation

Probleme Die Sprache hat Seiten, die dem Menschen leichtfallen, dem Computer hingegen schwer. Insbesondere: Ambiguität: viele Wörter und Phrasen haben mehrere Bedeutungen Paraphrasen: es gibt viele Möglichkeiten, das Gleiche auszudrücken Ungenauigkeit: oft ist die Bedeutung von Ausdrücken unscharf

Speaker Identification Language Indentification Speech Verification Speech Technologies Voice Recognition/ Speaker Identification Language Indentification Speech Verification Speech Recognition Voice Modelling Speech Synthesis Speech Production Spoken Dialogue Systems Speech Translation Systems

Speech Technology Applications Voice Control Systems Dication Systems Text-to-Speech Systems Identification and Verification Systems Information Access Spoken Dialogue Systems Speech Translation Systems

Call Center Applications Call Routing Dialogues Simple Information Dialogues Information Access by Call Center Agents Speech-Synthesis of Information for Customer Retrieval of Recorded Calls Text Technologies for Information Retrieval Text Technologies for Information Fusion/reporting

SL (D) S Eigenschaften und Kriterien Vokabular (vorgegeben und erweiterbar) Sprecherabhängigkeit (Trainingsaufwand) Einzelwort vs. kontinuierlich Spontansprache Sprachmodell Initiative Barge-in Bandweite und Eingabegeräte Archivierung

Systeme zum Ausprobieren Nuance's Travel Plan Demo (Fluginformation) Firma: Nuance Corporation (Technologie von SRI International) Tel.: +1 650 847 7427 URL: http://www.nuancecom.com/demos/travel.htm Flugverbindungen zwischen 250 Städten in den USA PureSpeech Travel Planning Firma: PureSpeech Inc. Tel.:+1 617 497 7973 Info zu 850 Reisezielen Noch kein echter Datenbankzugriff Philips Intercity Zugauskunft Firma: Philips Research Laboratories Deutschland (Testsystem) Tel.:+49 241 604020 Schweiz (im Einsatz bei den SBB): Tel.: +41 157 02 22

Language Indentification Information Retrieval Document Categorization Text Technologies Language Indentification Information Retrieval Document Categorization Document Clustering Text Summarization Information Extraction Spell Checking Grammar/Style Checking Abstract Generation Report Generation Text Generation Document Production Written Dialogue Systems Text Translation Systems

Text Technology Applications Spell Checkers Machine-Assisted Human Translation Indicative Machine Translation Grammar Checkers Human Assisted Machine Translation High Quality Text Translation Text Generation Systems

Heutige Suchtechnologie Wort-Index Boolsche Kombinationen verschiedene Indexierungsverfahren eingeschränkte Morphologie Sortierung nach Relevanz Suche in mehreren Sprachen Order your free beer today Lyca Surcha More than 35 terabytes served surch the web for: sweat AND tears  First 10 of 45.677 matches: 1. research in mutlilingual IR an easier... 2. Sir Winston S. Churchill homepage of... 3. Shinjuku Yamabuki B$h$&$3$=;%5! 4. 60s Rock Timeline remember the best...

Suche Sie finden nicht genug! Andere Wortformen der Herzog, des Herzogs, die Herzöge Unter- und Überbegriffe Alfa Romeo Zagato roadster sports car car motor vehicle vehicle Paraphrasen steuerliche Gründe, Steuergründe, steuerliche Erwägungen, steuerliche Überlegungen, fiskalische Erwägungen, um Steuern zu sparen, ...

Suche Nehmen wir an, Sie suchten nach Automobilfirmen und gäben daher der Suchmaschine (z.B. HOTBOT) den Suchbegriff “Automobilfirmen” Im Englischen suchten Sie nach: “automobile companies”

Suche automobile companies 704 Automobilfirmen 55

Suche automobile companies 704 car builders 233 car makers 1846 auto makers 2307 automobile makers 181 car companies 3046 cars companies 14 motor companies 194 auto companies 1345 car manufacturers 3056 motor manufacturers 582 automobile manufacturers 4263 manufacturers of cars 151 manufacturers of autos 15 manufacturers of automobiles 165 manufacturers of motor vehicles 55 Automobilfirmen 55 Autohersteller 320 Autobauer 131 Autoproduzenten 26 Autofabrikant 89 Autofirmen 86 Pkw Hersteller 15 Automobilunternehmen 57 Automobilhersteller 602 Kfz-Hersteller 42 Autounternehmen 9 Automobilkonzerne 83 Unternehmen der Automobilbranche 4 Hersteller von Autos 4 Hersteller von Automobilen 13 Hersteller von Kraftfahrzeugen 3

Suche Sie finden zu viel! Ambiguität deutsch: Zug, Bahn, Leitung, Schalter englisch: terminal, line, engine Polysemie Buch, Schule, printer Eigennamen Personennamen: Maurer, Washington, Chase Ortsbezeichnungen: Essen, Halle, Bismarck

AUFGABEN DES IM Die Aufgabe des Informationsmanagements ist die Verwaltung und Nutzbarmachung von sehr großen Informationsmengen, wie wie sie heute bereits auf dem WWW, in Intranets und in großen Text-Datenbanken finden. Das Netz macht sie erst einmal nur verfügbar. Im Gegensatz zu herkömmlichen Datenbanken ist die Information viel weniger vorstrukturiert (in Sinne der Strukturierung von Computerdaten). Auf der anderen Seiten sind die relevanten inhaltlichen Strukturen natürlich weitaus komplexer. Durch die Digitalisierung von großen Teilen des menschlichen Wissen (z.B. digitale Bibiliotheken, Filmarchive etc.) wird dieses Problem noch zunehmen.

PROBLEME DES INFORMATION MANAGEMENT Distributivität Die Information liegt auf verschiedenen Maschinen Heterogenität Vielzahl von Dokumentformaten Multilingualität Multimedialität (z.B. Sprache, Bilder, Klänge), Multimodalität (z.B. geschr. u. gesprochene Sprache, Filmdateien o. Realzeitübertragungen) Unstrukturiertheit keine einheitliche Klassifikation, keine einheitliche interne Strukturierung. keine einheitliche u. verläßliche Hypertextverknüpfung Redundanz Viele Informationen sind mehrfach vorhanden.

INFORMATION MANAGEMENT Information wird gewonnen kategorisiert gefiltert zusammengeführt strukturiert dem Benutzer zugeführt adäquat präsentiert

SPRACHTECHNOLOGIEN FÜR DAS I&WM Sammeln (gathering) Indizieren (indexing) Kategorisierung (categorization) Gruppierung (clustering) Zusammenfassung (summarization) Informationsextraktion (information extraction) Automatische Verknüpfung (automatic hyperlinking) Datenschürfen (text data mining) Informationsfusion (information fusion) Berichtsgenerierung (report generation) Textübersetzung (text translation)

INFORMATIONSGEWINNUNG Sammeln (gathering) Data Mining auch Text Mining Konversion z.B. Einscannen, OCR, Transkription Agenten z.B. NetBots, WebBots

STRUKTURIERUNG UND SPEICHERUNG Indizieren (indexing) Kategorisierung (categorization) Gruppierung (clustering) Zusammenfassung (summarization)

INFORMATIONSAUFBEREITUNG Informationsextraktion (information extraction) Hyperverknüpfung (hyperlinking) Informationsfusion (information fusion) Trendanalyse (trend analysis) Berichtsgenerierung (report generation)

INFORMATIONSZUGRIFF Suchschlüsselerweiterung (query expansion) Relevanzsortierung (relevance ranking) Dublettenerkennung (redundancy check) thematische Gruppierung (thematic clustering) Erkennung verwandter Information (information association)

PRÄSENTATION Ergebnispräsentation (result presentation) Informationsvisualisierung (information visualization) virtuelle Navigation (virtual navigation)

INFORMATIONSEXTRAKTION Robuste Extraktion von relevanten Begriffen, Phrasen, Aussagen aus Texten. Erfolgsraten (Vollständigkeit und Präzision) hängen von der Aufgabe und vom Gegenstandsbereich ab. Bereits eingesetzt in verschiedenen Anwendungen, z.B. für Firmennamenerkennung, Nachrichtenkategorisierung, Übersichten zu Firmenindikatoren (Umsatz, Gewinn, Kurse) Nachrichtenübersichten zu speziellen Themenbereichen

Informationsextraktion In der IE werden gezielt relevante Informationen aus Texten herausgesucht und strukturiert. Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwal-tungssoftware Store Age vorgestellt.. Die neue Version ermöglicht jetzt auch ... Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3. Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. ...

Informationsextraktion In der IE werden gezielt relevante Informationen aus Texten herausgesucht und strukturiert. Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwal-tungssoftware Store Age vorgestellt.. Die neue Version ermöglicht jetzt auch ... Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3. Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. ...

BEISPIEL: INFORMATIONSEXTRAKTION (2) Firma 96Q4 1996 97Q1 97Q2 97Q3 97Q4 1997 Diff ComSoft 120Mio 110Mio Trade Consult 30 Mio 42,5Mio Z&M 71,0Mio

Crosslingual Information Retrieval (CLIR) multilinguale Schnittstelle für die Navigation multilinguale Suche mehrsprachiges Angebot im Web

MULINEX System

Suchschlüssel-Eingabe

Anfrageassistent

Scope Classical Areas of Computational Linguistics: computational morphology, computational syntax computational semantics computational pragmatics Text Applications of Language Technology: indexing categorization summarization information extraction report generation

Different Goals Classical Goal: understanding and production of text Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. erfolgreichen Lagerverwaltungssoftware Store Age . n Lagerverwaltungssoftware

Different Goals Classical Goal: understanding and production of text Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. erfolgreichen Lagerverwaltungssoftware Store Age . n Lagerverwaltungssoftware

Different Goals Classical Goal: understanding and production of text highly accurate and comprehensive in depth could be used by automatic inferencing but lacking efficiency, robustness, coverage

building an index Different Goals Classical Goal: understanding and production of text Goals of Text Technologies recognition of relevant elements or generation of short passages from DB entries Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. building an index

Trade Consult Umsatzzahlen Different Goals Classical Goal: understanding and production of text Goals of Text Technologies recognition of relevant elements or generation of short passages from DB entries Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. extracting the topic Trade Consult Umsatzzahlen

extracting relations Different Goals Classical Goal: understanding and production of text Goals of Text Technologies recognition of relevant elements or generation of short passages from DB entries Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. extracting relations Firma 96Q4 1996 97Q1 97Q2 97Q3 97Q4 1997 Diff Hahnemann 105 Mio 110Mio Trade Consult 30 Mio 42,5Mio Z&M 12,0Mio 14 Mio

Information Extraction Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden.

IE Result Firma 96Q4 1996 97Q1 97Q2 97Q3 97Q4 1997 Diff ComSoft 120Mio 110Mio Trade Consult 30 Mio 42,5Mio Z&M 71,0Mio

extracting relations Different Goals Classical Goal: understanding and production of text Goals of Text Technologies recognition of relevant elements or generation of short passages from DB entries Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt. Die neue Version ermöglicht jetzt auch die zentrale Verwaltung mehrerer Lager und integriert die Lagerhaltung in das Supply Chain Management auf der Basis von SAP Software. Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3.Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. extracting relations Firma 96Q4 1996 97Q1 97Q2 97Q3 97Q4 1997 Diff ComSoft 120Mio 110Mio Trade Consult 30 Mio 42,5Mio Z&M 71,0 Mio 88,0 Mio

Different Goals Classical Goal: understanding and production of text Goals of Text Technologies recognition of relevant elements or generation of short passages from DB entries robust and efficient support for human inferencing but shallow -- do not get to the contents -- lacking accuracy

Application and Foundation Deep Methods Shallow Methods Application Foundation

Empirical Methodology formal methods algorithmic methods empirical methods availability of large electronic corpora computational tools for handling large sets of data increased computing power means for data interpretation

Statistical Methods symbolic statistical methods methods shallow processing categorization summarization inf. extraction answer extraction shallow parsing PS parsing deep parsing deep parsing with semantic construction deep processing

Statistical Methods symbolic methods hybrid methods statistical shallow processing categorization summarization inf. extraction answer extraction shallow parsing PS parsing deep parsing deep parsing with semantic construction deep processing

Corpus-Based Methods Corpus-based statistical methods are especially relevant for: acquisition of grammar and lexicon acquisition and modelling of soft constraints acquisition and modelling of performance preferences However, we need linguistically interpreted corpora.

Combinig Shallow and Deep Three ways of combining shallow and deep processing: shallow processing as a preprocessor for deep processing deep processing as a servant to shallow processing deep processing techniques are integrated into shallow processing

Information Extraction Instead of extraction - Enrichment of texts by structural information Structuring of information through IE technology Transformation of unstructured text into semi-structured documents Application in document conversion to XML and in XML document authoring

A Continuum from Shallow to Deep Topic Recognition Terminology Recognition Named Entity Recognition Simple Relation Recognition Complex Relation Recognition (template filling) Answer Recognition Information Fusion - Template Merging

Performance Modelling in the Past Coverage large scale HPSG grammar development in several languages lexical work on the morphological and syntactic side first steps towards learning from corpora Robustness robust semantic processing with underspecification work on soft constraints and preferences Efficiency efficient HPSG and DG processing efficiency in semantic processing by ambiguity reduction

deutsch: Zug, Bahn, Leitung, Schalter englisch: terminal, line, engine Weitere Probleme Sie finden zu viel! Ambiguität deutsch: Zug, Bahn, Leitung, Schalter englisch: terminal, line, engine Polysemie Buch, Schule, printer Eigennamen Personennamen: Maurer, Washington, Chase Ortsbezeichnungen: Essen, Halle, Bismarck

Das Web ist mutlilingual Das WWW war anfangs vorherrschend monolingual (1994 - 96% aller WWW Seiten englisch) Nicht-englische Inhalte nehmen schneller zu. (1996 - 91% englisch, heute ca. 85%)

GLOBALIZATION OF THE USER BASE Share of US Web Users 100 90 80 80 70 60 65 50 US Web Users in % Source: Computer Industry Almanac Inc. January 1998 55 40 40 30 20 10 1991 1995 1998 2000

Relevante Faktoren Entwicklung vom Avantgardemedium zum Massenmedium Ausbreitung in neue Regionen (Lateinamerika, Asien, arabische Welt) Digitalisierung großer Bibliotheken in vielen Ländern Rolle des WWW als globaler Handelsplatz Rolle des WWW als Medium für politische Information und Propaganda Zunahme sozialer und kultureller Inhalte Die Zukunft des WWW ist vielsprachig.

Noch mehr Probleme! Andere Schriftsysteme müssen kodiert und dargestellt werden: Chinesisch, Japanisch, Arabisch, Griechisch,... Die Wortbildungsregeln der Sprachen geraten sich ins Gehege: Skat skating Limes lime Sprachübergreifende Ambiguität stört bei der Suche: Brief brief overview Post post messages Porto Porto travel information Haut Haut Barr cute cute girls

Multilingualität als Herausforderung Eine große Chance tut sich auf: Es wird möglich sein, durch das niedergeschriebene Wissen der Menschheit zu navigieren, ohne an der Sprachgrenze stehenbleiben zu müssen. Diese technologische Herausforderung erfordert aber Fortschritte auf den folgenden Gebieten: lexikalische Semantik konzeptuelle Strukturierung Verbesserungen in maschineller Übersetzung

Sprache im WWW Sprache ist nur ein Medium auf dem WWW. Aber unter den verschiedenen Medien hat die Sprache einen besonderen Status. Bücher, Filme, Bilder, Musikstücke und Computerprogramme beschreiben und finden wir am besten mit Sprache. Nur mithilfe der Sprache können wir Wissen strukturieren und sinnvoll vernetzen. Die Sprache ist das Gewebe des World Wide Web

Menschliche Sprache Die Sprache hat Seiten, die dem Menschen leichtfallen, dem Computer hingegen schwer. Insbesondere: Ambiguität: viele Wörter und Phrasen haben mehrere Bedeutungen Paraphrasen: es gibt viele Möglichkeiten, das Gleiche auszudrücken Ungenauigkeit: oft ist die Bedeutung von Ausdrücken unscharf

Maschinelle Übersetzung Die vollautomatische maschinelle Übersetzung (fully automatic maschine translation – FAMT) beliebiger Texte ist heute nicht möglich. Das liegt nicht an der linguistischen Verarbeitung der Texte, sondern am fehlenden Wissen der Maschine über die Inhalte. Für sehr eingeschränkte Gegenstandsbereiche und Textarten können aber brauchbare Übersetzungen geliefert werden. Ansonsten dient die maschinelle Übersetzung heute erfolgreich als Vorstufe für menschliche Übersetzung (machine-assisted human translation – MAHT).

MÜ ist dennoch brauchbar Eine zufriedenstellende automatische Übersetzung beliebiger Texte ist heute also nicht möglich. Aber die Technologie liefert Übersetzungen, die den Leser sehr wohl das Thema und die wesentlichsten Inhalte erkennen lassen. Wir arbeiten mit dem Übersetzungssystem LOGOS. Andere große Übersetzungssysteme (SYSTRAN, METAL) werden ebenfalls für WWW Anwendungen eingesetzt. Die Übersetzungen nennen wir indikative Übersetzungen.

Indikative Übersetzung

Multilinguale Navigation Schnittstelle für die Navigation multilinguale Suche mehrsprachiges Angebot im Web

Konzeptindex car

Konzeptindex car Personenauto Auto Automobil

Konzeptindex car Personenauto Auto Automobil ...Kraftfahrzeuge für Personen...

Konzeptindex automobile auto car car Personenauto Auto Automobil ...Kraftfahrzeuge für Personen...

Konzeptindex automobile auto car car Personenauto Auto Automobil ...Kraftfahrzeuge für Personen... ...location de voitures...

Konzeptindex motor vehicle automobile auto car car truck Personenauto sports car tank truck ...Kraftfahrzeuge für Personen... ...location de voitures...

BenötigteTechnologien: Konzeptindex BenötigteTechnologien: Sprachidentifikation Lexikalische Desambiguierung Flache syntaktische Analysetechniken Aufbau eines phrasalen Index multilinguale Terminologien Paraphrasen-Glossare

Informationsextraktion In der IE werden gezielt relevante Informationen aus Texten herausgesucht und strukturiert. Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt.. Die neue Version ermöglicht jetzt auch ... Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3. Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. ...

Informationsextraktion In der IE werden gezielt relevante Informationen aus Texten herausgesucht und strukturiert. Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt.. Die neue Version ermöglicht jetzt auch ... Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3. Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. ...

Ausgabe in tabellarischer Form Firma 96Q4 1996 97Q1 97Q2 97Q3 97Q4 1997 Diff. ComSoft 120Mio 110Mio Trade Consult 30 Mio 42,5Mio Z&M 71,0Mio

denn die Sprache ist das Gewebe des Wissens. Ausblick Die Strukturierung des digitalen menschlichen Wissens ist eine der großen Herausforderungen des nächsten Jahrhunderts. Die Sprachtechnologie ist eine Schlüsseltechnologie für dieses ehrgeizige Vorhaben, denn die Sprache ist das Gewebe des Wissens.

Ziele Verbesserung der Informationsvernetzung durch automatisches Erstellen, Einfügen und Verwalten von getypten Hyperlinks in WWW-Dokumenten Methode Hyperlinks (Anchors und Targets) werden unabhängig von Dokumenten gespeichert und verwaltet Mithilfe von modernen Methoden der Sprachtechnologie werden in WWW-Dokumente Stellen (Begriffe, Textteile) identifiziert, die einem vordefinierten Anchor entsprechen Mit diesem Anchor wird offline oder online der entsprechende Hyperlink assoziiert und in das Dokument automatisch eingefügt, Der Typ des Target (z.B. Begriffesdefinition, Homepage, Hintergrundinformation, Bildmaterial, etc.) wird bereits in dem Ausgangsdokument markiert

THE ONE-CLICK APPROACH New wireless voice technology introduced Posted at 5:09 PM PT, Feb 8, 1999 By Stephen Lawson, InfoWorld Electric NTT Labs on Monday brought Dick Tracy into the enterprise, introducing a wireless voice and data system that can use a wrist radio at the Demo 99 conference. AirWave technology, demonstrated for the first time in the United States at this week's confe- rence in Indian Wells, Calif., is based on a wireless PBX. Small, handheld phones -- and a wrist radio that looks like an oversized watch -- can be used to make voice calls and exchange data around a building or campus. The handheld phones can be switched to a public cellular mode to become conventional cell phones. Company representatives touted the system as offering higher voice quality than a typical PBX. Airwave is based on NTT's Personal Handyphone System, which is currently deployed by more than 600 users in Japan, according to the company. Modems built in to both devices allow users to plug in a notebook or portable device for dial-up data connections as fast as 64Kbps. Users can exchange files or e-mail, or access a LAN or the Internet. There is no airtime charge for AirWave communications in the building or campus. AirWave systems are scheduled to be available through distribution partners by the end of this year, priced as low as $400 per user. NTT Labs, the research and development arm of NTT Corp., in Tokyo, can be reached at www.nttlabs.com.

THE ONE-CLICK APPROACH New wireless voice technology introduced Posted at 5:09 PM PT, Feb 8, 1999 By Stephen Lawson, InfoWorld Electric NTT Labs on Monday brought Dick Tracy into the enterprise, introducing a wireless voice and data system that can use a wrist radio at the Demo 99 conference. AirWave technology, demonstrated for the first time in the United States at this week's confe- rence in Indian Wells, Calif., is based on a wireless PBX. Small, handheld phones -- and a wrist radio that looks like an oversized watch -- can be used to make voice calls and exchange data around a building or campus. The handheld phones can be switched to a public cellular mode to become conventional cell phones. Company representatives touted the system as offering higher voice quality than a typical PBX. Airwave is based on NTT's Personal Handyphone System, which is currently deployed by more than 600 users in Japan, according to the company. Modems built in to both devices allow users to plug in a notebook or portable device for dial-up data connections as fast as 64Kbps. Users can exchange files or e-mail, or access a LAN or the Internet. There is no airtime charge for AirWave communications in the building or campus. AirWave systems are scheduled to be available through distribution partners by the end of this year, priced as low as $400 per user. NTT Labs, the research and development arm of NTT Corp., in Tokyo, can be reached at www.nttlabs.com.

THE ONE-CLICK APPROACH New wireless voice technology introduced Posted at 5:09 PM PT, Feb 8, 1999 By Stephen Lawson, InfoWorld Electric NTT Labs on Monday brought Dick Tracy into the enterprise, introducing a wireless voice and data system that can use a wrist radio at the Demo 99 conference. AirWave technology, demonstrated for the first time in the United States at this week's confe- rence in Indian Wells, Calif., is based on a wireless PBX. Small, handheld phones -- and a wrist radio that looks like an oversized watch -- can be used to make voice calls and exchange data around a building or campus. The handheld phones can be switched to a public cellular mode to become conventional cell phones. Company representatives touted the system as offering higher voice quality than a typical PBX. Airwave is based on NTT's Personal Handyphone System, which is currently deployed by more than 600 users in Japan, according to the company. Modems built in to both devices allow users to plug in a notebook or portable device for dial-up data connections as fast as 64Kbps. Users can exchange files or e-mail, or access a LAN or the Internet. There is no airtime charge for AirWave communications in the building or campus. AirWave systems are scheduled to be available through distribution partners by the end of this year, priced as low as $400 per user. NTT Labs, the research and development arm of NTT Corp., in Tokyo, can be reached at www.nttlabs.com.

THE ONE-CLICK APPROACH New wireless voice technology introduced Posted at 5:09 PM PT, Feb 8, 1999 By Stephen Lawson, InfoWorld Electric NTT Labs on Monday brought Dick Tracy into the enterprise, introducing a wireless voice and data system that can use a wrist radio at the Demo 99 conference. AirWave technology, demonstrated for the first time in the United States at this week's confe- rence in Indian Wells, Calif., is based on a wireless PBX. Small, handheld phones -- and a wrist radio that looks like an oversized watch -- can be used to make voice calls and exchange data around a building or campus. The handheld phones can be switched to a public cellular mode to become conventional cell phones. Company representatives touted the system as offering higher voice quality than a typical PBX. Airwave is based on NTT's Personal Handyphone System, which is currently deployed by more than 600 users in Japan, according to the company. Modems built in to both devices allow users to plug in a notebook or portable device for dial-up data connections as fast as 64Kbps. Users can exchange files or e-mail, or access a LAN or the Internet. There is no airtime charge for AirWave communications in the building or campus. AirWave systems are scheduled to be available through distribution partners by the end of this year, priced as low as $400 per user. NTT Labs, the research and development arm of NTT Corp., in Tokyo, can be reached at www.nttlabs.com.

THE ONE-CLICK APPROACH New wireless voice technology introduced Posted at 5:09 PM PT, Feb 8, 1999 By Stephen Lawson, InfoWorld Electric NTT Labs on Monday brought Dick Tracy into the enterprise, introducing a wireless voice and data system that can use a wrist radio at the Demo 99 conference. AirWave technology, demonstrated for the first time in the United States at this week's confe- rence in Indian Wells, Calif., is based on a wireless PBX. Small, handheld phones -- and a wrist radio that looks like an oversized watch -- can be used to make voice calls and exchange data around a building or campus. The handheld phones can be switched to a public cellular mode to become conventional cell phones. Company representatives touted the system as offering higher voice quality than a typical PBX. Airwave is based on NTT's Personal Handyphone System, which is currently deployed by more than 600 users in Japan, according to the company. Modems built in to both devices allow users to plug in a notebook or portable device for dial-up data connections as fast as 64Kbps. Users can exchange files or e-mail, or access a LAN or the Internet. There is no airtime charge for AirWave communications in the building or campus. AirWave systems are scheduled to be available through distribution partners by the end of this year, priced as low as $400 per user. NTT Labs, the research and development arm of NTT Corp., in Tokyo, can be reached at www.nttlabs.com. Company Info Homepage Other News Products Indicators Contact Experts Contacts Accounts

Ziele und Vorteile Information kann dichter vernetzt werden, da Links nicht mehr manuell eingefügt werden müssen Verwaltung von Internet-Sites und Linkstrukturen kann wesentlich vereinfacht und automatisiert werden Konsistenz von Vernetzungen wird erhöht, da jeder Link nur an einer Stelle verwaltet werden muss Typisierung von Links führt zu erhöhter Transparenz für den Benutzer, der bereits am Link erkennen kann, wo dieser hinführt. Neue Qualität der Vernetzung führt zu einer neuen Qualität des Informations- und Wissensmanagement

Umfeld und Einbettung Internet-Applikationen, Informations- und Wissensmanagement sind zentrale Kompetenzen des DFKI LT-Labs Die Entwicklung innovative Merkmale und Funktionalitäten ist für die Bewahrung der führenden Position im Bereich mehrsprachige Internetportale, Suchmaschinen und Informationssysteme essentiell Beispielanwendungen: HYPERCODE (Dresdner Bank) Dichte automatische Vernetzung von Programmcode und Dokumentation MIETTA (Mehrsprachiges WWW-Tourismus-Informationsystem) Automatische Vernetzung von Tourismusinformation Beispiel: Ortsnamen etc. in WWW-Dokumenten werden automatisch mit Homepages der Kommunen verbunden

DAS PROBLEM Lexikon Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang

DAS PROBLEM Lexikon Rechtschreibkontrolle Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang

DAS PROBLEM Informationsextraktion Lexikon Morphologie Syntax Sprachtiefe Semantik Pragmatik Wissensverarbeitung Gegenstandsbereich(e) Sprachumfang

MaschinelleÜbersetzung akustische Form geschriebene Form phonetische Verarbeitung orthographische Verarbeitung phonetische o. graphemische Repräsentation morphonologische Verarbeitung morphonologische Repräsentation syntaktische Verarbeitung syntaktische Repräsentation semantische Verarbeitung semantische Repräsentation pragmatische Verarbeitung - Wissensverarbeitung Repräsentation der vollen Bedeutung

Systeme in der Anwendung Systran (Systran, EU) Metal/Comprendium (Siemens, Sietec, L&H, SAIL Labs) Logos (Logos, Global Words) Personal Translator (IBM)

Maschinelle Übersetzung Die vollautomatische maschinelle Übersetzung (fully automatic maschine translation – FAMT) beliebiger Texte ist heute nicht möglich. Das liegt nicht an der linguistischen Verarbeitung der Texte, sondern am fehlenden Wissen der Maschine über die Inhalte. Für sehr eingeschränkte Gegenstandsbereiche und Textarten können aber brauchbare Übersetzungen geliefert werden. Ansonsten dient die maschinelle Übersetzung heute erfolgreich als Vorstufe für menschliche Übersetzung (machine-assisted human translation – MAHT).

FAMT MACHINE TRANSLATION TODAY quality perfect ready-to-use understandable FAMT coverage of subject domains and text sorts

FAMT MACHINE TRANSLATION TODAY MAHT quality perfect ready-to-use understandable FAMT coverage of subject domains and text sorts

MACHINE TRANSLATION TODAY quality perfect ready-to-use understandable CONTROLLED LANGUAGE MT coverage of subject domains and text sorts

FAMT MACHINE TRANSLATION TODAY quality perfect ready-to-use understandable indicative FAMT coverage of subject domains and text sorts

CONCEPT INDEX car

CONCEPT INDEX car Personenauto Auto Automobil

CONCEPT INDEX car Personenauto Auto Automobil ...Kraftfahrzeuge für Personen...

CONCEPT INDEX automobile auto car car Personenauto Auto Automobil ...Kraftfahrzeuge für Personen...

CONCEPT INDEX automobile auto car car Personenauto Auto Automobil ...Kraftfahrzeuge für Personen... ...location de voitures...

CONCEPT INDEX motor vehicle automobile auto car car truck Personenauto sports car tank truck ...Kraftfahrzeuge für Personen... ...location de voitures...

LANGUAGE TECHNOLOGIES Text Technologies Speech Technologies

LANGUAGE TECHNOLOGIES gathering indexing categorization clustering summarization Text Technologies Speech Technologies

LANGUAGE TECHNOLOGIES text understanding text translation information extraction report generation Text Technologies Speech Technologies

LANGUAGE TECHNOLOGIES Voice Recognition Speech Verification Speech Recognition Voice Modelling Speech Synthesis Speaker Identification Language Indentification Text Technologies Speech Technologies

LANGUAGE TECHNOLOGIES Speech Generation Speech Unterstanding Spoken Dialogue Systems Speech Translation Systems Text Technologies Speech Technologies

LANGUAGE TECHNOLOGIES language understanding language generation dialogue modelling machine translation Text Technologies Speech Technologies

LANGUAGE TECHNOLOGIES gathering indexing categorization clustering summarization Text Technologies Speech Technologies

ACQUISITION Scanning Collecting by Email and Push Services Gathering from the Net Sound Recordings

CATEGORIZATION Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the no nlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation tying count of the original complaint. Instead, it said it wants to investigate developments in the industry since the trial concluded and evaluate whether additional conduct-related provisions are necessary, especially in the absence of a breakup. In a statement issued Thursday morning, the Justice Department said it had taken these positions in an effort to "streamline the case with the goal of securing an effective remedy as quickly as possible.“ Instead of a breakup, the Justice Department said it will ask that Microsoft have certain restrictions placed on its conduct modeled on those the original trial judge imposed on the company in June 2000 but were postponed pending the appeal. In his original order, Judge Jackson imposed a series of restrictions on Microsoft's business practices which were to be effective as the companymoved to split its business in two. Among the conduct remedies Judge Jackson originally imposed were: prohibiting Microsoft from punishing hardware and software companies working on competing products; prohibiting it from favoring computer companies and software developers that helped Microsoft exclude competitors; makers under uniform prices and terms according to a publicly available schedule; and barring Microsoft from interfering with the way PC makers set up startup screens, this Windows desktop preferences, and Internet connection wizards. Since the appeals court first handed down its ruling in the case, Microsoft repeatedly has expressed its Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation tying count of the original complaint. Instead, it said it wants to investigate developments in the industry since the trial concluded and evaluate whether additional conduct-related provisions are necessary, especially in the absence of a breakup. In a statement issued Thursday morning, the Justice Department said it had taken these positions in an effort to "streamline the case with the goal of securing an effective remedy as quickly as possible.“ Instead of a breakup, the Justice Department said it will ask that Microsoft have certain restrictions placed on its conduct modeled on those the original trial judge imposed on the company in June 2000 but were postponed pending the appeal. In his original order, Judge Jackson imposed a series of restrictions on Microsoft's business practices which were to be effective as the companymoved to split its business in two. Among the conduct remedies Judge Jackson originally imposed were: prohibiting Microsoft from punishing hardware and software companies working on competing products; prohibiting it from favoring computer companies and software developers that helped Microsoft exclude competitors; makers under uniform prices and terms according to a publicly available schedule; and barring Microsoft from interfering with the way PC makers set up startup screens, this Windows desktop preferences, and Internet connection wizards. Since the appeals court first handed down its ruling in the case, Microsoft repeatedly has expressed its Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation tying count of the original complaint. Instead, it said it wants to investigate developments in the industry since the trial concluded and evaluate whether additional conduct-related provisions are necessary, especially in the absence of a breakup. In a statement issued Thursday morning, the Justice Department said it had taken these positions in an effort to "streamline the case with the goal of securing an effective remedy as quickly as possible.“ Instead of a breakup, the Justice Department said it will ask that Microsoft have certain restrictions placed on its conduct modeled on those the original trial judge imposed on the company in June 2000 but were postponed pending the appeal. In his original order, Judge Jackson imposed a series of restrictions on Microsoft's business practices which were to be effective as the companymoved to split its business in two. Among the conduct remedies Judge Jackson originally imposed were: prohibiting Microsoft from punishing hardware and software companies working on competing products; prohibiting it from favoring computer companies and software developers that helped Microsoft exclude competitors; makers under uniform prices and terms according to a publicly available schedule; and barring Microsoft from interfering with the way PC makers set up startup screens, this Windows desktop preferences, and Internet connection wizards. Since the appeals court first handed down its ruling in the case, Microsoft repeatedly has expressed its Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation tying count of the original complaint. Instead, it said it wants to investigate developments in the industry since the trial concluded and evaluate whether additional conduct-related provisions are necessary, especially in the absence of a breakup. In a statement issued Thursday morning, the Justice Department said it had taken these positions in an effort to "streamline the case with the goal of securing an effective remedy as quickly as possible.“ Instead of a breakup, the Justice Department said it will ask that Microsoft have certain restrictions placed on its conduct modeled on those the original trial judge imposed on the company in June 2000 but were postponed pending the appeal. In his original order, Judge Jackson imposed a series of restrictions on Microsoft's business practices which were to be effective as the companymoved to split its business in two. Among the conduct remedies Judge Jackson originally imposed were: prohibiting Microsoft from punishing hardware and software companies working on competing products; prohibiting it from favoring computer companies and software developers that helped Microsoft exclude competitors; makers under uniform prices and terms according to a publicly available schedule; and barring Microsoft from interfering with the way PC makers set up startup screens, this Windows desktop preferences, and Internet connection wizards. Since the appeals court first handed down its ruling in the case, Microsoft repeatedly has expressed its Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypertext, in computer science, a metaphor for presenting information in which text, images, sounds, and actions become linked together in a complex, nonsequential web of associations that permit the user to browse through related topics, regardless of the presented order of the topics. These links are often established both by the author of a hypertext document and by the user, depending on the intent of the hypertext document. For example, traveling among the links to the word iron in an article might lead the user to the periodic table of the elements or a map of the migration of metallurgy in Iron Age Europe. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypertext, in computer science, a metaphor for presenting information in which text, images, sounds, and actions become linked together in a complex, nonsequential web of associations that permit the user to browse through related topics, regardless of the presented order of the topics. These links are often established both by the author of a hypertext document and by the user, depending on the intent of the hypertext document. For example, traveling among the links to the word iron in an article might lead the user to the periodic table of the elements or a map of the migration of metallurgy in Iron Age Europe. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypertext, in computer science, a metaphor for presenting information in which text, images, sounds, and actions become linked together in a complex, nonsequential web of associations that permit the user to browse through related topics, regardless of the presented order of the topics. These links are often established both by the author of a hypertext document and by the user, depending on the intent of the hypertext document. For example, traveling among the links to the word iron in an article might lead the user to the periodic table of the elements or a map of the migration of metallurgy in Iron Age Europe. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation

INDEXING Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Hypertext, in computer science, a metaphor for presenting information in which text, images, sounds, and actions become linked together in a complex, nonsequential web of associations that permit the user to browse through related topics, regardless of the presented order of the topics. These links are often established both by the author of a hypertext document and by the user, depending on the intent of the hypertext document. For example, traveling among the links to the word iron in an article might lead the user to the periodic table of the elements or a map of the migration of metallurgy in Iron Age Europe. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. tying count of the original complaint. Instead, it said it wants to investigate developments in the industry since the trial concluded and evaluate whether additional conduct-related provisions are necessary, especially in the absence of a breakup. In a statement issued Thursday morning, the Justice Department said it had taken these positions in an effort to "streamline the case with the goal of securing an effective remedy as quickly as possible.“ Instead of a breakup, the Justice Department said it will ask that Microsoft have certain restrictions placed on its conduct modeled on those the original trial judge imposed on the company in June 2000 but were postponed pending the appeal. In his original order, Judge Jackson imposed a series of restrictions on Microsoft's business practices which were to be effective as the companymoved to split its business in two. Among the conduct remedies Judge Jackson originally imposed were: prohibiting Microsoft from punishing hardware and software companies working on competing products; prohibiting it from favoring computer companies and software developers that helped Microsoft exclude competitors; makers under uniform prices and terms according to a publicly available schedule; and barring Microsoft from interfering with the way PC makers set up startup screens, this Windows desktop preferences, and Internet connection wizards. Since the appeals court first handed down its ruling in the case, Microsoft repeatedly has expressed its Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation Dokument DB

SUMMARIZATION No Microsoft breakup:... September 6, 2001: 4:39 p.m. ET Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Copyright (c) 1993 Funk & Wagnall's Corporation September 6, 2001: 4:39 p.m. ET No Microsoft breakup:... The U.S. Justice Department said Thursday it will not ask that Microsoft be broken in two... The U.S. Court of Appeals for the District of Columbia in late June had overturned a lower court's order.... ... it upheld the lower court's conclusion that Microsoft has a monopoly in the market for computer operating systems and maintains that monopoly power Hypermedia, in computer science, the integration of graphics, sound, video, or any combination into a primarily associative system of information storage and retrieval. Hypermedia, especially in an interactive format where choices are controlled by the user, is structured around the idea of offering a working and learning environment that parallels human thinking˘that is, an environment that allows the user to make associations between topics rather than move sequentially from one to the next, as in an alphabetic list. The term hypertext was coined in 1965 by Ted Nelson to describe documents, as presented by a Computer, that express the nonlinear structure of ideas, as opposed to the linear format of books, film, and speech. The term hypermedia, more recently introduced, is nearly synonymous but emphasizes the nontextual components of hypertext, such as animation, recorded sound, and video. Hypermedia topics are thus linked in a manner that allows the user to jump from subject to related subject in searching for information. For example, a hypermedia presentation on navigation might include links to such topics as astronomy, bird migration, geography, satellites, and radar. If the information is primarily in text form, the product is hypertext; if video, music, animation, or other elements are included, the product is hypermedia. Microsoft (R) Encarta. Copyright (c) 1993 Microsoft Corporation. Corporation tying count of the original complaint. Instead, it said it wants to investigate developments in the industry since the trial concluded and evaluate whether additional conduct-related provisions are necessary, especially in the absence of a breakup. In a statement issued Thursday morning, the Justice Department said it had taken these positions in an effort to "streamline the case with the goal of securing an effective remedy as quickly as possible.“ Instead of a breakup, the Justice Department said it will ask that Microsoft have certain restrictions placed on its conduct modeled on those the original trial judge imposed on the company in June 2000 but were postponed pending the appeal. In his original order, Judge Jackson imposed a series of restrictions on Microsoft's business practices which were to be effective as the companymoved to split its business in two. Among the conduct remedies Judge Jackson originally imposed were: prohibiting Microsoft from punishing hardware and software companies working on competing products; prohibiting it from favoring computer companies and software developers that helped Microsoft exclude competitors; makers under uniform prices and terms according to a publicly available schedule; and barring Microsoft from interfering with the way PC makers set up startup screens, this Windows desktop preferences, and Internet connection wizards. Since the appeals court first handed down its ruling in the case, Microsoft repeatedly has expressed its

INFORMATION EXTRACTION proper names: persons, companies, places... special expressions: dates, prices, percentages simple relations: company - location, product - price complex relations: accident affected parties cause time place damage answers to questions: Where is the headquarter of IBM?

INFORMATION EXTRACTION Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt... Die neue Version ermöglicht jetzt auch ... Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3. Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. ...

INFORMATION EXTRACTION Bremen, 14. 10. 1997, wiwo: Lagersoftware weiter im Aufwind Die Bremer Firma Trade Consult hat auf einer Pressekonferenz in Hannover die Version 2.0 ihrer erfolgreichen Lagerverwaltungssoftware Store Age vorgestellt... Die neue Version ermöglicht jetzt auch ... Auf der Pressekonferenz gab Geschäftsführer Franz Merleback auch die Umsatzzahlen der Softwareschmiede für das 3. Quartal bekannt. Wurden im zweiten Quartal bereits über 30 Millionen Mark umgesetzt, so konnte Merleback jetzt das stolze Ergebnis von 42,5 Millionen verkünden. ...

IE RESULT Firma 96Q4 1996 97Q1 97Q2 97Q3 97Q4 1997 Diff ComSoft 120Mio 110Mio -10 Mio Trade Consult 30 Mio 42,5Mio 12,5 Mio Z&M 71,0Mio