The Automatic Generation of Formal Annotations in a MultiMedia Indexing and Searching Environment Thierry Declerck, Peter Wittenburg and Hamish Cunningham.

Slides:



Advertisements
Ähnliche Präsentationen
Cadastre for the 21st Century – The German Way
Advertisements

PSI and Competition The General Framework
Finding the Pattern You Need: The Design Pattern Intent Ontology
E-Solutions mySchoeller.com for Felix Schoeller Imaging
H - A - M - L - E - IC T Teachers Acting Patterns while Teaching with New Media in the Subjects German, Mathematics and Computer Science Prof. S. Blömeke,
R. Zankl – Ch. Oelschlegel – M. Schüler – M. Karg – H. Obermayer R. Gottanka – F. Rösch – P. Keidler – A. Spangler th Expert Meeting Business.
Time Recording SAP Best Practices Baseline package
Time Entry with Funds or Grants Management (982)
Internal Maintenance SAP Best Practices Baseline Package
Scenario Overview – 1 Purpose and Benefits: Purpose Benefits
Scenario Overview – 1 Purpose and Benefits: Purpose Benefits
Die ZBW ist Mitglied der Leibniz-Gemeinschaft Copyright © ZBW 2010 Seite 1 Potenziale semantischer Technologien für die Bibliothek der Zukunft Klaus Tochtermann.
© 2006 Open Grid Forum OGF26 - Chapel Hill, May 2009 Addressing Metadata Challenges OGF Digital Repositories RG.
Steinbeis Forschungsinstitut für solare und zukunftsfähige thermische Energiesysteme Nobelstr. 15 D Stuttgart WP 4 Developing SEC.
Institut für Softwaresysteme in Wirtschaft, Umwelt und Verwaltung Folie 1 DER UMWELT CAMPUS BIRKENFELD ISS Institut für Softwaresysteme in Wirtschaft,
Fakultät für informatik informatik 12 technische universität dortmund Optimizations Peter Marwedel TU Dortmund Informatik 12 Germany 2009/01/17 Graphics:
Peter Marwedel TU Dortmund, Informatik 12
Das Integrierte EU-Projekt ORCHESTRA - Konzeption einer offenen Dienstearchitektur im Kontext der INSPIRE-Initiative Ulrich Bügel, Thomas Usländer, Fraunhofer.
Institut für Verkehrsführung und Fahrzeugsteuerung > Technologien aus Luft- und Raumfahrt für Straße und Schiene Automatic Maneuver Recognition in the.
Insulin pump therapy in adults allows metabolic control at lower rates of hypoglycemia along with reduced insulin doses – results from the nationwide DPV-survey.
Introduction to the topic. Goals: Improving the students essay style in general Finding special words and expressions that can be used in essay writing.
Deutsche Gesellschaft für Technische Zusammenarbeit GmbH Integrated Experts as interface between technical cooperation and the private sector – An Example.
Methods Fuzzy- Logic enables the modeling of rule based knowledge by the use of fuzzy criteria instead of exact measurement values or threshold values.
virtPresenter „lecture recording framework“
Haben – to have ich habe du hast er/sie hat es hat man hat wir haben
“Wintergedicht” von Fallersleben ( )
Laurie Clarcq The purpose of language, used in communication, is to create a picture in the mind and/or the heart of another.
Case Study Session in 9th GCSM: NEGA-Resources-Approach
Deutsch I Kapitel 3 Test Review. A. Conjugation of to have haben ich du er,sie,es wir, Sie,sie ihr habe hast hat haben habt.
Machen Sie sich schlau am Beispiel Schizophrenie.
die Zeiten (The Tenses) das Aktiv (Active Voice)
Schweizerische Landesbibliothek ODOK05 - Workshop der VÖB-Kommission für Sacherschliessung Bozen, September 13, 2005 Cross-Language Access to Information.
Institut AIFB, Universität Karlsruhe (TH) Forschungsuniversität gegründet 1825 Towards Automatic Composition of Processes based on Semantic.
| DC-IAP/SVC3 | © Bosch Rexroth Pneumatics GmbH This document, as well as the data, specifications and other information set forth in.
VOCABULARY SOCCER QUESTIONS PRESENT CONTINOUS WORDLIST – CELL PHONE Unit 2 - Trainer.
BAS5SE | Fachhochschule Hagenberg | Daniel Khan | S SPR5 MVC Plugin Development SPR6P.
Deutsch 1 G Stunde. Donnerstag, der 25. Oktober 2012 Deutsch 1, G Stunde Heute ist ein B- Tag Unit: Family & homeFamilie & Zuhause Objectives: Phrases.
Deutsch 1 G Stunde. Donnerstag, der 18. Oktober 2012 Deutsch 1, G Stunde Heute ist ein E- Tag Unit: Family & homeFamilie & Zuhause Objectives: Phrases.
Deutsch 1 G Stunde. Montag, der 10. September 2012 Deutsch 1 (G Stunde)Heute ist ein D - Tag Unit: Introduction to German & Germany Objectives: Introducing.
INTAKT- Interkulturelle Berufsfelderkundungen als ausbildungsbezogene Lerneinheiten in berufsqualifizierenden Auslandspraktika DE/10/LLP-LdV/TOI/
XML IV: Cocoon 2.
DEUTSCHLAND UND DIE MEDIEN
Institut für Umweltphysik/Fernerkundung Physik/Elektrotechnik Fachbereich 1 K. Bramstedt, L. Amekudzi, J. Meyer IFE/IUP Tangent heights in occultation.
Verben Wiederholung Deutsch III Notizen.
Einführung Bild und Erkenntnis Einige Probleme Fazit Eberhard Karls Universität Tübingen Philosophische Fakultät Institut für Medienwissenschaft Epistemic.
Ein Projekt des Technischen Jugendfreizeit- und Bildungsvereins (tjfbv) e.V. kommunizieren.de Blended Learning for people with disabilities.
Talking about yourself
Demonstration of Performance of CASCOT 5.0
External Labels – The rules For all external labels the following rules apply (external labels are all labels which are not inside of a shape) - all labels.
Feste und Feiertage Treffpunkt Deutsch Sixth Edition.
© Boardworks Ltd of 8 Time Manner Place © Boardworks Ltd of 8 This icon indicates that the slide contains activities created in Flash. These.
Heute ist Montag, der 17. September 2012: Lernziel: 1.Diskussion: Sprechen 2.Wie kommst du zur Schule? 3.Leseverständnis (Reading Comprehension) 4.Quiz.
The Generation of formal Annotations The Role of Information Extraction in MUMIS.
3rd Review, Vienna, 16th of April 1999 SIT-MOON ESPRIT Project Nr Siemens AG Österreich Robotiker Technische Universität Wien Politecnico di Milano.
SIT-MOON ESPRIT Project Nr st Review, Brussels, 27th of April 1998 slide 1 Siemens AG Österreich Robotiker Technische Universität Wien Politecnico.
L.O. Der Schultag(1) Dienstag, den 04. März 2010
Sentence Structure Subject and verb are always together. Subject and verb are always together. Subject and verb must agree Subject and verb must agree.
KIT – die Kooperation von Forschungszentrum Karlsruhe GmbH und Universität Karlsruhe (TH) Vorlesung Knowledge Discovery - Institut AIFB Tempus fugit Towards.
German Word Order explained!
1 Intern | ST-IN/PRM-EU | | © Robert Bosch GmbH Alle Rechte vorbehalten, auch bzgl. jeder Verfügung, Verwertung, Reproduktion, Bearbeitung,
Forschungszentrum Karlsruhe Technik und Umwelt IRS /FzK W.M.Schikorr EUROTRANS DM1 Safety Bruxells, 17 March Transients to be Analysed for the LBE–cooled.
1 Stevens Direct Scaling Methods and the Uniqueness Problem: Empirical Evaluation of an Axiom fundamental to Interval Scale Level.
Honing Listening Skills with Audio & Video Deborah McGee Mifflin Johns Hopkins University GWATFL / MFLA Fall 2010 Conference Saturday,
Component 4 Frankfurt (Oder) & Slubice Seite 1 COMPONENT 4: – in Frankfurt (Oder) & Slubice 1.All participants presented themselves.
Lehrstuhl für Waldbau, Technische Universität MünchenBudapest, 10./11. December 2006 WP 1 Status (TUM) Bernhard Felbermeier.
Technische Universität München 1 CADUI' June FUNDP Namur G B I The FUSE-System: an Integrated User Interface Design Environment Frank Lonczewski.
TUM in CrossGrid Role and Contribution Fakultät für Informatik der Technischen Universität München Informatik X: Rechnertechnik und Rechnerorganisation.
Institut für Nachrichtentechnik U. Reimers Technische Universität Braunschweig The MultiMedia Home Platform (MHP): Hype or Reality ?
Lernziel: I will be able to express where I live and ask where someone else lives. Hausaufgabenkontrolle Neue Vokabeln Eating out in Germany.
Kapitel 2 Grammar INDEX 1.Subjects & Verbs 2.Conjugation of Verbs 3.Subject Verb Agreement 4.Person and Number 5.Present Tense 6.Word Order: Position of.
 Präsentation transkript:

The Automatic Generation of Formal Annotations in a MultiMedia Indexing and Searching Environment Thierry Declerck, Peter Wittenburg and Hamish Cunningham DFKI GmbH, Max-Planck-Institut für Psycholinguistik and University of Sheffield ACL/EACL2001 Workshop on Human Language Technology and Knowledge Management

The MUMIS Consortium CTITUniversity of Twente, Enschede, NLNLP/IE TSI University of Nijmegen, Nijmegen, NLASR DFKISaarbrücken, DNLP/IE MPI Nijmegen, NLOnline SW DCSUniversity of Sheffield, UKNLP/IE ESTEAMGothenburg, SE (location Athens, GR)Translation Software VDAHilversum, NL Dissemination

Objectives Technology development to automatically index (with formal annotations) lengthy multimedia recordings (off-line process) Find and annotate relevant events, together with the involved entities and relations Technology development to exploit indexed multimedia archives (on-line process) Search for interesting scenes and play them via Internet Test Domain: Soccer Games / UEFA Tournament 2000

Off-line Task Automatic Speech Recognition (Radio/TV Broadcasts) Automatically transforms the speech signals into texts (for 3 languages Dutch, English and German) Natural Language Processing (Information Extraction) Analyse all available textual documents (newspapers, speech transcripts, tickers, formal texts...), identify and extract interesting entities, relations and events Merging all the annotations produced so far Create a database with formal annotations

The Generation of Formal Annotations Metadata (type of game, teams, date, final score, players etc.), as they can be used a.o. for classifying and filtering videos in the MM digital archive Events (particular actions with time codes, involved entities and related events), as they can be extracted from the video sequences All Formal Annotations available in XML Standard

The Event Table Related to domain ontology and multilingual terminology. Guiding the generation of formal annotations Final whistle#90>t>120Subj=referee, score etc…Final score Goal kick#0>t>120Subj=pl, loc=loc, cons=cons,.. Dribbling#0>t>120Subj=pl, loc=loc, … Substitution#0>t>120Subj=pl, I.obj=pl, cause=c, …Team (adding pl) Red Card#0>t>120Subj=ref, I.obj=pl, cause=c, …Team (red at t) Goal#0>t>=penSubj=pl, I.obj=team, score=s,Order of goal … EventIDTimeSubcat/ModificationMetadata

Off-line Task Events indexed in video recording 1:0 60 m25 m SchollBasler CampbellMatthäusBaslerNeville DribblingFreekickFoul 28min24 min18 min17 min DefensePassGoalFreekick Radio Commenting 3 Languages Radio Commenting 3 Languages Radio Commenting 3 Languages Audio Commenting (TV, Radio) 3 Languages Newspaper Text Newspaper Text Newspaper Text Newspaper Texts 3 Languages Newspaper Text Newspaper Text Newspaper Text Close caption 3 Languages multilingual IE => event tables Merging of Annotations Event = goal Player = Basler Dist. = 25 m Time = 18 Score = 1:0 Event = goal Type = Freekick Player = Basler Dist. = 25 m Time = 17 Score: leading Event = goal Player= Basler Team = Germany Time = 18 Score = 1:0 Finalscore = 1:0 Event = goal Type = Freekick Player = Basler Team = Germany Time = 18 Score = 1:0 Final score = 1:0 Distance = 25 m

Information Extraction IE is generally subdivided in following tasks: - Named Entity task (NE) - Template Element task (TE) - Template Relation task (TR) - Scenario Template task (ST) - Co-reference task (CO)

The Role of IE in MUMIS Information Extraction (IE) is the task of identifying, collecting and normalizing relevant information for a specific application or user. The relevant information is typically represented in form of predefined templates, which are filled by means of Natural Language (NL) analysis (Template = Event Table in MUMIS) IE combines pattern matching mechanisms, (shallow) NLP and domain knowledge (terminology and ontology).

Extension of our IE system in MUMIS Multilingual and multisource IE. Incremental information building Cross-document co-reference resolution Combine Metadata and event extraction => better organisation and dynamic updating of information (KM) Multiple presentation of results: Template, Event table and Hyperlinks (Named Entities, rel. to KM)

Subtasks of IE Named Entity task (NE): Mark into the text each string that represents, a person, organization, or location name, or a date or time, or a currency or percentage figure. Template Element task (TE): Extract basic information related to organization, person, and artifact entities, drawing evidence from everywhere in the text.

Subtasks of IE (2) Template Relation task (TR): Extract relational information on employee_of, manufacture_of, location_of relations etc. (TR expresses domain- independent relationships). Scenario Template task (ST): Extract pre-specified event information and relate the event information to particular organization, person, or artifact entities (ST identifies domain and task specific entities and relations). Co-reference task (CO): Capture information on co- referring expressions, i.e. all mentions of a given entity, including those marked in NE and TE.

IE applied to soccer Terms as descriptors for the NE task Team: Titelverteidiger Brasilien, den respektlosen Außenseiter Schottland Player:Superstar Ronaldo, von Bewacher Calderwood noch von Abwehrchef Hendry, von Jackson als drittem Stürmer, Torschütze Cesar, von Roberto Carlos (16.), Referee: vom spanischen Schiedsrichter Garcia Aranda Trainer: Schottlands Trainer Brown, Kapitän Hendry seinen Keeper Leighton Location: im Stade de France von St. Denis (more fine-grained location detection would be: Stadion: im Stade de France and City: von St. Denis ) Attendance: Vor Zuschauern

IE applied to soccer (2) Terms for NE Task Time: in der 73. Minute, nach gerade einmal 3:50 Minuten, von Roberto Carlos (16.), nach einer knappen halben Stunde, scheiterte Rivaldo (49./52.) jeweils nur knapp, das vor der Pause Versäumte versuchten die Brasilianer nach Wiederbeginn,... Date: am Mittwoch, der Turnierstart (?), im WM-Eröffnungsspiel (?) Score/Result: Brasilien besiegt Schottland 2:1, einen 2:1 (1:1)- Sieg, der zwischenzeitliche Ausgleich, in der 4. Minute in Führung gebracht, köpfte zum 1:0 ein

IE applied to soccer (3) Relations for TR Task Opponents: Brasilien besiegt Schottland, feierte der Top-Favorit... einen glücklichen 2:1 (1:1)-Sieg über den respektlosen Außenseiter Schottland, Player_of: hatte Cesar Sampaio den vierfachen Weltmeister... in Führung gebracht, Collins gelang... der zwischenzeitliche Ausgleich für die Schotten, der Keeper des FC Aberdeen, Brasiliens Keeper Taffarel Trainer_of: Schottlands Trainer Brown...

IE applied to soccer (4) Events for ST task : Goal: in der 4. Minute in Führung gebracht, das schnellste Tor... markiert, Cesar Sampaio köpfte zum 1:0 ein, Collins (38.) verwandelte den Strafstoß, hätte Kapitän Hendry seinen Keeper Leighton um ein Haar zum zweiten Mal bezwungen, von dem der Ball ins Tor prallte Foul: als er den durchlaufenden Gallacher im Strafraum allzu energisch am Trikot zog Substitution: und mußte in der 59. Minute für Crespo Platz machen...

IE applied to soccer (5) Description of the Templates: Team team-template TACTIC [ ] SCORE [ ] NAME [ ] PLAYER [ ] TRAINER [ ] goal-template TIME [ ] SCORE [S] PLAYER [P] TEAM [team-templ ] TYPE [ ] SUCCESS [ ] team-template TACTIC [ ] SCORE [S] NAME [ ] PLAYER [P] TRAINER [ ]

Example of Processing Formal Texts Formal Text The Formal Text annotated with domain- specific information

Example of Processing Semi-Formal Texts Semi-Formal Text The Semi-Formal Text annotated with domain-specific information

Merging Component Acting on the generated formal annotations (Metadata and Events), but also interleaving with the generation process of those Checking consistency, eliminating redundancy (Template Merging), in accordance with domain ontology Completing the information with domain knowledge, inference Machine

On-line Tasks Searching and Displaying Search for interesting events with formal queries Give me all goals from Overmars shot with his head in 1. Half. Event=Goal; Player=Overmars; Time<=45; Previous-Event=Headball Indicate hits by thumbnails & let user select scene Play scene via the Internet & allow scrolling etc Of course: slow motion, fast play, start/stop, etc User Guidance (Lexica and Ontology)

On-line Tasks Knowledge Guided User Interface & Search Engine München - Ajax 1998 München - Porto 1996 Deutschland - Brasilien 1998 Prototype Demo Play Movie Fragment of that Game 1:0 60 m25 m SchollBasler CampbellMatthäusBaslerNeville DribblingFreekickFoul 28min24 min18 min17 min DefensePassGoalFreekick

More about MPEG (Moving Picture Coding Experts Group) MPEG-1: low-level media encoding and compression format (VHS quality, ~ 2-3 Mbps - good for streaming) MPEG-2: improved media encoding and compression format (S-VHS quality, ~ 5-10 Mbps, digital TV and DVD standard MPEG-4: Codes content as objects and enables those objects to be manipulated at the client, optimized compression

On-line SW Architecture Client Applet JMF WWW Server Java Server Media Server MPEG1 Media Server MPEG1 Media Server MPEG1 DB Server rDBMS Media Server MPEG1 File Server HTTP RMI (RTP, RTSP) JDBC Query interface: automatic pre-selection guided by domain knowledge interactive, visual feedback Client-Server structure: fully distributed JMF media presentation RMI-based interaction Client Objects Query Engine Objects MetadataAnnotations KeyframesMPEG Movies Lexica Ontology Media Server Objects

On-line HW Architecture efficient & reliable storage management (near-line capacity, media change, 2. Location) high storage capacity (n TB, 1 h MPEG1 = 1 GB) powerful media servers / powerful network RAID Tape Library FC Switch Media Server GB Switch Internet 1Gbps Gb-Switch Router

UI / Annotation UI optimization thumbnails not that informative which thumbnail? (several around time mark) automatic thumbnail adjustment? seamless operation for user lexicon/ontology guidance user driven input Manual annotation tools MediaTagger EUDICO

Gain - User Group What gets lost? Is it necessary? Potential: direct Internet Service, less dependencies Current ProcedureMUMIS Procedure Manual Video Annotation Automatic Video Annotation and DB Integration Integration Central DB Query via PC Results on PC And Select & Play Contact Video Archive Get Video Tapes Search on Tape on VCR Segment & Play

Acknowledgements UEFA DFB, FA, KNVB EBU, WDR, NOS, SWR Allez les Bleus!!