Die Präsentation wird geladen. Bitte warten

Die Präsentation wird geladen. Bitte warten

Oracle Data Warehouse Mit Big Data neue Horizonte für das Data Warehouse ermöglichen Alfred Schlaucher, Detlef Schroeder DATA WAREHOUSE.

Ähnliche Präsentationen


Präsentation zum Thema: "Oracle Data Warehouse Mit Big Data neue Horizonte für das Data Warehouse ermöglichen Alfred Schlaucher, Detlef Schroeder DATA WAREHOUSE."—  Präsentation transkript:

1 Oracle Data Warehouse Mit Big Data neue Horizonte für das Data Warehouse ermöglichen Alfred Schlaucher, Detlef Schroeder DATA WAREHOUSE

2 Big Data Buzz Word oder eine neue Dimension und Möglichkeiten Oracles Technologie zu Speichern von unstrukturierten und teilstrukturierten Massendaten Cloudera Framwork Connectors in die neue Welt Oracle Loader for Hadoop und HDFS Big Data Appliance Mit Oracle R Enterprise neue Analyse-Horizonte entdecken Big Data Analysen mit Endeca Themen

3 Customer Experience Management Endeca Information Discovery Firmenhauptsitz: Cambridge, Massachusetts 600+ Kunden, 33% aus den Fortune Produktlinien: Oracle Endeca | Steckbrief Geführte Suche - integriert in Web-Sites = Endeca Kernkompetenz Kombination aus Textbasierte Suche + Business Intelligence Copyright 2012 Oracle and it's affiliates. All rights reserved.

4 Oracle Endeca | Guided Search Dynam. Angabe der Treffermenge, sehr schnelle Aktualisierung Zutreffende Attribut- gruppen: thematisch sortiert mit dynam. Häufigkeits- angabe Graphisch festlegbare Filterkriterien Copyright 2012 Oracle and it's affiliates. All rights reserved.

5 Oracle Endeca | Information Discovery (OEID) Copyright 2012 Oracle and it's affiliates. All rights reserved.

6 Oracle Endeca | Information Discovery (OEID) OEID kombiniert Einfachheit der Suche mit Business Intelligence Analyse-Funktionen/-Power Basiert auf 10 Jahren Design- Erfahrung im E-Commerce (B2C und B2B) Suche + Faceted Navigation (Facetten Suche) + Visuelle Analyse –Suche und Attributauswahl ähnlich wie auf einer Web-Site –User Interface Konzept ist teilw. vergleichbar mit dem Tool Infozoom (ist aber ein Fat Client) –Ergebnisse mit Karten, Tag Clouds, etc. visualisierbar Schnell reagierender Endeca Server erlaubt interaktive Analysen bzw. den Aufbau agiler BI Anwendungen Copyright 2012 Oracle and it's affiliates. All rights reserved.

7 Advanced Search Search look-ahead Spell-correction Data-driven filtering Visual Analysis Charting & crosstabs Geographic visualization Tag clouds Faceted Navigation Select attributes, like a web site ++ Interaktiv Daten untersuchen und neue Zusammenhänge entdecken Oracle Endeca | Information Discovery (OEID) Copyright 2012 Oracle and it's affiliates. All rights reserved.

8 E-Commerce Customer Experience Mgmt. Intuitive, easy-to-use Benutzeroberflächen für Konsumenten Schnell Öffentlicher Dienst Information Discovery High Performance, Skalierbarkeit, Zugriffssicherheit Oracle Endeca | Einsatzgebiete Unternehmen Suche / Information Discovery verteilt, komplex, veränderte Daten und Inhalte Copyright 2012 Oracle and it's affiliates. All rights reserved.

9 OEID Architektur | Server Hybride Technologie: Suchmaschine und analytische Datenbank in einem Umfassende Suchfunktionen, Navigation und Analytik über unterschiedliche und sich ändernde Daten(-quellen) Columnar Storage Model / In-Memory Verarbeitung –Datenspeicherung auf Festplatte –Überführung in RAM, sobald Daten referenziert werden –Embedded Index-Trees: nur benötigte Daten werden gescannt Faceted Data Model Parallelisierbar Endeca Server OEID Studio OEID Information Integration Copyright 2012 Oracle and it's affiliates. All rights reserved.

10 Original System (z.B. Datenbank) Care Team IDGenderOHIP NumberPatient IDPatient CityDisease Medikamentbeschreibung (Text) Endeca Index Gender OHIP Number CareTeam ID Disease Sulfonylureas Shaun Mahal Drug Description Metformin Shaun Mahal | June 20, 2010 Metformin was approved for use in the U.S. for treatment of type 2 diabetes in December, It is sold under the brand name Glucophage and is also available generically. Metformin is approved for treatment with sulfonylureas, or with insulin, or as monotherapy (by itself). Glucophage XR Extended Release tablets, a once daily version of metformin, is available. Also, metformin is available… Metformin OEID Architektur | Server – Faceted Data Model Copyright 2012 Oracle and it's affiliates. All rights reserved.

11 OEID Architektur | Server – Faceted Data Model TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier beherbergt: –quasi kein Schema, jeder Record beschreibt sich bzw. repräsentiert sich selbst und kann prinzipiell ein eigenes Schema haben –Multi-value Datenfelder sind möglich –Unstrukturierte Datenfelder sind möglich Modell ist eine Art Key Value store Modell besteht aus –Records / Attributen –Facetten (= Zeiger auf Kantenlisten in Polygonnetzen) Jeder Record ist eine Sammlung von Attribute-Werte-Paaren Keine Aufteilung der Daten(-speicherung) in Tabellen Merkmale & Beispiel Endeca Record Copyright 2012 Oracle and it's affiliates. All rights reserved.

12 TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier ETL Strukturierte Daten können direkt via ETL in ein Faceted Data Model geladen und gespeichert werden –Jedes Tupel wird zu einem Record –Jede Spalte wird zu einem Attribut Transaction TxnIDProductIDCategoryAmount Mountain Bike Road Bike1399 Relationale Tabelle OEID Architektur | Server – Faceted Data Model Integration strukturierter Daten Copyright 2012 Oracle and it's affiliates. All rights reserved.

13 OEID Architektur | Server – Faceted Data Model Integration semi-strukturierter Daten TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier Semi-strukturierte Daten können als Key-Value-Paare aus XML Quellen, Feeds,Unternehmens-Applikationen,, etc. geladen werden Typische Datenstruktur, die auch im Polizeiumfeld vielfach verwendet wird Fox 32 F-Series Aluminium Bontrager SSR Fork and shock sag meter Water Bottle 20lb. Composite Bontrager Race ETL XML Quelle Copyright 2012 Oracle and it's affiliates. All rights reserved.

14 OEID Architektur | Server – Faceted Data Model Integration unstrukturierter Daten TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier Unstrukturierte Daten können mit anderen Records über einen beliebigen Schlüssel verbunden werden Unstrukturierte Elemente können separat als eigene Records für side by side Analysen gespeichert werden Endeca Content Acquisition System (CAS) lädt Dokumente, RSS-Feeds und kann Twitter, Facebook, Web-Foren crawlen Review: #1301 Product: 506 A great bike for off road. Smooth ride over the bumps Review: #1327 Product: 507 Disappointing for the price. The frame feels heavier than I expected. CAS+ETL Copyright 2012 Oracle and it's affiliates. All rights reserved.

15 OEID Architektur | Server – Faceted Data Model Daten anreichern TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 506 Category = Mountain Bike Amount = $ Suspension = Fox 32 F-Series FrameType = Aluminium Saddle = Bontrager SSR Mountain Accessories = Fork and shock sag meter Mountain Accessories = Water Bottle Review = A great bike for off road. Smooth ride over the bumps ReviewSentiment = Positive ReviewTerm = Great ReviewTerm = Off Road ReviewTerm = Smooth ReviewTerm = Bumps TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier TxnID = ProductID = 507 Category = Road Bike Amount = $ Weight = 20lb. FrameType = Composite Saddle = Bontrager Race Review = Disappointing for the price. The frame feels heavier than I expected. ReviewSentiment = Negative ReviewTerm = Disappointing ReviewTerm = Price ReviewTerm = Heavier Jedes unstrukturierte Attribut kann prinzipiell um weitere Informationen angereichert werden, z.B. durch Text Analytics zur Erweiterung Datensatzstruktur Gängige Techniken: –Automatic Tagging –Named Entity Extraction –Sentiment Analysis –Term Extraction –Geospatial Matching Copyright 2012 Oracle and it's affiliates. All rights reserved.

16 Supports a breadth of structured and unstructured search capabilities : Guided Navigation Keyword search Boolean search Parametric search Wildcard search Dimension search Dimension filters Dimension precedence rules Numeric range filters Geospatial filters Date/Time filters Security filters Spell correction/suggestion, DYM Find similar 1- and 2-way synonyms Stemming and lemmatization Keyword-in-context snippeting Results clustering Relevance ranking Sorting and paging Language support Search index storage and analysis build on same column storage core as structured data store / indexes Acme Corp, 375 value, id Corp, 375 (89,72) word, id, position Structured Data Column Search Index Column OEID Architektur | Server – Volltextsuche Copyright 2012 Oracle and it's affiliates. All rights reserved.

17 OEID Architektur | Information Integration Endeca Server OEID Studio OEID Information Integration Endeca Workbench CloverETL Copyright 2012 Oracle and it's affiliates. All rights reserved.

18 Erweiterbares Framework für die Anbindung und Behandlung unstrukturierter Datenquellen Crawler für Dateiserver und das Web Adapter für Content-Management Systeme Text und Metadaten Extraktion Text Enrichment Fähigkeiten Flexible und agile ETL Umgebung Adapters für JDBC und übliche Dateitypen (XML, delimited, fixed-width, etc.) Java SDK Framework zum Erstellen eigener Adapter und Module zur Daten Manipulation Open API Endeca Server Unstructured Data Structured and Semi-Structured Data Enterprise Structured Data OEID Architektur | Information Integration Data Integrator (aktuell: CloverETL) Content Aquisition System (CAS) Copyright 2012 Oracle and it's affiliates. All rights reserved.

19 Interaktive, Komponenten- basierte Benutzeroberfläche Komplette Bibliothek mit fertigen BI-Komponenten enthalten – Realisiert mit Best Practice Design Pattern für die User Interface Entwicklung – AJAX Interaktion Setzt auf Industriestandards Enterprise-class manageability Endeca Server OEID Studio OEID Information Integration OEID Architektur | OEID Studio Copyright 2012 Oracle and it's affiliates. All rights reserved.

20 Advanced Visualization Bookmarks Breadcrumbs Chart Data Sources Guided Navigation ® Performance Metrics Range Filters Record Details Results Table Search Box Metrics Bar Cross Tab Find Similar OEID Studio Komponenten – Out-of-the-Box OEID Architektur | OEID Studio Copyright 2012 Oracle and it's affiliates. All rights reserved.

21 OEID Architektur | OEID Studio Jede Komponente hat eigene Kontrolleinstellungen und einen Editor Copyright 2012 Oracle and it's affiliates. All rights reserved.

22 Demonstration

23 Integration strukturierter, semi-strukturierter und unstrukturierter Daten erfolgt via Endeca Content Aquisition System (CAS) und ETL OEID | Zusammenfassung Structured Semi-Structured Unstructured Interaktive und Geführte Suche über dynamische Filter, Drill- down Datenvisualsierung, z.B. mit Tag Clouds, Geodaten, Master- Detail-Diagramme Power-User Funktionen (vgl. mit infoZoom) Oracle Endeca Information Discovery kombiniert eine leistungsfähige Suchmaschine mit einer Analytischen Datenbank zu einer agilen Business Intelligence Lösung Endeca Server (MDEX- Engine): spaltenorientierte Datenhaltung, In-Memory Technologie, Faceted Datenmodell Offene API / zusätzliche Komponenten, z.B. für Entitäten Extraktion, Sentiment Analyse

24 OEID | Zusammenfassung

25 KOMBINIERTER ANALYSE-ANSATZ MIT ORACLE BIG DATA / ENDECA Copyright 2012 Oracle and it's affiliates. All rights reserved.

26 File Systems SOA, ESB, Web Service Databases Content Mgt Systems Internet / Social Networks Enterprise Systems & Content Stores Un-/Semi- structured Data Sources Datenstrom | Erfassen | Organisieren | Analysieren | Entscheiden Hadoop Distributed File System (HDFS) Oracle NoSQL Database Oracle OLTP Database Information Integration ETL/ELT-Systems (Warehouse Builder. Data Integrator) Oracle Loader für HADOOP Hadoop MapReduce (Framework) Data Snapshots Unstructured Data Transformation Data Warehouse & Data Marts Oracle Data Warehouse Database Endeca In-Memory DB OLAP Cubes Data Marts, Analysis Sandpits In-Database Analytics (R, Data Mining, etc.) Information Discovery & Search Information Delivery Oracle Business Intelligence Oracle Endeca Studio Oracle Endeca Integration Suite Analytical Applications Reports, Visualisierung,... Embedded Analytics / Search …. Multidim. Analysis & Search Oracle Big Data Appliance Oracles kombinierter Analyse-Ansatz Copyright 2012 Oracle and it's affiliates. All rights reserved.

27 Kontakt und mehr Informationen Oracle Data Warehouse Community Mitglied werden Viele kostenlose Seminare und Events Download – Server: Nächste deutschsprachige Oracle DWH Konferenz: März 2013 Kassel

28


Herunterladen ppt "Oracle Data Warehouse Mit Big Data neue Horizonte für das Data Warehouse ermöglichen Alfred Schlaucher, Detlef Schroeder DATA WAREHOUSE."

Ähnliche Präsentationen


Google-Anzeigen