ROBOT VISION Lesson 7/8: Shape from X Matthias Rüther

ROBOT VISION Lesson 7/8: Shape from X Matthias Rüther
Kawada Industries Inc. has introduced the HRP-2P for Robodex 2002. This humanoid appears to be very impressive. It is 154 cm (60") tall, weighs 58kg (127 lbs) and has 30 DOF. Here is a news release. Notice the LACK of a battery pack. Here is a new story about HRP2.

Shape from Monocular Images
2 dim + “?” => 3 dim A priori Wissen Verschiedene Möglichkeiten: Shape from Shading Streifenmethode Photometrisches Stereo Shape from Texture Statistische Methoden Strukturelle Methoden Geometrische Szeneneigenschaften Golfball

Overview Shape from Shading Shape from Specularities
Shape from X techniques Shape from Shading Shape from Specularities Three-Dimensional Information from Shadows Shape from Multiple Light Sources, Photometric Stereo Shape from Laser Ranging and Structured Light Images Shape from Texture Shape and Structure from Perspective Effects, Vanishing Points Three-Dimensional Reconstruction from Different Views Depth from Focus, Changing Camera Parameters Surface and Shape from Contours

Shape from Shading Schattierung der Oberfläche beeinflußt die Raumempfindung: Bsp.: Makeup ändert nicht nur Oberflächentextur erzeugt Highlights und Schatten => Raumempfindung

Shape from Shading In Computer Vision erstmals durch Horn [Horn77]:
Oberflächenreflexion von untexturierten Objekten beinhaltet Tiefeninformation

Flächengrenzen Flächengrenzen spielen entscheidende Rolle bei Interpretation durch Menschen: Begrenzungslinien werden als Grenze zu Hintergrund interpretiert: => gleiche Schattierung -verschiedene Form

Formrekonstruktion Zur Vereinfachung wird immer Normalprojektion verwendet: => Objekt ist entfernt und nahe der optischen Achse

Beleuchtung ist eine Funktion der Richtung und nicht Entfernung
Reflectance Map Reflexionseigenschaften sind zentraler Bestandteil von Shape from Shading Beleuchtung ist eine Funktion der Richtung und nicht Entfernung Beleuchtungsquelle ist weit entfernt keine Zweitbeleuchtung: Reflexion von anderen Gegenständen auf das Objekt ist ausgeschlossen Es gibt keine Schlagschatten, Objekt ist unverdeckt bezüglich Lichtquelle - nur Selbstschatten

Reflectance Map Lambert‘sche Oberflächen:
Helligkeit hängt nur von Beleuchtungsrichtung ab, nicht von Beobachtungsrichtung ein Oberflächenpunkt hat in allen Beobachtungsrichtungen gleiche Helligkeit keine Totalreflexion Reflectance Map einer Oberfläche: 2dim Plot des Gradientenraums (p,q) der normalisierten Bildhelligkeit einer Oberfläche als Funktion der Oberflächenorientierung

Gradient Space z=f(x,y) Oberflächennormale

Reflectance Map Equations

REFLECTANCE MODELS n LAMBERTIAN MODEL PHONG MODEL L = E r COS q
L = E (a COS q + b COS a) n albedo Diffuse albedo Specular a=0.3, b=0.7, n= a=0.7, b=0.3, n=0.5

Surface Reflectance Map

Reflectance Map 2dim Plot des Gradientenraums (p,q) der normalisierten Bildhelligkeit I einer Oberfläche als Funktion der Oberflächenorientierung

Nachteil des Modells: Es gibt keine komplett matten Oberflächen
Reflectance Map gerade Linie im Gradientenraum trennt beleuchtete von Schattenregionen: => Terminator Nachteil des Modells: Es gibt keine komplett matten Oberflächen Perfekter Spiegel: Licht wird nur in eine spezielle Richtung reflektiert (Einfallswinkel = Ausfallswinkel), in alle anderen Richtungen keine Reflexion

Reflectance Map Realität Kombination aus matten und spiegelnden Reflexionseigenschaften: Gewichteter Durchschnitt der diffusen und spiegelnden Komponente einer Oberfläche Reflectance Maps für eine bestimmte Oberfläche müssen experimentell bestimmt werden für allgemeines Shape from Shading nicht möglich für mache Oberflächen jedoch bereits bestimmt => Normform

Normform der Reflectance Map
Spiegelnde Reflexion durch Ellipse in Reflectance Map Übergang spiegelnd - diffus als Parabel Diffuser Teil: Hyperbel Grenze Licht - Schatten (Terminator) = Linie

Shape from Specularity
Suitable for highly reflective Surfaces Specular Reflection map of a single point source forms a sharp peak (Specular model, Phong model) Reflectance map

Shape from Specularity
Principle: If a reflection is seen by the camera and the position of the point source is known, the surface normal can be determined. => use several point sources with known position: structured highlight inspection Panorama mirror

Shape From Shadow Also: Shape from Darkness
Reconstruct Surface Topography from self-occlusion E.g. Building reconstruction in SAR images, terrain reconstruction in remote sensing

Shape From Shadow A static camera C observes a scene.
Light source L travels over the scene x, position of L is given by angle . L and C are an infinite distance away (orthographic projection). Shadowgram: binary function f(x, ), stating whether scene point x was shadowed at light position .

Photometric Stereo Multiple images, static camera, different illumination directions At least three images Known illumination direction Known reflection model (Lambert) Object may be textured

Photometric Stereo Albedo (reflectivity) Reflection model
3 unknowns per pixel at least three illumination directions

Photometric Stereo

Photometric Stereo: Example
Input images From Forsyth & Ponce Recovered normals Recovered albedo

Range Finder principles:
Runtime Range Finder Triangulation Range Finder - Sender and receiver with known position, triangulation similar to stereo principle Optical Range Finder Ultrasound Range Finder e.g.: Spot Projectors Moiré Range Finder Structured Light Range Finder Pattern (stripe) projection 1 image 2 dim + geometry => 3 dim Depth information Receiver/Sender position

Runtime Range Finder Determine sensor-object distance by measuring radiation runtime: 1. Sender (coherent light) 2. Scanning Unit 3. Receiver (light detector) 4. Phase detector Alt. Method: send light pulses => LIDAR (Light Radar), defense industry Problem: generating pulses, measuring runtime (both very short) Die Signalphaserelativ zur reference phase ist proportional zur Laufzeit. Die Signalamplitude ist proportional zur zur Oberflächenreflektion. => Lichtintensität und Range Data

Ultrasound Range Finder
Used in commercial cameras (Autofocus Spot) Advantages Independent of sorrounding light, Slow speed of ray Disadvantages coarse resolution, Bad accuracy, Pointwise, scanner necessary Multiple Reflection/Echoes Applications: Obstacle Detection: e.g.: “Car parking radar” Level Measurement, silos, tanks, … Underwater ranging (sonar), ... Typ. specifications: Range: 5cm to 1-5m Accuracy: +-3mm

Spot Projection Determine sensor-object distance pointwise
1. Sender (Laser beam) 2. Scanning unit 3. Receiver (CCD Camera) Distance by triangulation Laser Tracker: [Ishii76], early systems 1968 [Forsen68]

Triangulation Range Finder
Principle: Projection of a plane onto a plane -> intersection is line (stripe) Distortion of stripe gives object depth by Triangulation

Structured Light Range Finder
1. Sender (projects plane) 2. Receiver (CCD Camera) Geometry Z- direction X- direction Sensor image

1 plane -> 1 object profile
To get a 3D profile: Move the object Scanning Unit for projected plane Move the Sensor Object motion by conveyor band: => synchronization: measure distance along conveyor => y-accuracy determined by distance measurement Scanning Units (z.b: rotating mirror) are rare (accurate measurement of mirror motion is hard, small inaccuracy there -> large inaccuracy in geometry Move the sensor: e.g. railways: sensor in wagon coupled to speed measurement

Stripe Projection Determine object structure by projecting multiple stripes simultaneously and subsequent triangulation 1. Sender (planes) 2. Receiver (Camera)

Line projector (z.b: LCD-640)
Lamp Lens system LCD - Shutter Pattern structure Line projector (z.b: LCD-640) Focusing lens (e.g.: 150mm) Example

Pattern projection Camera: IMAG CCD, Res:750x590, f:16 mm
Projected light stripes Camera: IMAG CCD, Res:750x590, f:16 mm Projector: Liquid Crystal Display (LCD 640), f: 200mm, Distance to object plane: 120cm Range Image

Moiré Range Finder 1. Sender (Projektor mit Linien)
Project line structure, observe line structure through a grid 1. Sender (Projektor mit Linien) 2. Receiver (CCD Camera with line filter) Problem: identification of line ordering possible but hard, unsharp lines => inaccurate results Moiré Pattern Moiré Image

Triangulation Principle
Baseline b . CCD Kamera Z • Object point

Lichtpunkttechnik Einfachste Methode zur Messung von Entfernungen:
Punktweise Beleuchtung der Objektoberfläche Laser: Position auf der x-Achse : bekannter Richtungswinkel des Lasers zur Kamera : bekannter Schwenkwinkel des Lasers R: Schnittpunkt der Schwenkebene mit der optischen Achse P: zu vermessender Objektpunkt p(x,y): bekannter projizierter Objektpunkt P Z-achse: optische Achse der Kamera Es gilt für kamerazentriertes Koordinatensystem: räumlicher Fall eines projizierten Lichtpunktes

Lichtpunktstereoanalyse
Kombination der Lichtpunkttechnik mit Verfahren der statischen Stereoanalyse: Laserpunkt wird an beliebiger Stelle auf das zu vermessende Objekt projiziert und mit zwei Kameras aufgenommen. Entfernungsberechnung: aus der Position des Laserpunktes analog zur statischen Stereoanalyse Vorteil gegenüber Lichtpunkttechnik: keine aufwendige Kalibrierung, da Orientierung des Laserstrahls nicht in Berechnung eingeht. Prinzipieller Aufbau für ein Verfahren nach der Lichtpunktstereoanalyse

Zur Vereinfachung steht Lichtebene senkrecht zur Referenzeben.
Lichtschnittechnik 1) Projektion einer Lichtebene auf das Objekt (mittels Laser oder Projektoren) 2) Schnitt der Lichtebene mit der Objektoberfläche als Lichtstreifen sichtbar 3) Lokalisierung des Lichtstreifens mittels Kantenerkennung und Segmentierung Entfernungsberechnung: Kalibrierung: b, f, , Generierung eines Look-Up-Tables für Entfernungswerte Allgemein: Die Tiefenauflösung ist sowohl von der Auflösung des Bildsensors als auch von der Breite der erzeugten Lichtebene abhängig. 1) peak at position xo 2) left side und right side of the peal using a threshold Vt 3) center points Zur Vereinfachung steht Lichtebene senkrecht zur Referenzeben.

Gleichzeitige Projektion mehrerer Lichtschnitte
Anstatt einer Lichtebene werden mehrere Lichtebenen auf das Objekt projeziert, um die Anzahl der aufzunehmenden Bilder zu reduzieren. Entfernungsberechnung: wie mit einer Lichtebene, jedoch muß jeder Lichtstreifen im Bild eindeutig identifizierbar sein. Problem: Aufgrund von Verdeckungen sind einzelne Streifen teilweise oder gar nicht im Kamerabild sichtbar -> keine eindeutige Identifikation der Lichtstreifen Anwendung: Glattheitsüberprüfung bei planaren Oberflächen ohne Tiefenwertberechnung. 1) peak at position xo 2) left side und right side of the peal using a threshold Vt 3) center points

Binärcodierte Lichtschnittechik
Motivation: .) Lichtpunkt oder Lichtstreifen erfordern große Anzahl von Aufnahmen .) Mehrere projzierte Lichtmuster können nicht eindeutig identifiziert werden Codierung der Lichtmuster: Zur Unterscheidung der Lichtebenen Erzeugung zeitlich aufeinander folgender verschiedener binärcodierter Lichtmuster Vorteil: wesentlich weniger Aufnahmen für gleiche Anzahl von zu vermessenden Objektpunkten notwendig (log n) Codierung der Lichtstreifen nach Gray Code. Jede Lichtebene ist codiert jedes Pixel kann einer Lichtebene zugeordnet werden Tiefenberechnung erfolgt mit Hilfe einer Lookup Tabelle, für jede Lichtebene wird die Triangulation vorausberechnet (2d) Codierung

Tiefenberechnung für Streifenprojektor
Durch die zeitliche Abfolge der Aufnahmen wird es ermöglicht, daß jeder Lichtstreifen im Kamerabild identifiziert wird. Verschiedene Lichtstreifen sind notwendig, um für jeden Bildpunkt einen zugehörigen Lichtstreifen festzustellen zu können. 1) Unterschiedlich breite Lichtstreifen werden zeitlich aufeinanderfolgend in die Szene projiziert und von der Kamera aufgenommen. 2) Für jede Aufnahme wird für jeden Bildpunkt festgestellt, ob dieser beleuchtet wird oder nicht. 3) Diese Information wird für jeden Bildpunkt und für jede Aufnahme im sog. Bit-Plane Stack abgespeichert. 4) Findet man im Bit-Plane Stack für einen Bildpunkt die Information, daß er bei den Aufnahmen z.B. hell, dunkel, dunkel, hell war (Code 1001), dann folgt daraus, daß dieser Bildpunkt vom vierten Lichtstreifen beleuchtet wird. 5) eindeutige Zuordnung „Lichtstreifen – Bildpunkt“ möglich

Abschattungen Laser erreicht Laser erreicht Oberflächenp. nicht
Oberflächenpunkt keine Tiefeninformation Kamera sieht Oberflächenp. nicht keine Tiefeninformation Kamera sieht Oberflächenpunkt keine Tiefeninformation Tiefeninformation Laser Abschattung keine Kamera Abschattung keine Laser Abschattung

2 Projektoren erzeugen 2 Lichtebenen = uneindeutig
Abschattungen 2 virtuelle Kameras 2 Projektoren erzeugen 2 Lichtebenen = uneindeutig

Tiefenbild bzw. Range-Image
Vergleich Intensitätsbild-Tiefenbild Aufnahme eines Intensitätsbildes Aufnahme eines Tiefenbildes

Tiefenbild einer Fischdose sowie einer Zigarettenschachtel
Beispiele (1) Tiefenbild einer Fischdose sowie einer Zigarettenschachtel Tiefenbild Intensitätsbild Tiefenbild Kantenbild

Tiefenbild einer Ebene und eines Spielzeug LKW's
Beispiele (2) Tiefenbild einer Ebene und eines Spielzeug LKW's Ebene im KKS ungefiltert gefiltert

Beispiele (3) Tiefenbild eines Augapfels (Modell) mit Tumor
Tiefenbild eines Augapfels (Modell) ohne Tumor Kantenbild eines Augapfels (Modell) mit Tumor Kantenbild eines Augapfels (Modell) ohne Tumor

Shape from Texture A cat sitting on a table edge, depth cue is the change of texture size Discrimination of depth levels by size of boxes. Depth information within texture What is texture?

Texture Principle of Texture: repetition of a basic pattern.
Basic pattern not deterministic for natural objects Basic pattern approximately deterministic for man-made objects Repetition of pattern neither regular nor deterministic Only statistic regularity = uniform distribution Object shape can be recovered from geometric distortion of texture [Julesz75] Texture is not a local property, characterisitc for an area much larger than basic pattern. => Only basic shapes can be reconstructed (planes, spheres, …)

Texture Texture is the abstraction of a statistical homogenity in a part of the observer‘s field of view which contains much more information than the observer can handle. Texture is scale dependent: Many Texture elements need to be observed but must not exceed a certain maximum, such that their size is always bigger than sensor resolution.

Statistical Texture Analysis
Suitable for natural textures, since they are not deterministic but follow statistical rules. First order grey value statistics: statistics of single pixels compared to pairs, triples, quadruples etc. Normalized texture histogram: grey value distribution function Disadvantage of first order statistics: insensitive against permutations => Checkerboard may have same statistics as salt‘n pepper noise. Second order grey value statistics: Distribution of neighborhood relations (Co-occurence Matrix) E.g.: relation „right of“: black-white, grey-white, white-white More suited for texture classification, because no geometrical properties are modeled.

Structural Texture Analysis
For deterministic textures: Man made objects usually have deterministic textures Assumes that Texture is generated by a basic element „TEXEL“ which is repeated in a regular fashion. Perspective distortion of texels gives surface orientation/shape Characterization by invariant features (circles -> oriented ellipses)

Structural Texture Analysis
Distortion of a planar circle local surface orientation Slant Tilt

2. 1. Texel Texels are not unique
Different texels => identical resulting shape Exact description of texels necessary to model their distortion Simple, geometrical texels easier to handle 2. 1.

Texel Aloimonos & Swain: Shape from Shading  Shape from Texture
Circle = simple texel Same problem as shading: Object border including texture determines the shape Shape reconstruction based on perspective projection => Texture is a 2D surface pattern, not a 3D property. E.g. brick wall, printed cloth Aloimonos & Swain: Shape from Shading  Shape from Texture

Shape from Texel Based on local perspective distortion
Texel must be identifiable Texel must not overlap All Texels must have same spatial dimension Texel are „small„, locally planar and have a unique surface normal

Shape calculation Determine position of a plane using ist texture (slant,tilt):

Artificial exmaples using black circles
Examples Tree Sunflower Audience Artificial exmaples using black circles Algorithms usable in a limited fashion on „real“ scenes.

Examples Golfball

Geometrical Scene Constraints
Reconstruct shape from geometrical relations (similar to human vision). E.g.: parallel lines confine a plane Planes are oriented by their vanishing points Vanishing points and vanishing line give the plane orientation. Problem: find vanishing points

Geometrical Scene Constraints
e.g. intersecting area of parallel lines, symmetry axis of arbitrary objects

Shape from Parallel Lines

Shape From Focus Recover shape of surfaces from limitied depth of view. Requires visibly rough surfaces Typical application: optical microscopy

Shape From Focus Visibly Rough Surfaces
Optical roughness: the smallest spatial variations are much larger than the wavelength of incident electromagnetic wave. Visible roughness: smallest spatial variations are comparable to viewing area of discrete elements (pixels). Magnification: Multi-facet level: w1 >> wf, smooth texture Facet level: w2 ~= wf, rough texture

Shape From Focus Focused / Defocused images Focused:
Defocused: object point is mapped to spot with radius => defocusing is equivalent to convolution with low pass kernel (pillbox function)

Shape From Focus Changing Focus
Displacing the sensor: changes sharp region, magnification and brightness Moving the lens: changes sharp region, magnification and brightness Moving the object: changes sharp region only => Object is moved in front of static camera

Shape From Focus Overview:
At facet level magnification, rough surfaces give texture-rich images A defocused image is equivalent to a low-passed image As S moves towards focused plane, its focus increases. When S is best focused, Challenges: How to measure focus? How to find best focus from finite number of measurements?

Shape From Focus Focus measure operator Purpose: Possible solution:
respond to high frequency variations in image intensity within a small image area produce maximum response when image area is perfectly focused Possible solution: determine high frequency content using Fourier transform (slow) Alternative: High-pass image using Laplacian (problem with elimination) Modified Laplacian

Shape From Focus INxM … local intensity function (image window)
Sum Modified Laplacian Tenengrad Focus Measure Alternatives: variance of intensities, variance of gradients INxM … local intensity function (image window)

Shape From Focus Example Infinite DOF DEM

Shape From Focus Sampling the focus measure function
Consider a single image point (x,y) Focus measure F is function of depth d: F(d) Goal find Fpeak from finite number of samples F1…F8

Shape from focus Sampling the focus measure function
Possibility1: find highest discrete sample

Shape from focus Sampling the focus measure function
Possibility2: Gaussian interpolation Fit Gauss function to three strongest samples

Shape from X Demos Camera Calibration Shape from Stereo
point triangulation Ransac plane fitting Photometric stereo Shading Illusions

ROBOT VISION Lesson 7/8: Shape from X Matthias Rüther

Ähnliche Präsentationen

Präsentation zum Thema: "ROBOT VISION Lesson 7/8: Shape from X Matthias Rüther"— Präsentation transkript:

Ähnliche Präsentationen

Über Projekt

Feedback

Anmelden

Anmeldung über soziales Netzwerk:

ROBOT VISION Lesson 7/8: Shape from X Matthias Rüther

Ähnliche Präsentationen

Präsentation zum Thema: "ROBOT VISION Lesson 7/8: Shape from X Matthias Rüther"— Präsentation transkript:

Ähnliche Präsentationen

Über Projekt

Feedback