Main idea: Based on the computation of the vanishing points (VP) and their matching through different images coming from different cameras, (hemispherically-tiled set of images captured from a single position in space) the computation of the relative camera rotation is performed.
This work first compute the vanishing points present in each image based on the direction of the edges extracted from the scene. The representation used for these edges is the intersection of the plane through the edge and the focal point with the Gaussian sphere. The direction of the 3D edges (vanishing points) is computed using an EM approach.
The vanishing point estimation is composed of two tightly-coupled sub-problems: classification (grouping observed edges into parallel sets) and estimation (finding the best VP for each set). The authors present an hybrid approach to VP detection and estimation which combines the Hough transform (HT) for detection with the acccuracy of least squares for estimation. An EM algorithm is formulated to probabilistically model edges and their directional uncertainty. A final verification step rejects spurious directions.
link: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=854809
Tuesday, April 7, 2009
Monday, April 6, 2009
Video Compass
Main idea: Extract the relative orientation of the camera with respect to the scene. This approach is based on the properties of the vanishing points. The vanishing points are computed from the intersection of parallel lines in uncalibrated images. It assumes that the majority of lines in man-made environments is alligned with the principal orthogonal directions of the world coordinate frame.
This approach can be summarized as follows:
This approach can be summarized as follows:
- Line Detection.
- Lines are grouped into the dominant vanishing directions. In man-made environments are alligned with the principal orthogonal axes of the world reference frame.
- Computation of the vanishing points
- Two cases: calibrated and uncalibrated camera. This work assumes the uncalibrated case.
- The grouping and the estimation of the vanishing points is performed at the same time using an EM approach.
- The relative orientation of the camera wrt the scene is computed using the vanishing point constraints.
- Calibration
- The relation between image coordinates of a point and its counterpart in the calibrated and uncalibrated cases
- x' denotes a pixel coordinate of X and K is the intrinsic parameter matrix of the camera. The unit vectors of the world coordinate frame are e_i = [1,0,0]', e_j = [0,1,0]' and e_k = [0,0,1]' . The vanishing points corresponding to 3D lines parallel to either if these directions are
- v_i = KRe_i
- v_j = KRe_j
- v_k = KRe_k
- the orthogonallity relation between e_i, e_j, e_k readily provide constraints on the calibration matrix K. In particular
e'_i e_j = 0 -> v'_i K^(-T)RR^(-T)K¯¹v_j = v'_iK^(-T)K¯¹v_j = v'_i Sv_j = 0
- where S is the metric associated with the uncalibrated camera S = K^(-T)K^(-1).
- When three different vanishing points are detected, they provide three independient constraints on matrix S
- v_i S v_j = 0, v'_i S v_k = 0 v'_j S v_k = 0
- The zero skew constraint expresses the fact that the image axes are orthogonal can be written as [1, 0, 0]' S [0, 1, 0] = 0
- with these constraints the matrix K can be computed by minimizing ||Bs||²
- Relative orientation
- since the vanishing directions are projections of the vectors associated with three orthogonal directions i, j, k, they depend on rotation only. In particular we can write that:
K¯¹v_i = Re_i K¯¹v_j = Re_j K¯¹v_k = Re_k
- with each vanishing direction being proportional to the column of the rotation matrix R = [r_1, r_2, r_3]. Choosing the two best vanishing directions, properly normalizing them, the third row can be obtained as r_3 = cross(r_1,r_2) by enforcing the orthogonality constraints. There is a four way ambiguity in R due to the sign ambiguity in r1 and r2. Additional solutions can be eliminated by considering relative orientation or structure constraints.
Planar Ego-Motion Without Correspondences
Motion estimation as Hough
- Camera viewing points P_i vectors relative to the camera's reference system.
- P_i's are projected to p_i = P_i/||P_i || by a spherical projective projection
- The camera undergoes a rigid motion in the X-Y plane
- This motion is given by (R,t) where R = R_z(α) and t, a vector in theX-Y plane
- t = R_z(θ)·e_1
- From the camera's new coordinate frame, the world points are given as Q_i = RP_i + t and the world points Q_i map to image points q_i = Q_i/||Q_i ||
- We infer that Rp_i, q_i and t lie on the same plane X-Y. This is expressed as
(Rz(α)pi × qi)'Rz(θ) e1 = 0.
- Extension to any plane given by the angles β and γ.
- The vector Rz(γ)Ry(β)e_3 is orthogonal to all vectors lying in the plane.
- We can define any plane of motions with rotations.
- Motions on the plane (β, γ) are simply a coordinate frame rotation from motions in the X-Y plane.
- If the point pairs (p_i, q_i) satisfy a motion in the plane (β, γ), then the point pairs (p'_i, q'_i ), where {p'_i, q'_i } = Ry(−β)Rz(−γ){p_i, q_i} satisfie the same motion in the X-Y plane.
Rz((Rz(α)(Rz(γ)Ry(β)' p_i × (Rz(γ)Ry(β))' q_i)'Rz(θ) e1 = 0
Subscribe to:
Posts (Atom)