Monday, April 6, 2009

Video Compass

Main idea: Extract the relative orientation of the camera with respect to the scene. This approach is based on the properties of the vanishing points. The vanishing points are computed from the intersection of parallel lines in uncalibrated images. It assumes that the majority of lines in man-made environments is alligned with the principal orthogonal directions of the world coordinate frame.

This approach can be summarized as follows:
  1. Line Detection.
  2. Lines are grouped into the dominant vanishing directions. In man-made environments are alligned with the principal orthogonal axes of the world reference frame.
  3. Computation of the vanishing points
    • Two cases: calibrated and uncalibrated camera. This work assumes the uncalibrated case.
    • The grouping and the estimation of the vanishing points is performed at the same time using an EM approach.
  4. The relative orientation of the camera wrt the scene is computed using the vanishing point constraints.
  • Calibration
    • The relation between image coordinates of a point and its counterpart in the calibrated and uncalibrated cases
λx = RX+ T and λx' = KRX+ KT
    • x' denotes a pixel coordinate of X and K is the intrinsic parameter matrix of the camera. The unit vectors of the world coordinate frame are e_i = [1,0,0]', e_j = [0,1,0]' and e_k = [0,0,1]' . The vanishing points corresponding to 3D lines parallel to either if these directions are
      • v_i = KRe_i
      • v_j = KRe_j
      • v_k = KRe_k
      • the orthogonallity relation between e_i, e_j, e_k readily provide constraints on the calibration matrix K. In particular
e'_i e_j = 0 -> v'_i K^(-T)RR^(-T)K¯¹v_j = v'_iK^(-T)K¯¹v_j = v'_i Sv_j = 0

    • where S is the metric associated with the uncalibrated camera S = K^(-T)K^(-1).
    • When three different vanishing points are detected, they provide three independient constraints on matrix S
    • v_i S v_j = 0, v'_i S v_k = 0 v'_j S v_k = 0
    • The zero skew constraint expresses the fact that the image axes are orthogonal can be written as [1, 0, 0]' S [0, 1, 0] = 0
    • with these constraints the matrix K can be computed by minimizing ||Bs||²

  • Relative orientation
    • since the vanishing directions are projections of the vectors associated with three orthogonal directions i, j, k, they depend on rotation only. In particular we can write that:
K¯¹v_i = Re_i K¯¹v_j = Re_j K¯¹v_k = Re_k
    • with each vanishing direction being proportional to the column of the rotation matrix R = [r_1, r_2, r_3]. Choosing the two best vanishing directions, properly normalizing them, the third row can be obtained as r_3 = cross(r_1,r_2) by enforcing the orthogonality constraints. There is a four way ambiguity in R due to the sign ambiguity in r1 and r2. Additional solutions can be eliminated by considering relative orientation or structure constraints.

No comments:

Post a Comment