One fundamental problem in computer vision and image processing is modeling the image formation of a camera, i.e., mapping a point in three-dimensional space to its projected position on the camera's image plane. If the relationship between the space and the image plane is assumed to be linear, it can be expressed as a transformation matrix, which is often identified by regression. In this paper, we show that the space-to-image relationship in a camera can be modeled by a simple neural network. Unlike most other applications of neural networks, the structure of the network is optimized so that each link between neurons has a physical meaning. This makes it possible to initialize the link weights effectively and train the network quickly.
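To make the linear case concrete, the following is a minimal sketch of identifying a transformation matrix by regression. It is not the method of this paper: it assumes a standard 3x4 projection matrix in homogeneous coordinates and uses the direct linear transformation solved by SVD. The names `project`, `fit_projection`, `K`, `Rt`, and all numeric values are hypothetical and chosen only for illustration.

```python
import numpy as np

def project(P, X):
    """Project N x 3 space points with a 3 x 4 matrix P; returns N x 2 image points."""
    Xh = np.hstack([X, np.ones((X.shape[0], 1))])  # homogeneous coordinates
    x = Xh @ P.T
    return x[:, :2] / x[:, 2:3]                    # perspective division

def fit_projection(X, uv):
    """Estimate P (up to scale) from point correspondences by linear least squares."""
    Xh = np.hstack([X, np.ones((X.shape[0], 1))])
    rows = []
    for Xi, (u, v) in zip(Xh, uv):
        # Each correspondence gives two equations that are linear in the entries of P
        rows.append(np.concatenate([Xi, np.zeros(4), -u * Xi]))
        rows.append(np.concatenate([np.zeros(4), Xi, -v * Xi]))
    A = np.asarray(rows)
    # The solution is the right singular vector with the smallest singular value
    _, _, Vt = np.linalg.svd(A)
    return Vt[-1].reshape(3, 4)

# Synthetic camera (assumed values): intrinsics K and pose [R | t]
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])
Rt = np.hstack([np.eye(3), np.array([[0.1], [-0.2], [5.0]])])
P_true = K @ Rt

rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(20, 3))  # noise-free space points in front of the camera
uv = project(P_true, X)

P_est = fit_projection(X, uv)
reproj_error = np.abs(project(P_est, X) - uv).max()
```

In this sketch the twelve entries of the estimated matrix play the role that link weights would play in a single linear layer followed by a division, which is one way to read the claim that the weights can carry a physical meaning and be initialized from a known estimate.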