Translating Images into Maps


Unlike previous approaches, we treat the transformation to bird's-eye view (BEV) as an image-to-world translation problem, where the objective is to learn an alignment between vertical scan lines in the image and polar rays in BEV.
Transformers are well-suited to the image-to-BEV transformation problem, as they can reason about interdependence between objects, depths and the lighting of the scene to achieve a globally consistent representation.
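To make the scan-line-to-ray alignment concrete, here is a minimal sketch of a single attention step in which each position along a polar ray attends over the pixels of one image column. It is an illustration under stated assumptions, not the authors' implementation: the function name `column_to_ray_attention`, the shapes, and the random features are hypothetical, and learned projections, multiple heads and positional encodings are left out.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def column_to_ray_attention(column_feats, ray_queries):
    """Scaled dot-product cross-attention from a polar ray to an image column.

    column_feats: (H, C) features of one vertical scan line (H image rows).
    ray_queries:  (R, C) one query per depth bin along the polar ray.
    Returns the ray features (R, C) and the attention weights (R, H).
    """
    d = column_feats.shape[-1]
    scores = ray_queries @ column_feats.T / np.sqrt(d)  # (R, H)
    weights = softmax(scores, axis=-1)                  # each row sums to 1
    ray_feats = weights @ column_feats                  # (R, C)
    return ray_feats, weights

# Hypothetical sizes: 8 image rows, 5 depth bins, 4 feature channels.
rng = np.random.default_rng(0)
H, R, C = 8, 5, 4
column = rng.normal(size=(H, C))
queries = rng.normal(size=(R, C))
ray, attn = column_to_ray_attention(column, queries)
```

Each row of `attn` is a soft assignment of one depth bin to the image rows of the column, which is one way to read the "alignment" described above.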

Input: image, intrinsic matrix.

Output: semantic BEV maps for static and dynamic classes.
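The intrinsic matrix is what ties an image column to a direction on the ground plane. The sketch below shows that geometry with a standard pinhole model: a column index `u` and a set of forward distances are mapped to Cartesian BEV points. The helper `column_to_bev_points`, the example matrix `K`, and the choice to index the ray by forward distance `z` (rather than Euclidean range) are assumptions made for illustration, not details taken from the notes above.

```python
import numpy as np

def column_to_bev_points(u, depths, K):
    """Map image column u and forward distances to BEV (x, z) points.

    Pinhole model: x / z = (u - cx) / fx, so every column corresponds
    to one ray through the camera centre on the ground plane.
    """
    fx, cx = K[0, 0], K[0, 2]
    depths = np.asarray(depths, dtype=float)
    x = depths * (u - cx) / fx          # lateral offset at each depth
    return np.stack([x, depths], axis=-1)

# Hypothetical intrinsics: fx = fy = 1000 px, principal point (640, 360).
K = np.array([[1000.0,    0.0, 640.0],
              [   0.0, 1000.0, 360.0],
              [   0.0,    0.0,   1.0]])

centre = column_to_bev_points(640, [10.0, 20.0], K)  # ray straight ahead
right = column_to_bev_points(840, [10.0, 20.0], K)   # ray to the right
```

The centre column stays at `x = 0` for every depth, while a column 200 px to the right of the principal point moves 0.2 m sideways per metre of forward distance.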