Publication:
BirdNet+: Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird's Eye View

Abstract
Autonomous navigation relies on an accurate understanding of the elements in the surroundings. Among the different on-board perception tasks, 3D object detection enables the identification of dynamic objects that cannot be registered in maps, making it key for safe navigation. This task therefore often relies on LiDAR data, which faithfully represents the scene geometry. Although raw laser point clouds contain rich features for object detection, more compact representations such as the bird's eye view (BEV) projection are usually preferred to meet the time requirements of the control loop. This paper presents an end-to-end object detection network based on the well-known Faster R-CNN architecture that takes BEV images as input and produces the final 3D boxes. Our regression branches infer not only the axis-aligned bounding boxes but also the rotation angle, height, and elevation of the objects in the scene. The proposed network provides state-of-the-art results for car, pedestrian, and cyclist detection with a single forward pass when evaluated on the KITTI 3D Object Detection Benchmark, exceeding 64% 3D mAP for the Moderate difficulty. Further experiments on the challenging nuScenes dataset show the generalizability of both the method and the proposed BEV representation to different LiDAR devices and a wider set of object categories: the network reaches more than 30% mAP with a single LiDAR sweep and almost 40% mAP with the usual 10-sweep accumulation.
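To make the BEV input described above concrete, the following is a minimal sketch of projecting a KITTI-style LiDAR scan (an N x 4 array of x, y, z, intensity) into a three-channel BEV image with per-cell maximum height, mean intensity, and point density. The function name, grid ranges, cell size, and the density clipping constant are illustrative placeholders, not the paper's actual configuration; in particular, the paper's sparsity-invariant density normalization is approximated here by a simple clipped point count.

import numpy as np

def lidar_to_bev(points,
                 x_range=(0.0, 48.0),    # forward extent in meters (illustrative)
                 y_range=(-24.0, 24.0),  # lateral extent in meters (illustrative)
                 z_range=(-2.0, 1.5),    # height slice in meters (illustrative)
                 cell=0.1):              # grid resolution in meters per cell
    """Project an (N, 4) point cloud (x, y, z, intensity) into a
    3-channel BEV image: max height, mean intensity, point density."""
    x, y, z, r = points[:, 0], points[:, 1], points[:, 2], points[:, 3]

    # Keep only points inside the region of interest.
    keep = ((x >= x_range[0]) & (x < x_range[1]) &
            (y >= y_range[0]) & (y < y_range[1]) &
            (z >= z_range[0]) & (z < z_range[1]))
    x, y, z, r = x[keep], y[keep], z[keep], r[keep]

    h = int(round((x_range[1] - x_range[0]) / cell))
    w = int(round((y_range[1] - y_range[0]) / cell))

    # Discretize: forward axis maps to image rows, lateral axis to columns.
    rows = np.clip(((x_range[1] - x) / cell).astype(np.int64), 0, h - 1)
    cols = np.clip(((y - y_range[0]) / cell).astype(np.int64), 0, w - 1)

    height = np.zeros((h, w), dtype=np.float32)
    intensity = np.zeros((h, w), dtype=np.float32)
    density = np.zeros((h, w), dtype=np.float32)

    # Channel 0: maximum height per cell, normalized to [0, 1].
    z_norm = ((z - z_range[0]) / (z_range[1] - z_range[0])).astype(np.float32)
    np.maximum.at(height, (rows, cols), z_norm)
    # Channel 1: accumulated intensity, averaged below.
    np.add.at(intensity, (rows, cols), r.astype(np.float32))
    # Channel 2: raw point count per cell.
    np.add.at(density, (rows, cols), 1.0)

    intensity /= np.maximum(density, 1.0)        # mean intensity per cell
    density = np.minimum(density / 16.0, 1.0)    # clipped count as a crude density proxy
    return np.stack([height, intensity, density], axis=-1)

# Example usage on a KITTI velodyne scan (file name hypothetical):
# pts = np.fromfile("000000.bin", dtype=np.float32).reshape(-1, 4)
# bev = lidar_to_bev(pts)  # (480, 480, 3) float32 image

Such an image can then be fed to a standard 2D detection backbone, which is what makes the BEV encoding attractive for meeting the runtime constraints mentioned in the abstract.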
Keywords
Bird's eye view (BEV), LiDAR, Object detection, Autonomous driving
Bibliographic citation
Barrera, A., Beltrán, J., Guindel, C., Iglesias, J. A. & García, F. (2021). BirdNet+: Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird's Eye View. IEEE Access, 9, 160299–160316.