Publication: BirdNet+: two-stage 3D object detection in LiDAR through a sparsity-invariant bird's eye view
dc.affiliation.dpto | UC3M. Departamento de Ingeniería de Sistemas y Automática | es |
dc.affiliation.dpto | UC3M. Departamento de Informática | es |
dc.affiliation.grupoinv | UC3M. Grupo de Investigación: Laboratorio de Sistemas Inteligentes | es |
dc.affiliation.grupoinv | UC3M. Grupo de Investigación: Laboratorio de Control, Aprendizaje y Optimización de Sistemas (CAOS) | es |
dc.contributor.author | Barrera Del Pozo, Alejandro | |
dc.contributor.author | Beltrán de la Cita, Jorge | |
dc.contributor.author | Guindel Gómez, Carlos | |
dc.contributor.author | Iglesias Martínez, José Antonio | |
dc.contributor.author | García Fernández, Fernando | |
dc.contributor.funder | Comunidad de Madrid | es |
dc.date.accessioned | 2023-02-03T13:51:10Z | |
dc.date.available | 2023-02-03T13:51:10Z | |
dc.date.issued | 2021-11-30 | |
dc.description.abstract | Autonomous navigation relies upon an accurate understanding of the elements in the surroundings. Among the different on-board perception tasks, 3D object detection allows the identification of dynamic objects that cannot be registered by maps, being key for safe navigation. Thus, it often requires the use of LiDAR data, which is able to faithfully represent the scene geometry. However, although raw laser point clouds contain rich features to perform object detection, more compact representations such as the bird's eye view (BEV) projection are usually preferred in order to meet the time requirements of the control loop. This paper presents an end-to-end object detection network based on the well-known Faster R-CNN architecture that uses BEV images as input to produce the final 3D boxes. Our regression branches can infer not only the axis-aligned bounding boxes but also the rotation angle, height, and elevation of the objects in the scene. The proposed network provides state-of-the-art results for car, pedestrian, and cyclist detection with a single forward pass when evaluated on the KITTI 3D Object Detection Benchmark, with an accuracy that exceeds 64% mAP 3D for the Moderate difficulty. Further experiments on the challenging nuScenes dataset show the generalizability of both the method and the proposed BEV representation against different LiDAR devices and across a wider set of object categories by being able to reach more than 30% mAP with a single LiDAR sweep and almost 40% mAP with the usual 10-sweep accumulation. | en |
dc.description.sponsorship | This work was supported in part by the Government of Madrid (Comunidad de Madrid) under the Multiannual Agreement with the University Carlos III of Madrid (UC3M) in the line of "Fostering Young Doctors Research"(PEAVAUTO-CM-UC3M), and in part by the Context of the V Regional Programme of Research and Technological Innovation (PRICIT). | en |
dc.format.extent | 18 | |
dc.identifier.bibliographicCitation | Barrera, A., Beltran, J., Guindel, C., Iglesias, J. A. & Garcia, F. (2021). BirdNet+: Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird’s Eye View. IEEE Access, 9, 160299-160316. | en |
dc.identifier.doi | https://doi.org/10.1109/ACCESS.2021.3131389 | |
dc.identifier.publicationfirstpage | 160299 | |
dc.identifier.publicationlastpage | 160316 | |
dc.identifier.publicationtitle | IEEE Access | en |
dc.identifier.publicationvolume | 9 | |
dc.identifier.uri | https://hdl.handle.net/10016/36461 | |
dc.identifier.uxxi | AR/0000030633 | |
dc.language.iso | eng | |
dc.publisher | IEEE | |
dc.relation.projectID | Comunidad de Madrid. PEAVAUTO-CM-UC3M | es |
dc.rights | © The authors, 2021 | en |
dc.rights | Atribución 3.0 España | * |
dc.rights.accessRights | open access | en |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | * |
dc.subject.eciencia | Informática | es |
dc.subject.eciencia | Ingeniería Mecánica | es |
dc.subject.eciencia | Robótica e Informática Industrial | es |
dc.subject.other | Bird's eye view (BEV) | en |
dc.subject.other | LiDAR | en |
dc.subject.other | Object detection | en |
dc.subject.other | Autonomous driving | en |
dc.title | BirdNet+: two-stage 3D object detection in LiDAR through a sparsity-invariant bird's eye view | en |
dc.type | research article | * |
dc.type.hasVersion | VoR | * |
dspace.entity.type | Publication |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- BirdNet(plus)_IEEEA_2021.pdf
- Size:
- 3.77 MB
- Format:
- Adobe Portable Document Format