Summary

As a postdoctoral researcher at the EcoVision lab at the University of Zurich, I collaborate with Jan Wegner to design deep learning methods for remote sensing and environmental applications. Before joining UZH, I completed my PhD on "Efficient Learning on Large-Scale 3D Point Clouds" at IGN and ENGIE Lab CRIGEN, under the supervision of Loïc Landrieu and Bruno Vallet.

You like trees 🌳? You like satellites 🛰️? You like me 😊



📃 Publications

🖼 Poster | 🎤 Oral

SuperCluster
Scalable 3D Panoptic Segmentation as Superpoint Graph Clustering
Damien Robert, Hugo Raguet, Loïc Landrieu
3DV 2024 Oral 🎤 (top 5.3% of submissions)

SuperCluster is a framework for efficient panoptic segmentation of large-scale point clouds. We formulate the panoptic task as a graph clustering problem: we train a model to predict node and edge attributes, which then serve as input to a downstream graph clustering algorithm. This allows the model to be trained with only local supervision, without non-maximum suppression or instance matching, and without any prior assumption on the number of objects in the scene. SuperCluster sets a new state of the art for panoptic segmentation on the indoor datasets S3DIS Area 5 (50.1 PQ (+7.8)) and ScanNetV2 (58.7 PQ (+25.2)), as well as on the outdoor datasets KITTI-360 (48.3 PQ) and DALES (61.2 PQ).
🦋 210k param. | ⚡ Train S3DIS F5 in 4h | 💾 20M-point inference on 1 GPU
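
To give a feel for the "panoptic segmentation as graph clustering" idea, here is a minimal sketch in Python. It is not the clustering algorithm used in SuperCluster: it swaps in a much simpler stand-in (thresholding predicted edge affinities, then taking connected components with union-find). The function name, the `threshold` parameter, and the toy inputs are all hypothetical.

```python
def cluster_superpoints(num_nodes, edges, affinities, node_labels, threshold=0.5):
    """Group superpoints into instances (simplified stand-in, see lead-in).

    Two superpoints are merged when they share an edge whose predicted
    affinity is at least `threshold` AND they have the same predicted class.
    Returns one instance id per superpoint, numbered from 0.
    """
    parent = list(range(num_nodes))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for (u, v), affinity in zip(edges, affinities):
        if affinity >= threshold and node_labels[u] == node_labels[v]:
            root_u, root_v = find(u), find(v)
            if root_u != root_v:
                parent[root_v] = root_u

    # Relabel component roots as consecutive instance ids.
    root_to_id = {}
    return [root_to_id.setdefault(find(i), len(root_to_id)) for i in range(num_nodes)]


# Toy chain of 5 superpoints: 0-1-2-3-4 (hypothetical predictions).
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]
affinities = [0.9, 0.2, 0.8, 0.95]   # predicted "same object" scores
node_labels = [0, 0, 0, 0, 1]        # predicted semantic class per superpoint
instances = cluster_superpoints(5, edges, affinities, node_labels)
print(instances)  # [0, 0, 1, 1, 2]
```

Note that the number of instances is never given as input: it falls out of the clustering, which is the property the description above highlights.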



Superpoint Transformer
Efficient 3D Semantic Segmentation with Superpoint Transformer
Damien Robert, Hugo Raguet, Loïc Landrieu
Paper | Webpage | Code
ICCV 2023 Poster 🖼 (top 26.8% of submissions)

SPT is a superpoint-based transformer 🤖 architecture that efficiently ⚡ performs semantic segmentation on large-scale 3D scenes. The method includes a fast algorithm that partitions 🧩 point clouds into a hierarchical superpoint structure, as well as a self-attention mechanism that exploits the relationships between superpoints at multiple scales. We reach SOTA on S3DIS 6-Fold (76.0 mIoU), KITTI-360 Val (63.5 mIoU), and DALES (79.6 mIoU) with:
🦋 212k param. | ⚡ Train S3DIS F5 in 3h | ⌚ SPG preprocessing ÷7
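
As a rough illustration of what a hierarchical superpoint structure looks like as data, the sketch below pools features from points to level-1 superpoints, then from level-1 to level-2 superpoints. This is only an assumed, simplified view: the partition algorithm and the attention layers of SPT are not shown, and `pool_mean` and the index arrays are hypothetical names with toy values.

```python
def pool_mean(features, parent_index, num_parents):
    """Average child feature vectors into their parent segment.

    features:     list of equal-length feature vectors, one per child
    parent_index: parent_index[i] is the parent segment of child i
    num_parents:  number of segments at the coarser level
    """
    dim = len(features[0])
    sums = [[0.0] * dim for _ in range(num_parents)]
    counts = [0] * num_parents
    for feat, parent in zip(features, parent_index):
        counts[parent] += 1
        for d in range(dim):
            sums[parent][d] += feat[d]
    # Empty segments get a zero vector.
    return [[s / c if c else 0.0 for s in row] for row, c in zip(sums, counts)]


# Five points with 1-D features, grouped into 3 superpoints, then into 2.
point_features = [[1.0], [3.0], [5.0], [7.0], [9.0]]
point_to_level1 = [0, 0, 1, 2, 2]
level1_to_level2 = [0, 0, 1]

level1_features = pool_mean(point_features, point_to_level1, 3)
level2_features = pool_mean(level1_features, level1_to_level2, 2)
print(level1_features)  # [[2.0], [5.0], [8.0]]
print(level2_features)  # [[3.5], [8.0]]
```

Working on a few superpoints per level instead of every point is what makes multi-scale reasoning affordable on large scenes.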



DeepViewAgg
Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation
Damien Robert, Bruno Vallet, Loïc Landrieu
Paper | Webpage | Code | Video
CVPR 2022 Oral 🎤 and Best Paper finalist 🎉 (top 0.4% of submissions)

An end-to-end multi-view aggregation method for 3D semantic segmentation from images and point clouds. We reach SOTA on S3DIS and KITTI-360 without requiring point cloud colorization, meshing, or depth sensors: just point clouds ☁, images 📸, and their poses.
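
For intuition, multi-view aggregation can be pictured as follows: a 3D point is seen in several images, each view contributes an image feature, and the features are merged into a single vector per point. The sketch below uses a softmax over per-view scores as the merging rule. This weighting scheme, the function name, and the scores are assumptions made for illustration, not the actual DeepViewAgg model.

```python
import math


def aggregate_views(view_features, view_scores):
    """Fuse the image features of one 3D point across the views that see it.

    view_features: list of equal-length feature vectors, one per view
    view_scores:   one scalar per view (hypothetical "view quality" score)
    Returns (fused_feature, weights), with weights = softmax(view_scores).
    """
    if not view_features or len(view_features) != len(view_scores):
        raise ValueError("need one score per view and at least one view")
    # Numerically stable softmax over the view scores.
    max_score = max(view_scores)
    exps = [math.exp(s - max_score) for s in view_scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(view_features[0])
    fused = [sum(w * f[d] for w, f in zip(weights, view_features)) for d in range(dim)]
    return fused, weights


# One point seen in two images with equal scores: the result is a plain average.
fused, weights = aggregate_views([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
print(fused, weights)
```

With unequal scores, views judged more informative dominate the fused feature, so a point is not forced to take its appearance from a single image.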



📑 Short Resume



🖼🎤 Talks & Presentations




📚 Teaching

  • 05/2024 NeRFs, and Diffusion at University of Zurich (Course Instructor, M2, 5.5 hours)
  • 01/2023 Deep Learning for Remote Sensing at ENSG (Course Instructor, M2, 13 hours)
  • 06/2022 3D Deep Learning for Remote Sensing at ISPRS 2022 (Tutorial Instructor, 1 day)
  • 05/2022 3D Deep Learning, Torch-Points3D & DeepViewAgg at ENGIE Lab CRIGEN (Tutorial Instructor, 1 day)
  • 01/2022 Deep Learning for Remote Sensing at ENSG (Course Instructor, M2, 9 hours)
  • 11/2020 Deep Learning for Computer Vision at Ecole Polytechnique (Teaching Assistant, M1, 12 hours)


🏠 Affiliations