UnderOneFacade: Worldwide Facade Semantic Segmentation Benchmark Dataset

Anna Klimkowska; Benjamin Busam; Brian Sheil; Christoph Holst; Fan Wang; Filip Biljecki; Olaf Wysocki; Prabin Gyawali; Wanru Yang; Yi Wang

arxiv: 2607.02018 · v1 · pith:37ZOAJBInew · submitted 2026-07-02 · 💻 cs.CV

UnderOneFacade: Worldwide Facade Semantic Segmentation Benchmark Dataset

Yi Wang , Fan Wang , Prabin Gyawali , Ziyang Xu , Anna Klimkowska , Yixiong Jing , Wanru Yang , Filip Biljecki

show 4 more authors

Christoph Holst Benjamin Busam Brian Sheil Olaf Wysocki

This is my paper

Pith reviewed 2026-07-03 16:12 UTC · model grok-4.3

classification 💻 cs.CV

keywords 3D facade segmentationsemantic segmentation benchmarkpoint cloud datasetcross-domain generalizationhierarchical labelsarchitectural elementsLoFG3 benchmark

0 comments

The pith

UnderOneFacade supplies the largest cross-continent 3D facade benchmark, where top models reach only 33 IoU on fine-grained labels.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that progress toward globally consistent 3D facade segmentation has been limited by narrow or inconsistent datasets. It releases UnderOneFacade, a collection of centimeter-accurate point clouds from multiple countries and continents carrying 2.7 billion harmonized hierarchical labels. Systematic tests of point-based, graph-based, and transformer architectures reveal sharp drops in performance when models move across geographic domains and weak recognition of detailed architectural elements.

Core claim

UnderOneFacade is the largest cross-country and cross-continent 3D facade benchmark to date, built from centimeter-accurate point clouds with hierarchical, harmonized semantic labels, and shows that current methods achieve at most 33 IoU on the fine-grained LoFG3 benchmark while degrading across domains.

What carries the argument

The UnderOneFacade dataset of 2.7 billion harmonized hierarchical semantic annotations on geographically diverse, high-precision point clouds.

If this is right

Models can now be trained and evaluated under a single standardized label set for cross-domain transfer.
Research attention will shift toward architectures that better capture fine architectural details such as window styles or ornamentation.
Pretrained models released with the dataset provide a common starting point for further geographic generalization work.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Global digital-twin construction projects gain a concrete testbed for checking whether segmentation models transfer beyond training cities.
Similar hierarchical harmonization efforts could be applied to other urban 3D elements such as roofs or street furniture.
The scale of the data invites experiments that combine the point clouds with aligned imagery for multi-modal improvements.

Load-bearing premise

The hierarchical semantic labels have been harmonized consistently across countries and the point clouds are both centimeter-accurate and representative of real-world facade variation.

What would settle it

A model that exceeds 40 IoU on the LoFG3 benchmark while maintaining performance across the geographic splits would indicate that current limitations are overstated.

Figures

Figures reproduced from arXiv: 2607.02018 by Anna Klimkowska, Benjamin Busam, Brian Sheil, Christoph Holst, Fan Wang, Filip Biljecki, Olaf Wysocki, Prabin Gyawali, Wanru Yang, Yi Wang, Yixiong Jing, Ziyang Xu.

**Figure 1.** Figure 1: UnderOneFacade: We present a centimeter-accurate and cross-continental facade point clouds, with fine-grained semantic segmentation of architectural elements, and hierarchical facade taxonomy enabling multi-level evaluation (LoFG2–LoFG3), totaling 2.7B annotated points. Abstract. Globally consistent semantic digital twins require centimeteraccurate and geographically transferable 3D facade segmentation.… view at source ↗

**Figure 2.** Figure 2: Comparison of per-point class distributions of UnderOneFacade (2.7BN points) and ZAHA (0.6 BN). Cross-country aggregation yields a substantially heavier-tailed and structurally more diverse distribution. Data Splits. Each country subset is divided into training, validation, and test splits following a 70/20/10 ratio based on the total number of points. Splitting is performed per building, ensuring that co… view at source ↗

**Figure 3.** Figure 3: Qualitative facade segmentation results on UnderOneFacade on the LoFG3: We visualize predictions of representative architectures across scenes from different countries. Rows show example facades, while columns correspond to different segmentation models. Despite correct segmentation of dominant structures such as walls and roofs, models struggle to consistently recognize fine-grained facade elements, incl… view at source ↗

read the original abstract

Globally consistent semantic digital twins require centimeter-accurate and geographically transferable 3D facade segmentation. However, progress in facade parsing is limited by the lack of large-scale, standardized benchmarks for evaluating cross-domain generalization. Existing datasets are geographically narrow, semantically inconsistent, or insufficiently precise. We introduce UnderOneFacade, the largest cross-country and cross-continent 3D facade benchmark to date, comprising centimeter-accurate point clouds with hierarchical, harmonized, and architecturally grounded semantic labels totaling 2.7 billion annotated points. Through a systematic evaluation of representative point-, graph- and transformer-based architectures, we show that current methods struggle to recognize fine-grained architectural elements and degrade significantly across geographic domains, with the best models achieving only up to 33 IoU on the fine-grained LoFG3 benchmark. By combining geometric precision with standardized semantics at unprecedented scale, UnderOneFacade establishes a rigorous benchmark for developing robust and transferable 3D segmentation models. The dataset, evaluation scripts, and pretrained models will be released upon publication.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

UnderOneFacade offers a large cross-continent facade point cloud dataset but its 33 IoU and domain degradation claims cannot be checked because the data is unreleased and no labeling or split details are given.

read the letter

UnderOneFacade brings a large new cross-continent 3D facade dataset with harmonized hierarchical labels, but the reported performance numbers rest on details that are not provided and data that is not yet released.

The paper's strength is in the dataset itself. Collecting 2.7 billion points from multiple continents with consistent semantic labels at different levels of granularity fills a gap that existing facade datasets leave open. Those are often limited to single cities or regions and use varying label sets. The evaluation across different architectures showing that fine-grained elements are hard and that performance drops across domains points to a real issue for building models that work globally for digital twins.

It frames the problem clearly in terms of needing geographically transferable segmentation.

The soft spots are significant though. The abstract gives the 33 IoU figure for the best models on LoFG3 but supplies no information on how the labels were harmonized across countries, what the validation process was, or the train and test splits. There are also no specifics on the model configurations. Since the dataset and scripts are only promised for release later, the central claims about cross-domain degradation cannot be verified or reproduced right now. This leaves open the possibility that some of the observed issues come from annotation inconsistencies rather than model limitations.

This paper is mainly for researchers in 3D computer vision working on urban scene understanding and domain adaptation. Someone building or testing segmentation models for city-scale applications could find the benchmark useful once it is out.

I would recommend sending it to peer review if the authors include the missing methodological details and make the data available at submission time. As presented, the empirical part is too thin to stand on its own.

Referee Report

3 major / 2 minor

Summary. The paper introduces UnderOneFacade, the largest cross-country 3D facade segmentation benchmark to date, consisting of centimeter-accurate point clouds totaling 2.7 billion annotated points with hierarchical, harmonized semantic labels. It systematically evaluates representative point-, graph-, and transformer-based architectures and reports that current methods struggle with fine-grained architectural elements, with the best models achieving only up to 33 IoU on the fine-grained LoFG3 benchmark and showing significant degradation across geographic domains. The dataset, evaluation scripts, and pretrained models are promised for release upon publication.

Significance. If the label harmonization protocol, inter-annotator statistics, train/test splits, and model configurations are rigorously documented and the dataset is released with reproducible evaluation code, the benchmark could meaningfully advance research on geographically transferable 3D facade parsing. The reported scale (2.7B points, multi-continent coverage) and the empirical observation of low fine-grained performance address a documented gap in existing facade datasets. The absence of these details in the current manuscript, however, prevents assessment of whether the 33 IoU figure and cross-domain claims reflect genuine architectural generalization failure or annotation/metric artifacts.

major comments (3)

[Abstract] Abstract: The central empirical claims (best model 33 IoU on LoFG3; significant cross-domain degradation) are stated without any information on train/test splits, label validation process, inter-annotator agreement, or exact model configurations and hyperparameters. These omissions are load-bearing because the paper's primary contribution is the benchmark evaluation itself.
[Dataset construction] Dataset construction section (inferred from abstract description of 'harmonized' labels): No protocol, country-specific validation, or consistency checks are supplied for the hierarchical semantic label harmonization across continents. Without this, it is impossible to separate genuine cross-domain generalization failure from annotation drift, directly undermining the claim that the benchmark enables 'rigorous' evaluation of transferable models.
[Evaluation] Evaluation section: The manuscript states that 'the dataset, evaluation scripts, and pretrained models will be released upon publication' but provides no current access, no code, and no supplementary material containing splits or validation statistics. This renders the reported IoU numbers and degradation results unverifiable at review time.

minor comments (2)

[Abstract] Abstract: The phrase 'centimeter-accurate point clouds' is repeated without supporting evidence or error metrics; a brief quantitative statement on point-cloud accuracy would strengthen the claim.
[Dataset] The hierarchical label taxonomy (LoFG3 etc.) is referenced but never defined or illustrated; a table or figure showing the label hierarchy and example annotations per country would improve clarity.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The comments highlight important areas where additional documentation is needed to strengthen the reproducibility of the benchmark. We address each major comment below and outline the revisions we will make to the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: The central empirical claims (best model 33 IoU on LoFG3; significant cross-domain degradation) are stated without any information on train/test splits, label validation process, inter-annotator agreement, or exact model configurations and hyperparameters. These omissions are load-bearing because the paper's primary contribution is the benchmark evaluation itself.

Authors: We agree that the abstract, constrained by length, omits these details. The full manuscript contains dedicated sections describing the train/test splits (Section 3.3), label validation process (Section 3.2), and model configurations with hyperparameters (Section 4.2). However, inter-annotator agreement statistics are not currently reported. We will revise the abstract to include a concise reference to these elements and add a new supplementary table with exact hyperparameters, split statistics, and inter-annotator agreement metrics. revision: partial
Referee: [Dataset construction] Dataset construction section (inferred from abstract description of 'harmonized' labels): No protocol, country-specific validation, or consistency checks are supplied for the hierarchical semantic label harmonization across continents. Without this, it is impossible to separate genuine cross-domain generalization failure from annotation drift, directly undermining the claim that the benchmark enables 'rigorous' evaluation of transferable models.

Authors: The current manuscript provides only a high-level description of the harmonization process. We will add a detailed subsection in the Dataset construction section that specifies the harmonization protocol, including the mapping rules between country-specific label sets, country-specific validation procedures performed by domain experts, and quantitative consistency checks (e.g., overlap metrics across annotators from different continents). This addition will directly support the claim of rigorous cross-domain evaluation. revision: yes
Referee: [Evaluation] Evaluation section: The manuscript states that 'the dataset, evaluation scripts, and pretrained models will be released upon publication' but provides no current access, no code, and no supplementary material containing splits or validation statistics. This renders the reported IoU numbers and degradation results unverifiable at review time.

Authors: We acknowledge that the current submission provides no reviewer-accessible materials. To enable verification during review, we will include an anonymized supplementary archive with the train/test splits, label validation statistics, evaluation scripts, and a subset of the data sufficient to reproduce the reported IoU figures. The full dataset and models will still be released publicly upon acceptance, consistent with the original statement. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical dataset release with no derivations or fitted predictions

full rationale

The paper introduces a new benchmark dataset and evaluates existing segmentation models on it. There are no mathematical derivations, first-principles predictions, parameter fits, or self-citation chains that reduce claims to inputs by construction. The central claims concern dataset scale, label harmonization, and observed model performance (e.g., 33 IoU), which are empirical statements rather than derived results. No steps match the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Dataset introduction paper with no free parameters, axioms, or invented entities beyond the dataset construction itself.

pith-pipeline@v0.9.1-grok · 5745 in / 1078 out tokens · 29645 ms · 2026-07-03T16:12:59.026202+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

55 extracted references · 55 canonical work pages · 2 internal anchors

[1]

ISPRS International Journal of Geo- Information4(4), 2842–2889 (2015) 16 Y

Biljecki, F., Stoter, J., Ledoux, H., Zlatanova, S., Çöltekin, A.: Applications of 3D city models: State of the art review. ISPRS International Journal of Geo- Information4(4), 2842–2889 (2015) 16 Y. Wang et al

work page 2015
[2]

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXLIII-B1-2021, 125–131 (2021)

Blaser, S., Meyer, J., Nebiker, S.: Open urban and forest datasets from a high- performance mobile mapping backpack – a contribution for advancing the creation of digital city twins. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXLIII-B1-2021, 125–131 (2021)

work page 2021
[3]

ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences IV-1, 21–28 (2018)

Borgmann, B., Schatz, V., Kieritz, H., Scherer-Klöckling, C., Hebel, M., Arens, M.: Data processing and recording using a versatile multi-sensor vehicle. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences IV-1, 21–28 (2018)

work page 2018
[4]

ISPRS Journal of Photogrammetry and Remote Sensing232, 675–688 (2026)

Chen, Y., Li, J., Han, T., Feng, H., Chen, J., Wang, C.: City-facade: A city-level large-scale point cloud building facade dataset for semantic & instance segmenta- tion. ISPRS Journal of Photogrammetry and Remote Sensing232, 675–688 (2026)

work page 2026
[5]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Dai,A.,Chang,A.X.,Savva,M.,Halber,M.,Funkhouser,T.,Nießner,M.:Scannet: Richly-annotated 3d reconstructions of indoor scenes. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 5828–5839 (2017)

work page 2017
[6]

Proceedings of the European Conference on Computer Vision (ECCV) pp

Dai, A., Nießner, M.: 3dmv: Joint 3d-multi-view prediction for 3d semantic scene segmentation. Proceedings of the European Conference on Computer Vision (ECCV) pp. 452–468 (2018)

work page 2018
[7]

Australasian Conference on Robotics and Automation2(1) (2013)

De Deuge, M., Quadros, A., Hung, C., Douillard, B.: Unsupervised feature learning for classification of outdoor 3d scans. Australasian Conference on Robotics and Automation2(1) (2013)

work page 2013
[8]

Remote Sensing13(22), 4713 (2021)

Deschaud, J.E., Duque, D., Richa, J.P., Velasco-Forero, S., Marcotegui, B., Goulette, F.: Paris-CARLA-3D: A real and synthetic outdoor point cloud dataset for challenging tasks in 3D mapping. Remote Sensing13(22), 4713 (2021)

work page 2021
[9]

ISPRS Journal of Photogrammetry and Remote Sensing 163, 327–342 (2020)

Dong, Z., Liang, F., Yang, B., Xu, Y., Zang, Y., Li, J., Wang, Y., Dai, W., Fan, H., Hyyppä, J., et al.: Registration of large-scale terrestrial laser scanner point clouds: A review and benchmark. ISPRS Journal of Photogrammetry and Remote Sensing 163, 327–342 (2020)

work page 2020
[10]

Gehrung, J.A.: Change detection in point clouds of urban street spaces using fuzzy spatial reasoning. Ph.D. thesis, Technical University of Munich, Munich, Germany (2022)

work page 2022
[11]

geofabrik

Geofabrik: Openstreetmap data extracts.https : / / download . geofabrik . de/ (2020), Accessed: 2020-10-01

work page 2020
[12]

Geosystems, L.: Leica blk arc.https://leica-geosystems.com/en-gb/products/ laser-scanners/scanners/leica-blk-arc(2026), Accessed: 2026-01-30

work page 2026
[13]

com / en - gb/products/laser- scanners/software/leica- cyclone/leica- cyclone- 3dr (2026), Accessed: 2026-01-30

Geosystems, L.: Leica cyclone 3dr.https : / / leica - geosystems . com / en - gb/products/laser- scanners/software/leica- cyclone/leica- cyclone- 3dr (2026), Accessed: 2026-01-30

work page 2026
[14]

Geosystems, L.: Leica rtc360 3d laser scanner.https://leica-geosystems.com/ products/laser-scanners/scanners/leica-rtc360(2026), Accessed: 2026-01-30

work page 2026
[15]

A2d2: Audi autonomous driving dataset , year =

Geyer, J., Kassahun, Y., Mahmudi, M., Ricou, X., Durgesh, R., Chung, A.S., Hauswald, L., Pham, V.H., Mühlegg, M., Dorn, S., et al.: A2D2: Audi autonomous driving dataset. arXiv preprint arXiv:2004.06320 (2020)

work page arXiv 2004
[16]

Expert Systems with Applications238, 121842 (2024)

González-Collazo, S.M., Balado, J., Garrido, I., Grandío, J., Rashdi, R., Tsir- anidou, E., del Río-Barral, P., Rúa, E., Puente, I., Lorenzo, H.: Santiago urban dataset sud: Combination of handheld and mobile laser scanning point clouds. Expert Systems with Applications238, 121842 (2024)

work page 2024
[17]

SynthCity: A large scale synthetic point cloud

Griffiths, D., Boehm, J.: SynthCity: A large scale synthetic point cloud. arXiv preprint arXiv:1907.04758 (2019)

work page internal anchor Pith review Pith/arXiv arXiv 1907
[18]

Semantic3D.net: A new Large-scale Point Cloud Classification Benchmark

Hackel, T., Savinov, N., Ladicky, L., Wegner, J.D., Schindler, K., Pollefeys, M.: Semantic3d.net: A new large-scale point cloud classification benchmark. arXiv preprint arXiv:1704.03847 (2017) UnderOneFacade 17

work page internal anchor Pith review Pith/arXiv arXiv 2017
[19]

In: Shi, W., Goodchild, M.F., Batty, M., Kwan, M.P., Zhang, A

Kolbe, T.H., Donaubauer, A.: Semantic 3D city modeling and BIM. In: Shi, W., Goodchild, M.F., Batty, M., Kwan, M.P., Zhang, A. (eds.) Urban Informatics. pp. 609–636. Springer Singapore, Singapore (2021)

work page 2021
[20]

PFG – Journal of Photogrammetry, Remote Sensing and Geoin- formation Science88(1), 1–19 (2020)

Kutzner, T., Chaturvedi, K., Kolbe, T.H.: CityGML 3.0: New functions open up new applications. PFG – Journal of Photogrammetry, Remote Sensing and Geoin- formation Science88(1), 1–19 (2020)

work page 2020
[21]

ITcon17(9) (2012)

Laakso, M., Kiviniemi, A.: The IFC standard: A review of history, development, and standardization, information technology. ITcon17(9) (2012)

work page 2012
[22]

In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

Lai, X., Liu, J., Jiang, L., Wang, L., Zhao, H., Liu, S., Qi, X., Jia, J.: Stratified transformer for 3d point cloud segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 8500–8509 (2022)

work page 2022
[23]

Lande, M.B.: Automatic registration of partially overlapping terrestrial laser scan- ner point clouds.https://prs.igp.ethz.ch/research/completed_projects/ automatic_registration_of_point_clouds.html(2012), Accessed: 2020-10-30

work page 2012
[24]

Advances in Neural Information Processing System (NeurIPS) 31(2018)

Li, Y., Bu, R., Sun, M., Wu, W., Di, X., Chen, B.: Pointcnn: Convolution on x- transformed points. Advances in Neural Information Processing System (NeurIPS) 31(2018)

work page 2018
[25]

Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d,

Liao, Y., Xie, J., Geiger, A.: KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D and 3D. arXiv preprint arXiv:2109.13410 (2021)

work page arXiv 2021
[26]

In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T

Loiseau, R., Aubry, M., Landrieu, L.: Online segmentation of lidar sequences: Dataset and algorithm. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022. pp. 301–317. Springer Nature Switzerland, Cham (2022)

work page 2022
[27]

The International Archives of the Photogram- metry, Remote Sensing and Spatial Information SciencesXLIII-B2-2020, 1419– 1426 (2020)

Matrone, F., Lingua, A., Pierdicca, R., Malinverni, E.S., Paolanti, M., Grilli, E., Remondino, F., Murtiyoso, A., Landes, T.: A benchmark for large-scale heritage point cloud semantic segmentation. The International Archives of the Photogram- metry, Remote Sensing and Spatial Information SciencesXLIII-B2-2020, 1419– 1426 (2020)

work page 2020
[28]

ISPRS International Journal of Geo-Information9(9), 535 (2020)

Matrone, F., Grilli, E., Martini, M., Paolanti, M., Pierdicca, R., Remondino, F.: Comparing machine and deep learning methods for large 3D heritage semantic segmentation. ISPRS International Journal of Geo-Information9(9), 535 (2020)

work page 2020
[29]

International conference on intelligent robots and systems (IROS) pp

Maturana, D., Scherer, S.: Voxnet: A 3d convolutional neural network for real- time object recognition. International conference on intelligent robots and systems (IROS) pp. 922–928 (2015)

work page 2015
[30]

IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR) pp

Munoz, D., Bagnell, J.A.D., Vandapel, N., Hebert, M.: Contextual classification with functional Max-Margin Markov networks. IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR) pp. 975 – 982 (June 2009)

work page 2009
[31]

In: Thirteenth International Conference on 3D Vision (2026)

Nguyen, D., Lai, Y.L., Zhang, Q., Gyawali, P., Wang, F., Schwab, B., Wysocki, O., Kolbe, T.H.: Truecity: Real and simulated urban data for cross-domain 3d scene understanding. In: Thirteenth International Conference on 3D Vision (2026)

work page 2026
[32]

Automation in Construction141, 104430 (2022)

Pantoja-Rosero, B.G., Achanta, R., Kozinski, M., Fua, P., Perez-Cruz, F., Beyer, K.: Generating LoD3 building models from structure-from-motion and semantic segmentation. Automation in Construction141, 104430 (2022)

work page 2022
[33]

Data in Brief60, 111661 (2025).https://doi.org/https://doi.org/10.1016/ j.dib.2025.111661,https://www.sciencedirect.com/science/article/pii/ S2352340925003919

Pellis, E., Masiero, A., Betti, M., Tucci, G., Grussenmeyer, P.: A photogram- metric image-point dataset for the semantic segmentation of heritage buildings. Data in Brief60, 111661 (2025).https://doi.org/https://doi.org/10.1016/ j.dib.2025.111661,https://www.sciencedirect.com/science/article/pii/ S2352340925003919

work page arXiv 2025
[34]

Neural Networks108, 533–543 (2018) 18 Y

Phan, A.V., Le Nguyen, M., Nguyen, Y.L.H., Bui, L.T.: Dgcnn: A convolutional neural network over large-scale labeled graphs. Neural Networks108, 533–543 (2018) 18 Y. Wang et al

work page 2018
[35]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: Deep learning on point sets for 3d classification and segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 652–660 (2017)

work page 2017
[36]

Advances in neural information processing systems30(2017)

Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: Deep hierarchical feature learn- ing on point sets in a metric space. Advances in neural information processing systems30(2017)

work page 2017
[37]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Riegler, G., Osman Ulusoy, A., Geiger, A.: Octnet: Learning deep 3d represen- tations at high resolutions. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 3577–3586 (2017)

work page 2017
[38]

In: Proceedings of the IEEE/CVF International Conference on Computer Vision

Robert, D., Raguet, H., Landrieu, L.: Efficient 3d semantic segmentation with su- perpoint transformer. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 17195–17204 (2023)

work page 2023
[39]

The International Journal of Robotics Research37(6), 545–557 (2018)

Roynard, X., Deschaud, J.E., Goulette, F.: Paris-Lille-3D: A large and high-quality ground-truth urban point cloud dataset for automatic segmentation and classifica- tion. The International Journal of Robotics Research37(6), 545–557 (2018)

work page 2018
[40]

In: Proceedings of the International Con- ference on Pattern Recognition Applications and Methods

Serna, A., Marcotegui, B., Goulette, F., Deschaud, J.E.: Paris-rue-Madame database: As 3D mobile laser scanner dataset for benchmarking urban detection, segmentation and classification methods. In: Proceedings of the International Con- ference on Pattern Recognition Applications and Methods. ACM, Angers, France, 6–8 March. pp. 819–824 (2014)

work page 2014
[41]

Springer Science & Business Media (2010)

Szeliski, R.: Computer vision: Algorithms and applications. Springer Science & Business Media (2010)

work page 2010
[42]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Tan, W., Qin, N., Ma, L., Li, Y., Du, J., Cai, G., Yang, K., Li, J.: Toronto-3D: A large-scale mobile LiDAR dataset for semantic segmentation of urban roadways. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 202–203 (2020)

work page 2020
[43]

In: Proceedings of the IEEE/CVF international conference on computer vision

Thomas, H., Qi, C.R., Deschaud, J.E., Marcotegui, B., Goulette, F., Guibas, L.J.: Kpconv: Flexible and deformable convolution for point clouds. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 6411–6420 (2019)

work page 2019
[44]

Computers & Graphics49, 126–133 (2015)

Vallet, B., Brédif, M., Serna, A., Marcotegui, B., Paparoditis, N.: TerraMo- bilita/iQmulus urban point cloud analysis benchmark. Computers & Graphics49, 126–133 (2015)

work page 2015
[45]

Wang,P.S.:Octformer:Octree-basedtransformersfor3dpointclouds.ACMTrans- actions on Graphics (TOG)42(4), 1–11 (2023)

work page 2023
[46]

In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

Wu, X., Jiang, L., Wang, P.S., Liu, Z., Liu, X., Qiao, Y., Ouyang, W., He, T., Zhao, H.: Point transformer v3: Simpler faster stronger. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 4840–4851 (2024)

work page 2024
[47]

Advances in Neural Information Processing Systems35, 33330–33342 (2022)

Wu, X., Lao, Y., Jiang, L., Liu, X., Zhao, H.: Point transformer v2: Grouped vector attention and partition-based pooling. Advances in Neural Information Processing Systems35, 33330–33342 (2022)

work page 2022
[48]

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXL VI- 2/W1-2022, 529–536 (2022)

Wysocki, O., Hoegner, L., Stilla, U.: TUM-FAÇADE: Reviewing and enriching point cloud benchmarks for façade segmentation. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXL VI- 2/W1-2022, 529–536 (2022)

work page 2022
[49]

In: Pro- ceedings of the Winter Conference on Applications of Computer Vision (WACV)

Wysocki, O., Tan, Y., Froech, T., Xia, Y., Wysocki, M., Hoegner, L., Cremers, D., Holst, C.: ZAHA: Introducing the Level of Facade Generalization and the Large- Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset. In: Pro- ceedings of the Winter Conference on Applications of Computer Vision (WACV). pp. 7637–7647 (February 2025) UnderOneFacade 19

work page 2025
[50]

IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) pp

Wysocki, O., Xia, Y., Wysocki, M., Grilli, E., Hoegner, L., Cremers, D., Stilla, U.: Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networks. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) pp. 6547–6557 (2023)

work page 2023
[51]

Computational Visual Media11(1), 83–101 (2025)

Yang, Y.Q., Guo, Y.X., Xiong, J.Y., Liu, Y., Pan, H., Wang, P.S., Tong, X., Guo, B.: Swin3d: A pretrained transformer backbone for 3d indoor scene understanding. Computational Visual Media11(1), 83–101 (2025)

work page 2025
[52]

In: Proceedings of the IEEE/CVF International Conference on Computer Vision

Yeshwanth,C.,Liu,Y.C.,Nießner,M.,Dai,A.:ScanNet++:Ahigh-fidelitydataset of 3d indoor scenes. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 12–22 (2023)

work page 2023
[53]

In: Proceed- ings of the IEEE/CVF international conference on computer vision

Zhao, H., Jiang, L., Jia, J., Torr, P.H., Koltun, V.: Point transformer. In: Proceed- ings of the IEEE/CVF international conference on computer vision. pp. 16259– 16268 (2021)

work page 2021
[54]

In: Proceedings of the IEEE/CVF international conference on computer vision

Zhou, H., Feng, Y., Fang, M., Wei, M., Qin, J., Lu, T.: Adaptive graph convolution for point cloud analysis. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 4965–4974 (2021)

work page 2021
[55]

Remote Sensing12(11), 1875 (2020)

Zhu, J., Gehrung, J., Huang, R., Borgmann, B., Sun, Z., Hoegner, L., Hebel, M., Xu, Y., Stilla, U.: TUM-MLS-2016: An annotated mobile LiDAR dataset of the TUM City Campus for semantic point cloud interpretation in urban areas. Remote Sensing12(11), 1875 (2020)

work page 2016

[1] [1]

ISPRS International Journal of Geo- Information4(4), 2842–2889 (2015) 16 Y

Biljecki, F., Stoter, J., Ledoux, H., Zlatanova, S., Çöltekin, A.: Applications of 3D city models: State of the art review. ISPRS International Journal of Geo- Information4(4), 2842–2889 (2015) 16 Y. Wang et al

work page 2015

[2] [2]

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXLIII-B1-2021, 125–131 (2021)

Blaser, S., Meyer, J., Nebiker, S.: Open urban and forest datasets from a high- performance mobile mapping backpack – a contribution for advancing the creation of digital city twins. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXLIII-B1-2021, 125–131 (2021)

work page 2021

[3] [3]

ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences IV-1, 21–28 (2018)

Borgmann, B., Schatz, V., Kieritz, H., Scherer-Klöckling, C., Hebel, M., Arens, M.: Data processing and recording using a versatile multi-sensor vehicle. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences IV-1, 21–28 (2018)

work page 2018

[4] [4]

ISPRS Journal of Photogrammetry and Remote Sensing232, 675–688 (2026)

Chen, Y., Li, J., Han, T., Feng, H., Chen, J., Wang, C.: City-facade: A city-level large-scale point cloud building facade dataset for semantic & instance segmenta- tion. ISPRS Journal of Photogrammetry and Remote Sensing232, 675–688 (2026)

work page 2026

[5] [5]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Dai,A.,Chang,A.X.,Savva,M.,Halber,M.,Funkhouser,T.,Nießner,M.:Scannet: Richly-annotated 3d reconstructions of indoor scenes. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 5828–5839 (2017)

work page 2017

[6] [6]

Proceedings of the European Conference on Computer Vision (ECCV) pp

Dai, A., Nießner, M.: 3dmv: Joint 3d-multi-view prediction for 3d semantic scene segmentation. Proceedings of the European Conference on Computer Vision (ECCV) pp. 452–468 (2018)

work page 2018

[7] [7]

Australasian Conference on Robotics and Automation2(1) (2013)

De Deuge, M., Quadros, A., Hung, C., Douillard, B.: Unsupervised feature learning for classification of outdoor 3d scans. Australasian Conference on Robotics and Automation2(1) (2013)

work page 2013

[8] [8]

Remote Sensing13(22), 4713 (2021)

Deschaud, J.E., Duque, D., Richa, J.P., Velasco-Forero, S., Marcotegui, B., Goulette, F.: Paris-CARLA-3D: A real and synthetic outdoor point cloud dataset for challenging tasks in 3D mapping. Remote Sensing13(22), 4713 (2021)

work page 2021

[9] [9]

ISPRS Journal of Photogrammetry and Remote Sensing 163, 327–342 (2020)

Dong, Z., Liang, F., Yang, B., Xu, Y., Zang, Y., Li, J., Wang, Y., Dai, W., Fan, H., Hyyppä, J., et al.: Registration of large-scale terrestrial laser scanner point clouds: A review and benchmark. ISPRS Journal of Photogrammetry and Remote Sensing 163, 327–342 (2020)

work page 2020

[10] [10]

Gehrung, J.A.: Change detection in point clouds of urban street spaces using fuzzy spatial reasoning. Ph.D. thesis, Technical University of Munich, Munich, Germany (2022)

work page 2022

[11] [11]

geofabrik

Geofabrik: Openstreetmap data extracts.https : / / download . geofabrik . de/ (2020), Accessed: 2020-10-01

work page 2020

[12] [12]

Geosystems, L.: Leica blk arc.https://leica-geosystems.com/en-gb/products/ laser-scanners/scanners/leica-blk-arc(2026), Accessed: 2026-01-30

work page 2026

[13] [13]

com / en - gb/products/laser- scanners/software/leica- cyclone/leica- cyclone- 3dr (2026), Accessed: 2026-01-30

Geosystems, L.: Leica cyclone 3dr.https : / / leica - geosystems . com / en - gb/products/laser- scanners/software/leica- cyclone/leica- cyclone- 3dr (2026), Accessed: 2026-01-30

work page 2026

[14] [14]

Geosystems, L.: Leica rtc360 3d laser scanner.https://leica-geosystems.com/ products/laser-scanners/scanners/leica-rtc360(2026), Accessed: 2026-01-30

work page 2026

[15] [15]

A2d2: Audi autonomous driving dataset , year =

Geyer, J., Kassahun, Y., Mahmudi, M., Ricou, X., Durgesh, R., Chung, A.S., Hauswald, L., Pham, V.H., Mühlegg, M., Dorn, S., et al.: A2D2: Audi autonomous driving dataset. arXiv preprint arXiv:2004.06320 (2020)

work page arXiv 2004

[16] [16]

Expert Systems with Applications238, 121842 (2024)

González-Collazo, S.M., Balado, J., Garrido, I., Grandío, J., Rashdi, R., Tsir- anidou, E., del Río-Barral, P., Rúa, E., Puente, I., Lorenzo, H.: Santiago urban dataset sud: Combination of handheld and mobile laser scanning point clouds. Expert Systems with Applications238, 121842 (2024)

work page 2024

[17] [17]

SynthCity: A large scale synthetic point cloud

Griffiths, D., Boehm, J.: SynthCity: A large scale synthetic point cloud. arXiv preprint arXiv:1907.04758 (2019)

work page internal anchor Pith review Pith/arXiv arXiv 1907

[18] [18]

Semantic3D.net: A new Large-scale Point Cloud Classification Benchmark

Hackel, T., Savinov, N., Ladicky, L., Wegner, J.D., Schindler, K., Pollefeys, M.: Semantic3d.net: A new large-scale point cloud classification benchmark. arXiv preprint arXiv:1704.03847 (2017) UnderOneFacade 17

work page internal anchor Pith review Pith/arXiv arXiv 2017

[19] [19]

In: Shi, W., Goodchild, M.F., Batty, M., Kwan, M.P., Zhang, A

Kolbe, T.H., Donaubauer, A.: Semantic 3D city modeling and BIM. In: Shi, W., Goodchild, M.F., Batty, M., Kwan, M.P., Zhang, A. (eds.) Urban Informatics. pp. 609–636. Springer Singapore, Singapore (2021)

work page 2021

[20] [20]

PFG – Journal of Photogrammetry, Remote Sensing and Geoin- formation Science88(1), 1–19 (2020)

Kutzner, T., Chaturvedi, K., Kolbe, T.H.: CityGML 3.0: New functions open up new applications. PFG – Journal of Photogrammetry, Remote Sensing and Geoin- formation Science88(1), 1–19 (2020)

work page 2020

[21] [21]

ITcon17(9) (2012)

Laakso, M., Kiviniemi, A.: The IFC standard: A review of history, development, and standardization, information technology. ITcon17(9) (2012)

work page 2012

[22] [22]

In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

Lai, X., Liu, J., Jiang, L., Wang, L., Zhao, H., Liu, S., Qi, X., Jia, J.: Stratified transformer for 3d point cloud segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 8500–8509 (2022)

work page 2022

[23] [23]

Lande, M.B.: Automatic registration of partially overlapping terrestrial laser scan- ner point clouds.https://prs.igp.ethz.ch/research/completed_projects/ automatic_registration_of_point_clouds.html(2012), Accessed: 2020-10-30

work page 2012

[24] [24]

Advances in Neural Information Processing System (NeurIPS) 31(2018)

Li, Y., Bu, R., Sun, M., Wu, W., Di, X., Chen, B.: Pointcnn: Convolution on x- transformed points. Advances in Neural Information Processing System (NeurIPS) 31(2018)

work page 2018

[25] [25]

Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d,

Liao, Y., Xie, J., Geiger, A.: KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D and 3D. arXiv preprint arXiv:2109.13410 (2021)

work page arXiv 2021

[26] [26]

In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T

Loiseau, R., Aubry, M., Landrieu, L.: Online segmentation of lidar sequences: Dataset and algorithm. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022. pp. 301–317. Springer Nature Switzerland, Cham (2022)

work page 2022

[27] [27]

The International Archives of the Photogram- metry, Remote Sensing and Spatial Information SciencesXLIII-B2-2020, 1419– 1426 (2020)

Matrone, F., Lingua, A., Pierdicca, R., Malinverni, E.S., Paolanti, M., Grilli, E., Remondino, F., Murtiyoso, A., Landes, T.: A benchmark for large-scale heritage point cloud semantic segmentation. The International Archives of the Photogram- metry, Remote Sensing and Spatial Information SciencesXLIII-B2-2020, 1419– 1426 (2020)

work page 2020

[28] [28]

ISPRS International Journal of Geo-Information9(9), 535 (2020)

Matrone, F., Grilli, E., Martini, M., Paolanti, M., Pierdicca, R., Remondino, F.: Comparing machine and deep learning methods for large 3D heritage semantic segmentation. ISPRS International Journal of Geo-Information9(9), 535 (2020)

work page 2020

[29] [29]

International conference on intelligent robots and systems (IROS) pp

Maturana, D., Scherer, S.: Voxnet: A 3d convolutional neural network for real- time object recognition. International conference on intelligent robots and systems (IROS) pp. 922–928 (2015)

work page 2015

[30] [30]

IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR) pp

Munoz, D., Bagnell, J.A.D., Vandapel, N., Hebert, M.: Contextual classification with functional Max-Margin Markov networks. IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR) pp. 975 – 982 (June 2009)

work page 2009

[31] [31]

In: Thirteenth International Conference on 3D Vision (2026)

Nguyen, D., Lai, Y.L., Zhang, Q., Gyawali, P., Wang, F., Schwab, B., Wysocki, O., Kolbe, T.H.: Truecity: Real and simulated urban data for cross-domain 3d scene understanding. In: Thirteenth International Conference on 3D Vision (2026)

work page 2026

[32] [32]

Automation in Construction141, 104430 (2022)

Pantoja-Rosero, B.G., Achanta, R., Kozinski, M., Fua, P., Perez-Cruz, F., Beyer, K.: Generating LoD3 building models from structure-from-motion and semantic segmentation. Automation in Construction141, 104430 (2022)

work page 2022

[33] [33]

Data in Brief60, 111661 (2025).https://doi.org/https://doi.org/10.1016/ j.dib.2025.111661,https://www.sciencedirect.com/science/article/pii/ S2352340925003919

Pellis, E., Masiero, A., Betti, M., Tucci, G., Grussenmeyer, P.: A photogram- metric image-point dataset for the semantic segmentation of heritage buildings. Data in Brief60, 111661 (2025).https://doi.org/https://doi.org/10.1016/ j.dib.2025.111661,https://www.sciencedirect.com/science/article/pii/ S2352340925003919

work page arXiv 2025

[34] [34]

Neural Networks108, 533–543 (2018) 18 Y

Phan, A.V., Le Nguyen, M., Nguyen, Y.L.H., Bui, L.T.: Dgcnn: A convolutional neural network over large-scale labeled graphs. Neural Networks108, 533–543 (2018) 18 Y. Wang et al

work page 2018

[35] [35]

In: Proceedings of the IEEE conference on computer vision and pattern recognition

Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: Deep learning on point sets for 3d classification and segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 652–660 (2017)

work page 2017

[36] [36]

Advances in neural information processing systems30(2017)

Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: Deep hierarchical feature learn- ing on point sets in a metric space. Advances in neural information processing systems30(2017)

work page 2017

[37] [37]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Riegler, G., Osman Ulusoy, A., Geiger, A.: Octnet: Learning deep 3d represen- tations at high resolutions. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 3577–3586 (2017)

work page 2017

[38] [38]

In: Proceedings of the IEEE/CVF International Conference on Computer Vision

Robert, D., Raguet, H., Landrieu, L.: Efficient 3d semantic segmentation with su- perpoint transformer. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 17195–17204 (2023)

work page 2023

[39] [39]

The International Journal of Robotics Research37(6), 545–557 (2018)

Roynard, X., Deschaud, J.E., Goulette, F.: Paris-Lille-3D: A large and high-quality ground-truth urban point cloud dataset for automatic segmentation and classifica- tion. The International Journal of Robotics Research37(6), 545–557 (2018)

work page 2018

[40] [40]

In: Proceedings of the International Con- ference on Pattern Recognition Applications and Methods

Serna, A., Marcotegui, B., Goulette, F., Deschaud, J.E.: Paris-rue-Madame database: As 3D mobile laser scanner dataset for benchmarking urban detection, segmentation and classification methods. In: Proceedings of the International Con- ference on Pattern Recognition Applications and Methods. ACM, Angers, France, 6–8 March. pp. 819–824 (2014)

work page 2014

[41] [41]

Springer Science & Business Media (2010)

Szeliski, R.: Computer vision: Algorithms and applications. Springer Science & Business Media (2010)

work page 2010

[42] [42]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Tan, W., Qin, N., Ma, L., Li, Y., Du, J., Cai, G., Yang, K., Li, J.: Toronto-3D: A large-scale mobile LiDAR dataset for semantic segmentation of urban roadways. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 202–203 (2020)

work page 2020

[43] [43]

In: Proceedings of the IEEE/CVF international conference on computer vision

Thomas, H., Qi, C.R., Deschaud, J.E., Marcotegui, B., Goulette, F., Guibas, L.J.: Kpconv: Flexible and deformable convolution for point clouds. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 6411–6420 (2019)

work page 2019

[44] [44]

Computers & Graphics49, 126–133 (2015)

Vallet, B., Brédif, M., Serna, A., Marcotegui, B., Paparoditis, N.: TerraMo- bilita/iQmulus urban point cloud analysis benchmark. Computers & Graphics49, 126–133 (2015)

work page 2015

[45] [45]

Wang,P.S.:Octformer:Octree-basedtransformersfor3dpointclouds.ACMTrans- actions on Graphics (TOG)42(4), 1–11 (2023)

work page 2023

[46] [46]

In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

Wu, X., Jiang, L., Wang, P.S., Liu, Z., Liu, X., Qiao, Y., Ouyang, W., He, T., Zhao, H.: Point transformer v3: Simpler faster stronger. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 4840–4851 (2024)

work page 2024

[47] [47]

Advances in Neural Information Processing Systems35, 33330–33342 (2022)

Wu, X., Lao, Y., Jiang, L., Liu, X., Zhao, H.: Point transformer v2: Grouped vector attention and partition-based pooling. Advances in Neural Information Processing Systems35, 33330–33342 (2022)

work page 2022

[48] [48]

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXL VI- 2/W1-2022, 529–536 (2022)

Wysocki, O., Hoegner, L., Stilla, U.: TUM-FAÇADE: Reviewing and enriching point cloud benchmarks for façade segmentation. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information SciencesXL VI- 2/W1-2022, 529–536 (2022)

work page 2022

[49] [49]

In: Pro- ceedings of the Winter Conference on Applications of Computer Vision (WACV)

Wysocki, O., Tan, Y., Froech, T., Xia, Y., Wysocki, M., Hoegner, L., Cremers, D., Holst, C.: ZAHA: Introducing the Level of Facade Generalization and the Large- Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset. In: Pro- ceedings of the Winter Conference on Applications of Computer Vision (WACV). pp. 7637–7647 (February 2025) UnderOneFacade 19

work page 2025

[50] [50]

IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) pp

Wysocki, O., Xia, Y., Wysocki, M., Grilli, E., Hoegner, L., Cremers, D., Stilla, U.: Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networks. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) pp. 6547–6557 (2023)

work page 2023

[51] [51]

Computational Visual Media11(1), 83–101 (2025)

Yang, Y.Q., Guo, Y.X., Xiong, J.Y., Liu, Y., Pan, H., Wang, P.S., Tong, X., Guo, B.: Swin3d: A pretrained transformer backbone for 3d indoor scene understanding. Computational Visual Media11(1), 83–101 (2025)

work page 2025

[52] [52]

In: Proceedings of the IEEE/CVF International Conference on Computer Vision

Yeshwanth,C.,Liu,Y.C.,Nießner,M.,Dai,A.:ScanNet++:Ahigh-fidelitydataset of 3d indoor scenes. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 12–22 (2023)

work page 2023

[53] [53]

In: Proceed- ings of the IEEE/CVF international conference on computer vision

Zhao, H., Jiang, L., Jia, J., Torr, P.H., Koltun, V.: Point transformer. In: Proceed- ings of the IEEE/CVF international conference on computer vision. pp. 16259– 16268 (2021)

work page 2021

[54] [54]

In: Proceedings of the IEEE/CVF international conference on computer vision

Zhou, H., Feng, Y., Fang, M., Wei, M., Qin, J., Lu, T.: Adaptive graph convolution for point cloud analysis. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 4965–4974 (2021)

work page 2021

[55] [55]

Remote Sensing12(11), 1875 (2020)

Zhu, J., Gehrung, J., Huang, R., Borgmann, B., Sun, Z., Hoegner, L., Hebel, M., Xu, Y., Stilla, U.: TUM-MLS-2016: An annotated mobile LiDAR dataset of the TUM City Campus for semantic point cloud interpretation in urban areas. Remote Sensing12(11), 1875 (2020)

work page 2016