1
|
Yang L, Liu K, Ou R, Qian P, Wu Y, Tian Z, Zhu C, Feng S, Yang F. Surface Defect-Extended BIM Generation Leveraging UAV Images and Deep Learning. SENSORS (BASEL, SWITZERLAND) 2024; 24:4151. [PMID: 39000929 PMCID: PMC11243814 DOI: 10.3390/s24134151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 06/14/2024] [Accepted: 06/24/2024] [Indexed: 07/16/2024]
Abstract
Defect inspection of existing buildings is receiving increasing attention for digitalization transfer in the construction industry. The development of drone technology and artificial intelligence has provided powerful tools for defect inspection of buildings. However, integrating defect inspection information detected from UAV images into semantically rich building information modeling (BIM) is still challenging work due to the low defect detection accuracy and the coordinate difference between UAV images and BIM models. In this paper, a deep learning-based method coupled with transfer learning is used to detect defects accurately; and a texture mapping-based defect parameter extraction method is proposed to achieve the mapping from the image U-V coordinate system to the BIM project coordinate system. The defects are projected onto the surface of the BIM model to enrich a surface defect-extended BIM (SDE-BIM). The proposed method was validated in a defect information modeling experiment involving the No. 36 teaching building of Nantong University. The results demonstrate that the methods are widely applicable to various building inspection tasks.
Collapse
Affiliation(s)
- Lei Yang
- Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, Shenzhen 518034, China;
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China; (Y.W.); (Z.T.)
| | - Keju Liu
- Nantong Key Laboratory of Spatial Information Technology R&D and Application, Nantong University, Nantong 226019, China; (K.L.); (R.O.)
| | - Ruisi Ou
- Nantong Key Laboratory of Spatial Information Technology R&D and Application, Nantong University, Nantong 226019, China; (K.L.); (R.O.)
| | - Peng Qian
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China; (Y.W.); (Z.T.)
- College of Geographic Science, Nantong University, Nantong 226019, China; (C.Z.); (S.F.)
| | - Yunjie Wu
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China; (Y.W.); (Z.T.)
| | - Zhuang Tian
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China; (Y.W.); (Z.T.)
| | - Changping Zhu
- College of Geographic Science, Nantong University, Nantong 226019, China; (C.Z.); (S.F.)
| | - Sining Feng
- College of Geographic Science, Nantong University, Nantong 226019, China; (C.Z.); (S.F.)
| | - Fan Yang
- Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, Shenzhen 518034, China;
- Nantong Key Laboratory of Spatial Information Technology R&D and Application, Nantong University, Nantong 226019, China; (K.L.); (R.O.)
- College of Geographic Science, Nantong University, Nantong 226019, China; (C.Z.); (S.F.)
| |
Collapse
|
2
|
Kaiser S, Linkiewicz M, Meißner H, Baumbach D. 3D Visual Reconstruction as Prior Information for First Responder Localization and Visualization. SENSORS (BASEL, SWITZERLAND) 2023; 23:7785. [PMID: 37765842 PMCID: PMC10536287 DOI: 10.3390/s23187785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 08/31/2023] [Accepted: 09/01/2023] [Indexed: 09/29/2023]
Abstract
In professional use cases like police or fire brigade missions, coordinated and systematic force management is crucial for achieving operational success during intervention by the emergency personnel. A real-time situation picture enhances the coordination of the team. This situation picture includes not only an overview of the environment but also the positions, i.e., localization, of the emergency forces. The overview of the environment can be obtained either from known situation pictures like floorplans or by scanning the environment with the aid of visual sensors. The self-localization problem can be solved outdoors using the Global Navigation Satellite System (GNSS), but it is not fully solved indoors, where the GNSS signal might not be received or might be degraded. In this paper, we propose a novel combination of an inertial localization technique based on simultaneous localization and mapping (SLAM) with 3D building scans, which are used as prior information, for geo-referencing the positions, obtaining a situation picture, and finally visualizing the results with an appropriate visualization tool. We developed a new method for converting point clouds into a hexagonal prism map specifically designed for our SLAM algorithm. With this combination, we could keep the equipment for first responders as lightweight as required. We showed that the positioning led to an average accuracy of less than 1m indoors, and the final visualization including the building layout obtained by the 3D building reconstruction will be advantageous for coordinating first responder operations.
Collapse
Affiliation(s)
- Susanna Kaiser
- German Aerospace Center (DLR), Institute of Communications and Navigation, Oberpfaffenhofen, D-82234 Wessling, Germany
| | - Magdalena Linkiewicz
- German Aerospace Center (DLR), Institute of Optical Sensor Systems, D-12489 Berlin, Germany
| | - Henry Meißner
- German Aerospace Center (DLR), Institute of Optical Sensor Systems, D-12489 Berlin, Germany
| | - Dirk Baumbach
- German Aerospace Center (DLR), Institute of Optical Sensor Systems, D-12489 Berlin, Germany
| |
Collapse
|
3
|
Yang L, Zhang F, Yang F, Qian P, Wang Q, Wu Y, Wang K. Generating Topologically Consistent BIM Models of Utility Tunnels from Point Clouds. SENSORS (BASEL, SWITZERLAND) 2023; 23:6503. [PMID: 37514796 PMCID: PMC10384953 DOI: 10.3390/s23146503] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 07/14/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023]
Abstract
The development and utilization of urban underground space is an important way to solve the "great urban disease". As one of the most important types of urban underground foundations, utility tunnels have become increasingly popular in municipal construction. The investigation of utility tunnels is a general task and three-dimensional laser scanning technology has played a significant role in surveying and data acquisition. However, three-dimensional laser scanning technology suffers from noise and occlusion in narrow congested utility tunnel spaces, and the acquired point clouds are imperfect; hence, errors and redundancies are introduced in the extracted geometric elements. The topology of reconstructed BIM objects cannot be ensured. Therefore, in this study, a hierarchical segmentation method for point clouds and a topology reconstruction method for building information model (BIM) objects in utility tunnels are proposed. The point cloud is segmented into facades, planes, and pipelines hierarchically. An improved mean-shift algorithm is proposed to extract wall line features and a local symmetry-based medial axis extraction algorithm is proposed to extract pipelines from point clouds. A topology reconstruction method that searches for the neighbor information of wall and pipeline centerlines and establishes collinear, perpendicular, and intersecting situations is used to reconstruct a topologically consistent 3D model of a utility tunnel. An experiment on the Guangzhou's Nansha District dataset successfully reconstructed 24 BIM wall objects and 12 pipelines within the utility tunnel, verifying the efficiency of the method.
Collapse
Affiliation(s)
- Lei Yang
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China
| | - Fangshuo Zhang
- School of Geographical Sciences, Nantong University, Nantong 226019, China
| | - Fan Yang
- School of Geographical Sciences, Nantong University, Nantong 226019, China
- Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, Shenzhen 518034, China
- Key Laboratory of Spatial Information Technology R&D and Application, College of Geographic Science, Nantong University, Nantong 226019, China
| | - Peng Qian
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China
- School of Geographical Sciences, Nantong University, Nantong 226019, China
| | - Quankai Wang
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China
| | - Yunjie Wu
- School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China
| | - Keli Wang
- School of Geographical Sciences, Nantong University, Nantong 226019, China
| |
Collapse
|
4
|
Abdollahi A, Arefi H, Malihi S, Maboudi M. Progressive Model-Driven Approach for 3D Modeling of Indoor Spaces. SENSORS (BASEL, SWITZERLAND) 2023; 23:5934. [PMID: 37447783 DOI: 10.3390/s23135934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 06/09/2023] [Accepted: 06/19/2023] [Indexed: 07/15/2023]
Abstract
This paper focuses on the 3D modeling of the interior spaces of buildings. Three-dimensional point clouds from laser scanners can be considered the most widely used data for 3D indoor modeling. Therefore, the walls, ceiling and floor are extracted as the main structural fabric and reconstructed. In this paper, a method is presented to tackle the problems related to the data including obstruction, clutter and noise. This method reconstructs indoor space in a model-driven approach using watertight predefined models. Employing the two-step implementation of this process, the algorithm is able to model non-rectangular spaces with an even number of sides. Afterwards, an "improvement" process increases the level of details by modeling the intrusion and protrusion of the model. The 3D model is formed by extrusion from 2D to 3D. The proposed model-driven algorithm is evaluated with four benchmark real-world datasets. The efficacy of the proposed method is proved by the range of [77%, 95%], [85%, 97%] and [1.7 cm, 2.4 cm] values of completeness, correctness and geometric accuracy, respectively.
Collapse
Affiliation(s)
- Ali Abdollahi
- School of Engineering, Faculty of Surveying and Geospatial Engineering, University of Tehran, Tehran 1417614411, Iran
| | - Hossein Arefi
- School of Engineering, Faculty of Surveying and Geospatial Engineering, University of Tehran, Tehran 1417614411, Iran
- Department of Geoinformatics and Surveying, School of Engineering, Mainz University of Applied Sciences, 55128 Mainz, Germany
| | - Shirin Malihi
- School of Engineering, University of Edinburgh, Edinburgh EH9 3JL, UK
| | - Mehdi Maboudi
- Institute of Geodesy and Photogrammetry, Technische Universität Braunschweig, 38106 Braunschweig, Germany
| |
Collapse
|
5
|
Agarwal S, Maity S, Barua HB, Bhowmick B. Robo-vision! 3D mesh generation of a scene for a robot for planar and non-planar complex objects. MULTIMEDIA TOOLS AND APPLICATIONS 2023:1-19. [PMID: 37362698 PMCID: PMC10107568 DOI: 10.1007/s11042-023-15111-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 02/06/2023] [Accepted: 03/12/2023] [Indexed: 06/28/2023]
Abstract
This paper presents a novel architecture to generate a world model in terms of mesh from a continuous image stream with depth information extracted from a robot's ego-vision camera. We propose two algorithms for planar and non-planar mesh generation. A Cartesian grid-based mesh fitting algorithm is proposed for mesh generation of planar objects. For mesh generation of non-planar objects, we propose a Self Organization Map based algorithm. The proposed algorithm better approaches the boundary and overall shape of the objects compared to State-Of-the-Art (SOA). Extensive experiments done on three public datasets show that our method surpasses SOA both qualitatively and quantitatively.
Collapse
Affiliation(s)
- Swapna Agarwal
- Robotics & Autonomous Systems group, TCS Research, Kolkata, India
| | - Soumyadip Maity
- Robotics & Autonomous Systems group, TCS Research, Kolkata, India
| | | | | |
Collapse
|
6
|
Modeling and Optimization of the Para-Xylene Continuous Suspension Crystallization Separation Process via a Morphology Technique and a Multi-Dimensional Population Balance Equation. Processes (Basel) 2023. [DOI: 10.3390/pr11030770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023] Open
Abstract
In this study, we carried out a para-xylene crystallization experiment at constant temperature and concentration levels. Throughout the process, the kinetics of nucleation, growth, breakage, and aggregation of para-xylene particles were measured and built using a morphological approach. An additional a three-stage continuous suspension crystallization separation experiment was carried out, the process for which was simulated using the population balance model based on correlated kinetic equations. The population balance equation was solved using an extended moment of classes algorithm, and the solving process was implemented in MATLAB. In this case, the predicted particle size distribution of the products matched well with the experiment. In order to provide references for the optimization of the industrial para-xylene crystallization process, a three-stage suspension crystallization separation experiment was designed and conducted, in which each crystallizer had a distinct operating temperature and mean residence time. The effects of operating parameters on the final product were investigated further. The proposed models and algorithms can also be applied in other cases and provide an alternative approach for optimizing continuous crystallization processes.
Collapse
|
7
|
Eyvazpour R, Shoaran M, Karimian G. Hardware implementation of SLAM algorithms: a survey on implementation approaches and platforms. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10310-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
8
|
Robot Path Planning Method Based on Indoor Spacetime Grid Model. REMOTE SENSING 2022. [DOI: 10.3390/rs14102357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In the context of digital twins, smart city construction and artificial intelligence technology are developing rapidly, and more and more mobile robots are performing tasks in complex and time-varying indoor environments, making, at present, the unification of modeling, dynamic expression, visualization of operation, and wide application between robots and indoor environments a pressing problem to be solved. This paper presents an in-depth study on this issue and summarizes three major types of methods: geometric modeling, topological modeling, and raster modeling, and points out the advantages and disadvantages of these three types of methods. Therefore, in view of the current pain points of robots and complex time-varying indoor environments, this paper proposes an indoor spacetime grid model based on the three-dimensional division framework of the Earth space and innovatively integrates time division on the basis of space division. On the basis of the model, a dynamic path planning algorithm for the robot in the complex time-varying indoor environment is designed, that is, the Spacetime-A* algorithm (STA* for short). Finally, the indoor spacetime grid modeling experiment is carried out with real data, which verifies the feasibility and correctness of the spacetime relationship calculation algorithm encoded by the indoor spacetime grid model. Then, experiments are carried out on the multi-group path planning algorithms of the robot under the spacetime grid, and the feasibility of the STA* algorithm under the indoor spacetime grid and the superiority of the spacetime grid are verified.
Collapse
|
9
|
Topologically Consistent Reconstruction for Complex Indoor Structures from Point Clouds. REMOTE SENSING 2021. [DOI: 10.3390/rs13193844] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Indoor structures are composed of ceilings, walls and floors that need to be modeled for a variety of applications. This paper proposes an approach to reconstructing models of indoor structures in complex environments. First, semantic pre-processing, including segmentation and occlusion construction, is applied to segment the input point clouds to generate semantic patches of structural primitives with uniform density. Then, a primitives extraction method with detected boundary is introduced to approximate both the mathematical surface and the boundary of the patches. Finally, a constraint-based model reconstruction is applied to achieve the final topologically consistent structural model. Under this framework, both the geometric and structural constraints are considered in a holistic manner to assure topologic regularity. Experiments were carried out with both synthetic and real-world datasets. The accuracy of the proposed method achieved an overall reconstruction quality of approximately 4.60 cm of root mean square error (RMSE) and 94.10% Intersection over Union (IoU) of the input point cloud. The development can be applied for structural reconstruction of various complex indoor environments.
Collapse
|
10
|
Survey Solutions for 3D Acquisition and Representation of Artificial and Natural Caves. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11146482] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
A three-dimensional survey of natural caves is often a difficult task due to the roughness of the investigated area and the problems of accessibility. Traditional adopted techniques allow a simplified acquisition of the topography of caves characterized by an oversimplification of the geometry. Nowadays, the advent of LiDAR and Structure from Motion applications eased three-dimensional surveys in different environments. In this paper, we present a comparison between other three-dimensional survey systems, namely a Terrestrial Laser Scanner, a SLAM-based portable instrument, and a commercial photo camera, to test their possible deployment in natural caves survey. We presented a comparative test carried out in a tunnel stretch to calibrate the instrumentation on a benchmark site. The choice of the site is motivated by its regular geometry and easy accessibility. According to the result obtained in the calibration site, we presented a methodology, based on the Structure from Motion approach that resulted in the best compromise among accuracy, feasibility, and cost-effectiveness, that could be adopted for the three-dimensional survey of complex natural caves using a sequence of images and the structure from motion algorithm. The methods consider two different approaches to obtain a low resolution complete three-dimensional model of the cave and ultra-detailed models of most peculiar cave morphological elements. The proposed system was tested in the Gazzano Cave (Piemonte region, Northwestern Italy). The obtained result is a three-dimensional model of the cave at low resolution due to the site’s extension and the remarkable amount of data. Additionally, a peculiar speleothem, i.e., a stalagmite, in the cave was surveyed at high resolution to test the proposed high-resolution approach on a single object. The benchmark and the cave trials allowed a better definition of the instrumentation choice for underground surveys regarding accuracy and feasibility.
Collapse
|
11
|
Gómez JL, Villalonga G, López AM. Co-Training for Deep Object Detection: Comparing Single-Modal and Multi-Modal Approaches. SENSORS 2021; 21:s21093185. [PMID: 34064323 PMCID: PMC8125436 DOI: 10.3390/s21093185] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 04/22/2021] [Accepted: 04/28/2021] [Indexed: 11/19/2022]
Abstract
Top-performing computer vision models are powered by convolutional neural networks (CNNs). Training an accurate CNN highly depends on both the raw sensor data and their associated ground truth (GT). Collecting such GT is usually done through human labeling, which is time-consuming and does not scale as we wish. This data-labeling bottleneck may be intensified due to domain shifts among image sensors, which could force per-sensor data labeling. In this paper, we focus on the use of co-training, a semi-supervised learning (SSL) method, for obtaining self-labeled object bounding boxes (BBs), i.e., the GT to train deep object detectors. In particular, we assess the goodness of multi-modal co-training by relying on two different views of an image, namely, appearance (RGB) and estimated depth (D). Moreover, we compare appearance-based single-modal co-training with multi-modal. Our results suggest that in a standard SSL setting (no domain shift, a few human-labeled data) and under virtual-to-real domain shift (many virtual-world labeled data, no human-labeled data) multi-modal co-training outperforms single-modal. In the latter case, by performing GAN-based domain translation both co-training modalities are on par, at least when using an off-the-shelf depth estimation model not specifically trained on the translated images.
Collapse
Affiliation(s)
- Jose L. Gómez
- Computer Vision Center (CVC), Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Spain; (G.V.); (A.M.L.)
- Computer Science Department, Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Spain
- Correspondence:
| | - Gabriel Villalonga
- Computer Vision Center (CVC), Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Spain; (G.V.); (A.M.L.)
| | - Antonio M. López
- Computer Vision Center (CVC), Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Spain; (G.V.); (A.M.L.)
- Computer Science Department, Universitat Autònoma de Barcelona (UAB), 08193 Bellaterra, Spain
| |
Collapse
|
12
|
A Comparison of Low-Cost Sensor Systems in Automatic Cloud-Based Indoor 3D Modeling. REMOTE SENSING 2020. [DOI: 10.3390/rs12162624] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The automated 3D modeling of indoor spaces is a rapidly advancing field, in which recent developments have made the modeling process more accessible to consumers by lowering the cost of instruments and offering a highly automated service for 3D model creation. We compared the performance of three low-cost sensor systems; one RGB-D camera, one low-end terrestrial laser scanner (TLS), and one panoramic camera, using a cloud-based processing service to automatically create mesh models and point clouds, evaluating the accuracy of the results against a reference point cloud from a higher-end TLS. While adequately accurate results could be obtained with all three sensor systems, the TLS performed the best both in terms of reconstructing the overall room geometry and smaller details, with the panoramic camera clearly trailing the other systems and the RGB-D offering a middle ground in terms of both cost and quality. The results demonstrate the attractiveness of fully automatic cloud-based indoor 3D modeling for low-cost sensor systems, with the latter providing better model accuracy and completeness, and with all systems offering a rapid rate of data acquisition through an easy-to-use interface.
Collapse
|