Graph learning
Train models on nodes and edges that capture BIM relationships such as contains, bounded_by, hosted_by and supported_by.
Large-scale synthetic IFC building graphs for machine learning
Buildata is a large-scale synthetic dataset of IFC building graphs for machine learning. Instead of focusing on 3D geometry as the primary unit, Buildata organizes each sample as a building graph with BIM elements, relationships, attributes and metadata.
building graph dataset
Each building contains BIM elements represented as IFC entities such as walls, doors, windows, slabs, spaces and structural components.
Relationships describe how building elements connect, contain, host, support or bound one another inside the graph.
Attributes, property sets and metadata make the graph useful for classification, reasoning, prediction and benchmarking workflows.
what buildata contains
Buildata is designed for AI systems that need to learn BIM semantic structures. Each sample represents a complete building organized as a graph, with entities, relationships, hierarchy and structured attributes aligned with IFC logic.
Train models on nodes and edges that capture BIM relationships such as contains, bounded_by, hosted_by and supported_by.
Learn to identify IFC entities, predefined types and semantic patterns across complete buildings.
Use features and property sets to test metadata completion, validation and BIM reasoning workflows.
featured datasets
technology
Buildata organizes each building sample into graph-ready files that separate metadata, nodes, edges, hierarchy and schema information for reproducibility.
Describes the building sample, typology, schema, generator profile, counts and reproducibility settings.
Represents BIM elements as IFC nodes with attributes, features and selected property sets.
Captures semantic relationships that convert the building into a BIM graph for machine learning.
Preserves the IFC hierarchy from project and site down to building, storey and space levels.
explore buildata
Download the first sample, inspect its graph structure and evaluate how IFC semantic data can support machine learning workflows.