Interpretable model and explicit formula for 3D printed recycled aggregate concrete strength prediction

Tilin Wang; Chao Liu; Chao Zhu; Wenxuan Zhu; Huawei Liu

doi:10.70401/jbde.2026.0035


*Correspondence to: Huawei Liu, School of Civil Engineering, Xi’an University of Architecture and Technology, Xi’an 710055, Shaanxi, China. E-mail: liuhuawei@xauat.edu.cn

J Build Des Environ. 2026;4:2025140. 10.70401/jbde.2026.0035

Received: December 18, 2025; Accepted: March 12, 2026; Published: March 17, 2026

This article belongs to the Special Issue Advances in Low-Carbon Emission-Reduction Materials for Sustainable Buildings

This manuscript is made available in its unedited form to allow early access to the reported findings. Further editing will be completed before final publication. As such, the content may include errors, and standard legal disclaimers are applicable.

Abstract

The construction industry is undergoing a low-carbon and digital transformation. Recycled aggregates are accordingly being introduced into 3D printed concrete, combining the advantages of solid waste recycling and automated manufacturing. However, mix design optimization and mechanical property prediction for 3D printed recycled aggregate concrete (3DPRAC) face two main challenges: the micro-defects inherent in recycled aggregates and the anisotropy caused by layered deposition. This study proposes a gene expression programming (GEP)-based computational framework for predicting the splitting tensile strength (STS) of 3DPRAC. A dataset of 110 data points was generated, with layer height and loading direction as key printing parameters representing the anisotropic effect. Among the models compared, GEP performed best, with an R² of 0.914 on the testing set. Furthermore, the GEP model provides explicit mathematical equations that delineate the contributions of individual input variables and reveal the nonlinear effects of fiber content and printing parameters on STS.

Keywords

3D concrete printing, gene expression programming, recycled aggregate, splitting tensile strength, explicit mathematical formula, anisotropy

1. Introduction

The introduction of recycled aggregates into 3D concrete printing advances sustainable and automated construction by reusing waste materials and reducing both natural resource consumption and construction waste. However, it poses challenges for both the material and the process^[1-3]. At the microscale, residual mortar adhering to the recycled aggregates’ surfaces increases porosity and weakens the interfacial transition zones. At the macroscale, the layer-by-layer deposition process creates numerous inter-layer interfaces, resulting in pronounced anisotropic behavior^[4]. Together, these effects make the splitting tensile strength (STS) of 3D printed recycled aggregate concrete (3DPRAC) difficult to predict with conventional empirical or experimental methods.

Accurate prediction of mechanical performance in 3D printed concrete requires materials that satisfy specific rheological criteria for printability^[5]. This study therefore focuses on mixtures with validated printability when predicting hardened mechanical properties. Nevertheless, conventional empirical models and isolated experimental approaches remain inadequate for accurate prediction, even within this rheologically qualified range. This inadequacy arises from the complex nonlinear coupling between material parameters, such as the water-to-binder (w/b) ratio and recycled aggregate replacement ratio (RCA-r), and process-related variables, including layer height and loading direction. Consequently, advanced data-driven approaches capable of capturing nonlinear relationships and providing interpretable predictions are essential to comprehensively evaluate the combined effects of material constituents and printing process parameters on the mechanical properties.

Data-driven methods are increasingly used to handle such complexity while avoiding the prohibitive cost of iterative physical experiments^[6-11]. Techniques from the broader soft computing domain have proven effective at reducing experimental workload while achieving accurate results^[12,13]. Specifically, algorithms such as Convolutional Neural Networks^[14,15] and gene expression programming (GEP)^[16,17] have been employed for modeling recycled aggregate concrete (RAC). Researchers have successfully deployed diverse models for STS prediction, including Artificial Neural Networks^[16,18-23], Support Vector Machines^[24], and ensemble methods such as random forest (RF)^[25,26] and XGBoost^[21,27]. Furthermore, recent work has emphasized the importance of integrating automatic hyperparameter optimization for robust model development, as this mechanism eliminates manual tuning bias and accurately identifies globally optimal configurations^[28,29].

Despite the immense potential of machine learning, three critical limitations hinder its application to the complex 3DPRAC system. First, current models are notably insensitive to printing process parameters. They focus primarily on mapping material composition while neglecting the unique characteristics of 3D concrete printing. This oversight prevents a quantitative description of the significant anisotropy induced by variables such as layer height and loading direction. Second, existing studies lack generalization capability across diverse material systems, as they are typically confined to a single material domain. Consequently, these models fail to discern universal evolutionary patterns when confronted with heterogeneous datasets comprising RAC, fiber-reinforced recycled aggregate concrete (FR-RAC), and 3DPRAC, resulting in a significant reduction in prediction accuracy. Third, the “black-box” nature of many algorithms is a major obstacle to their uptake. Although deep neural networks achieve acceptable accuracy, the lack of a physical or mathematical expression leads to poor interpretability, thereby severely restricting trust and widespread adoption in engineering practice.

To address these challenges, this study develops a prediction model for the STS of 3DPRAC that integrates mixture proportions and printing process parameters. GEP is employed as the modeling technique due to its capability to automatically generate explicit mathematical expressions through symbolic regression, offering both transparency and direct applicability in engineering practice. The proposed model incorporates printing layer height and loading direction as key input variables, combined with feature engineering, to enhance interpretability and reveal the underlying mechanisms governing the mechanical behavior of 3DPRAC.
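
To illustrate how GEP yields an explicit formula, the following minimal sketch decodes a linear gene into an expression tree using the breadth-first (Karva) reading order that is standard in GEP. The single-letter symbols (Q for square root, L for natural logarithm, a to d for input variables) and the example gene are hypothetical and are not taken from the fitted model in this study.

```python
import math
from collections import deque

# Hypothetical single-character encoding: Q = sqrt, L = ln.
ARITY = {"+": 2, "-": 2, "*": 2, "/": 2, "Q": 1, "L": 1}
OPS = {
    "+": lambda x, y: x + y,
    "-": lambda x, y: x - y,
    "*": lambda x, y: x * y,
    "/": lambda x, y: x / y,
    "Q": math.sqrt,
    "L": math.log,
}


def decode(gene):
    """Decode a K-expression (breadth-first order) into a nested list tree.

    Each node is [symbol, child, child, ...]. Symbols left over after the
    tree is complete form the non-coding region and are simply ignored.
    """
    symbols = iter(gene)
    root = [next(symbols)]
    queue = deque([root])
    while queue:
        node = queue.popleft()
        # Terminals (not in ARITY) take zero children.
        for _ in range(ARITY.get(node[0], 0)):
            child = [next(symbols)]
            node.append(child)
            queue.append(child)
    return root


def to_formula(node):
    """Render the tree as a human-readable infix string."""
    sym, kids = node[0], node[1:]
    if not kids:
        return sym
    if len(kids) == 1:
        name = {"Q": "sqrt", "L": "ln"}[sym]
        return f"{name}({to_formula(kids[0])})"
    return f"({to_formula(kids[0])} {sym} {to_formula(kids[1])})"


def evaluate(node, env):
    """Evaluate the tree for the variable values given in env."""
    sym, kids = node[0], node[1:]
    if not kids:
        return env[sym]
    return OPS[sym](*(evaluate(k, env) for k in kids))


# Hypothetical gene: the first seven symbols code for the tree, the rest is tail.
tree = decode("*+/abcdaab")
formula = to_formula(tree)
value = evaluate(tree, {"a": 1.0, "b": 2.0, "c": 6.0, "d": 3.0})
print(formula, "=", value)
```

In a multigenic model, each gene would be decoded in this way and the sub-expressions combined by the linking function, which is what makes the final predictor readable as a closed-form equation.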

The rest of this paper is structured as follows (as depicted in Figure 1). Section 2 describes the dataset collection and preprocessing, and the GEP model development. Section 3 covers the model performance evaluation and the sensitivity analysis of key parameters. Finally, Section 4 concludes the study.

Items	Unit	Notation	Mean	Min	Max	STD	Kurtosis	Skewness
Water-to-binder ratio	-	w/b	0.427	0.300	0.651	0.125	-1.494	0.227
RCA replacement ratio	%	RCA-r	53.042	0.000	100.000	37.697	-1.311	-0.037
Fiber volume	%	FV	0.245	0.000	5.000	0.558	48.306	5.984
Layer height	mm	LH	5.890	0.000	18.000	8.486	-1.469	0.747
Loading Direction	-	LD	0.350	0.000	1.000	0.470	-1.620	0.650
Age	d	Age	16.400	3.000	28.000	10.200	-1.250	0.150
Splitting Tensile Strength	MPa	STS	3.095	1.210	6.720	1.063	0.479	0.653
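
The summary statistics in the table above can be reproduced for any column with a few lines of code. The sketch below is an assumption about the conventions used: it takes the sample standard deviation (n - 1 denominator) and moment-based skewness and excess kurtosis (normal distribution = 0), which is consistent with the negative kurtosis values reported but is not confirmed by the source. The input values are hypothetical, not the study's data.

```python
import math


def describe(values):
    """Return mean, min, max, sample STD, excess kurtosis, and skewness."""
    n = len(values)
    mean = sum(values) / n
    # Central moments with the population (1/n) normalisation.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return {
        "mean": mean,
        "min": min(values),
        "max": max(values),
        "std": math.sqrt(m2 * n / (n - 1)),   # sample standard deviation
        "kurtosis": m4 / m2 ** 2 - 3.0,       # excess kurtosis
        "skewness": m3 / m2 ** 1.5,
    }


# Hypothetical column of values, used only to exercise the function.
stats = describe([1.0, 2.0, 3.0, 4.0, 5.0])
for key, val in stats.items():
    print(f"{key:>9s}: {val:.3f}")
```

Read this way, the large positive kurtosis and skewness of FV in the table point to a distribution concentrated near zero with a few high-fiber mixtures in the right tail.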

		Inputs
		w/b	RCA-r	FV	LH
Collinearity Statistics	VIF	3.549	3.214	1.317	1.219
Collinearity Statistics	Tolerance	0.282	0.311	0.759	0.819
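
The two rows of the collinearity table are linked by the standard identity Tolerance = 1 / VIF = 1 - R_j², where R_j² is the coefficient of determination obtained when input j is regressed on the remaining inputs. The short check below uses only the values reported in the table; the helper names are illustrative.

```python
# VIF and tolerance values as reported in the collinearity table.
vif = {"w/b": 3.549, "RCA-r": 3.214, "FV": 1.317, "LH": 1.219}
reported_tolerance = {"w/b": 0.282, "RCA-r": 0.311, "FV": 0.759, "LH": 0.819}


def tolerance(v):
    """Tolerance is the reciprocal of the variance inflation factor."""
    return 1.0 / v


def auxiliary_r2(v):
    """Share of an input's variance explained by the other inputs."""
    return 1.0 - 1.0 / v


for name, v in vif.items():
    print(
        f"{name:6s} VIF={v:.3f}  tolerance={tolerance(v):.3f} "
        f"(reported {reported_tolerance[name]:.3f})  R_j^2={auxiliary_r2(v):.3f}"
    )
```

The recomputed tolerances agree with the reported ones to within 0.001; the small difference for LH is presumably because the tabulated VIF is itself rounded. All four VIF values lie below the commonly used rule-of-thumb limits of 5 or 10, although the source does not state which threshold was applied.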

Feature	CGEP1	CGEP2
Function Set	+, -, *, /	+, -, *, /, Sqrt, Ln
Linking Function	Addition	Addition
Number of Genes	4	5
Chromosomes	60	50
Head Size	12	12
Mutation Rate	0.00138	0.04000
Crossover Rate	0.2	0.3
Fitness Function	RMSE	RMSE
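
The head size in the table determines the gene length through the standard GEP relation tail = head × (n_max - 1) + 1, where n_max is the largest arity in the function set. The sketch below assumes this standard structure with no additional constant domain, which the source does not confirm; both function sets have a maximum arity of 2 because Sqrt and Ln are unary.

```python
def gene_length(head_size, max_arity):
    """Total gene length for a standard GEP gene (head + tail)."""
    tail_size = head_size * (max_arity - 1) + 1
    return head_size + tail_size


# Settings copied from the parameter table; max_arity is inferred from the function sets.
configs = {
    "CGEP1": {"genes": 4, "head": 12, "max_arity": 2},
    "CGEP2": {"genes": 5, "head": 12, "max_arity": 2},
}

chromosome_length = {}
for name, cfg in configs.items():
    length = gene_length(cfg["head"], cfg["max_arity"])
    chromosome_length[name] = cfg["genes"] * length
    print(f"{name}: gene length = {length}, chromosome length = {chromosome_length[name]}")
```

Under these assumptions each gene has 25 positions, so CGEP2's extra gene and richer function set give it a larger search space than CGEP1 at the same head size.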

Models	Datasets	R²	RMSE (MPa)	MAE (MPa)
CGEP1	Training sets	0.758	0.512	0.408
CGEP1	Testing sets	0.545	0.752	0.588
CGEP2	Training sets	0.895	0.339	0.267
CGEP2	Testing sets	0.914	0.312	0.244
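
For reference, the three metrics in the table can be computed as follows. This is a minimal sketch that assumes R² denotes the coefficient of determination (1 - SS_res / SS_tot); the source does not state its exact definition. The observed and predicted strengths are hypothetical values chosen only to make the arithmetic easy to follow.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error, in the units of the target (MPa here)."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def mae(y_true, y_pred):
    """Mean absolute error, in the units of the target (MPa here)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical observed and predicted STS values (MPa).
observed = [2.0, 3.0, 4.0, 5.0]
predicted = [2.5, 3.0, 3.5, 5.0]

print(f"R2   = {r2(observed, predicted):.3f}")
print(f"RMSE = {rmse(observed, predicted):.3f} MPa")
print(f"MAE  = {mae(observed, predicted):.3f} MPa")
```

Comparing these metrics between the training and testing rows of a model is a quick check on overfitting: CGEP1's R² drops from 0.758 to 0.545, whereas CGEP2's values remain close (0.895 and 0.914).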

Model	Training R²	Testing R²	Testing RMSE (MPa)	Testing MAE (MPa)
MLR	0.682	0.647	0.623	0.498
RF	0.967	0.883	0.359	0.281
CGEP2	0.895	0.914	0.312	0.244

Input Feature	MAE (Permuted)	PFI	Rank
w/b	0.480	0.250	1
RCA-r	0.410	0.180	2
LD	0.350	0.120	3
LH	0.310	0.080	4
FV	0.280	0.050	5
Material Type	0.250	0.020	6
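
In the table above, every PFI value equals the permuted MAE minus 0.230, which suggests (as an inference from the numbers, not a statement in the source) that PFI is defined as the increase in MAE over a baseline of 0.230. The sketch below first checks that arithmetic and then shows a generic permutation feature importance routine under that assumed definition. The toy model and data are hypothetical stand-ins, not the GEP model.

```python
import random

# (MAE after permutation, PFI) pairs copied from the table.
table = {
    "w/b": (0.480, 0.250), "RCA-r": (0.410, 0.180), "LD": (0.350, 0.120),
    "LH": (0.310, 0.080), "FV": (0.280, 0.050), "Material Type": (0.250, 0.020),
}
implied_baselines = {name: round(m - p, 3) for name, (m, p) in table.items()}
print("Implied baseline MAE per row:", implied_baselines)


def mae(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def permutation_importance(predict, X, y, n_repeats=10, seed=0):
    """Mean increase in MAE when each column of X is shuffled in turn."""
    rng = random.Random(seed)
    baseline = mae(y, [predict(row) for row in X])
    scores = []
    for j in range(len(X[0])):
        increases = []
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)  # break the link between feature j and y
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            increases.append(mae(y, [predict(r) for r in X_perm]) - baseline)
        scores.append(sum(increases) / n_repeats)
    return baseline, scores


# Hypothetical toy problem: the target depends on feature 0 only.
X = [[float(i), float(i % 3)] for i in range(8)]
y = [2.0 * row[0] for row in X]
baseline, scores = permutation_importance(lambda row: 2.0 * row[0], X, y)
print("baseline MAE:", baseline, "PFI per feature:", scores)
```

A feature the model ignores receives a PFI of exactly zero, while shuffling an influential feature raises the error; the ranking in the table follows the same logic.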

Variables	x_w/b	x_RCA-r	x_FV	x_LH	x_LD	Age
Recommended Range	0.30-0.65	0-100	0-5	0-18	0, 1	1-28
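
Because an empirical formula should not be extrapolated, a simple guard can flag inputs that fall outside the recommended ranges before a prediction is made. The sketch below encodes the ranges from the table; the function name and dictionary layout are hypothetical, and the units (% for RCA-r and FV, mm for LH, days for Age) are taken from the descriptive statistics table.

```python
# Recommended ranges copied from the table (inclusive bounds).
RANGES = {
    "w/b": (0.30, 0.65),
    "RCA-r": (0.0, 100.0),   # %
    "FV": (0.0, 5.0),        # %
    "LH": (0.0, 18.0),       # mm
    "Age": (1.0, 28.0),      # days
}
LD_LEVELS = {0, 1}           # loading direction is a two-level code


def out_of_range(mix):
    """Return the names of inputs that lie outside the recommended ranges."""
    problems = [
        name for name, (low, high) in RANGES.items()
        if not low <= mix[name] <= high
    ]
    if mix["LD"] not in LD_LEVELS:
        problems.append("LD")
    return problems


# Hypothetical mixtures used only to exercise the check.
valid_mix = {"w/b": 0.35, "RCA-r": 50.0, "FV": 0.5, "LH": 10.0, "LD": 1, "Age": 28.0}
invalid_mix = {"w/b": 0.70, "RCA-r": 50.0, "FV": 0.5, "LH": 10.0, "LD": 2, "Age": 28.0}

print(out_of_range(valid_mix))
print(out_of_range(invalid_mix))
```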

Journal of Building Design and Environment
