RFAE: Autoencoding Random Forests

Autoencoding Random Forests ('RFAE') provide a method to autoencode mixed-type tabular data using Random Forests ('RF'), which involves projecting the data to a latent feature space of user-chosen dimensionality (usually a lower dimension), and then decoding the latent representations back into the input space. The encoding stage is useful for feature engineering and data visualisation tasks, akin to how principal component analysis ('PCA') is used, and the decoding stage is useful for compression and denoising tasks. At its core, 'RFAE' is a post-processing pipeline on a trained random forest model. This means that it can accept any trained RF of 'ranger' object type: 'RF', 'URF' or 'ARF'. Because of this, it inherits Random Forests' robust performance and capacity to seamlessly handle mixed-type tabular data. For more details, see Vu et al. (2025) <doi:10.48550/arXiv.2505.21441>.

Version: 0.1.0
Depends: R (≥ 4.4.0)
Imports: caret, data.table, foreach, Matrix, methods, mgcv, ranger, RANN, RSpectra, stats, tibble
Suggests: arf, ggplot2, knitr, rmarkdown, testthat (≥ 3.0.0)
Published: 2026-01-17
DOI: 10.32614/CRAN.package.RFAE (may not be active yet)
Author: Binh Duc Vu ORCID iD [aut, cre], Jan Kapar ORCID iD [aut], Marvin N. Wright ORCID iD [aut], David S. Watson ORCID iD [aut]
Maintainer: Binh Duc Vu <vuducbinh2210 at gmail.com>
BugReports: https://github.com/bips-hb/RFAE/issues
License: GPL (≥ 3)
URL: https://github.com/bips-hb/RFAE
NeedsCompilation: no
Materials: README
CRAN checks: RFAE results

Documentation:

Reference manual: RFAE.html , RFAE.pdf
Vignettes: rfae (source, R code)

Downloads:

Package source: RFAE_0.1.0.tar.gz
Windows binaries: r-devel: not available, r-release: not available, r-oldrel: not available
macOS binaries: r-release (arm64): not available, r-oldrel (arm64): not available, r-release (x86_64): not available, r-oldrel (x86_64): not available

Linking:

Please use the canonical form https://CRAN.R-project.org/package=RFAE to link to this page.