R/`hal9001`

The Scalable Highly Adaptive Lasso

Authors: Jeremy Coyle, Nima Hejazi, and Mark van der Laan

What’s `hal9001`?

hal9001 is an R package providing an implementation of the scalable highly adaptive lasso (HAL), a nonparametric regression estimator that applies L1-regularized lasso regression to a design matrix composed of indicator functions corresponding to the support of the functional over a set of covariates and interactions thereof. HAL regression allows for arbitrarily complex functional forms to be estimated at fast (near-parametric) convergence rates under only global smoothness assumptions (van der Laan 2017a; Bibaut and van der Laan 2019). For detailed theoretical discussions of the highly adaptive lasso estimator, consider consulting, for example, van der Laan (2017a), van der Laan (2017b), and van der Laan and Bibaut (2017). For a computational demonstration of the versatility of HAL regression, see Benkeser and van der Laan (2016). Recent theoretical works have demonstrated success in building efficient estimators of complex parameters when particular variations of HAL regression are used to estimate nuisance parameters (e.g., van der Laan, Benkeser, and Cai 2019; Ertefaie, Hejazi, and van der Laan 2020).

Installation

For standard use, we recommend installing the package from CRAN via

install.packages("hal9001")

To contribute, install the development version of hal9001 from GitHub via remotes:

remotes::install_github("tlverse/hal9001")

Issues

If you encounter any bugs or have any specific feature requests, please file an issue.

Example

Consider the following minimal example in using hal9001 to generate predictions via Highly Adaptive Lasso regression:

# load the package and set a seed
library(hal9001)
#> Loading required package: Rcpp
#> hal9001 v0.2.6: The Scalable Highly Adaptive Lasso
set.seed(385971)

# simulate data
n <- 100
p <- 3
x <- matrix(rnorm(n * p), n, p)
y <- x[, 1] * sin(x[, 2]) + rnorm(n, mean = 0, sd = 0.2)

# fit the HAL regression
hal_fit <- fit_hal(X = x, Y = y)
#> [1] "I'm sorry, Dave. I'm afraid I can't do that."
hal_fit$times
#>                   user.self sys.self elapsed user.child sys.child
#> enumerate_basis       0.001     0.00   0.001          0         0
#> design_matrix         0.001     0.00   0.002          0         0
#> remove_duplicates     0.005     0.00   0.004          0         0
#> reduce_basis          0.000     0.00   0.000          0         0
#> lasso                 0.313     0.02   0.333          0         0
#> total                 0.320     0.02   0.340          0         0

# training sample prediction
preds <- predict(hal_fit, new_data = x)
mean(hal_mse <- (preds - y)^2)
#> [1] 0.006991539

Contributions

Contributions are very welcome. Interested contributors should consult our contribution guidelines prior to submitting a pull request.

Citation

After using the hal9001 R package, please cite the following:

    @manual{coyle2020hal9001,
      author = {Coyle, Jeremy R and Hejazi, Nima S and {van der Laan}, Mark
        J},
      title = {{hal9001}: The scalable highly adaptive lasso},
      year  = {2020},
      howpublished = {\url{https://github.com/tlverse/hal9001}},
      note = {{R} package version 0.2.6},
      url = {https://doi.org/10.5281/zenodo.3558313},
      doi = {10.5281/zenodo.3558313}
    }

License

The contents of this repository are distributed under the GPL-3 license. See file LICENSE for details.

References

Benkeser, David, and Mark J van der Laan. 2016. “The Highly Adaptive Lasso Estimator.” In 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA). IEEE. https://doi.org/10.1109/dsaa.2016.93.

Bibaut, Aurélien F, and Mark J van der Laan. 2019. “Fast Rates for Empirical Risk Minimization over Càdlàg Functions with Bounded Sectional Variation Norm.” arXiv Preprint arXiv:1907.09244.

Ertefaie, Ashkan, Nima S Hejazi, and Mark J van der Laan. 2020. “Nonparametric Inverse Probability Weighted Estimators Based on the Highly Adaptive Lasso.” http://arxiv.org/abs/2005.11303.

van der Laan, Mark J. 2017a. “A Generally Efficient Targeted Minimum Loss Based Estimator Based on the Highly Adaptive Lasso.” The International Journal of Biostatistics. De Gruyter. https://doi.org/10.1515/ijb-2015-0097.

———. 2017b. “Finite Sample Inference for Targeted Learning.” ArXiv E-Prints.

van der Laan, Mark J, David Benkeser, and Weixin Cai. 2019. “Efficient Estimation of Pathwise Differentiable Target Parameters with the Undersmoothed Highly Adaptive Lasso.” arXiv Preprint arXiv:1908.05607.

van der Laan, Mark J, and Aurélien F Bibaut. 2017. “Uniform Consistency of the Highly Adaptive Lasso Estimator of Infinite-Dimensional Parameters.” arXiv Preprint arXiv:1709.06256.

Name		Name	Last commit message	Last commit date
Latest commit History 258 Commits
R		R
data		data
docs		docs
inst		inst
man		man
paper		paper
sandbox		sandbox
src		src
tests		tests
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
Makefile		Makefile
NAMESPACE		NAMESPACE
NEWS.md		NEWS.md
README.Rmd		README.Rmd
README.md		README.md
_pkgdown.yml		_pkgdown.yml
appveyor.yml		appveyor.yml
codecov.yml		codecov.yml
cran-comments.md		cran-comments.md
hal9001.Rproj		hal9001.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

R/`hal9001`

What’s `hal9001`?

Installation

Issues

Example

Contributions

Citation

License

References

About

Releases

Packages

Languages

License

nedamhd/hal9001

Folders and files

Latest commit

History

Repository files navigation

R/hal9001

What’s hal9001?

Installation

Issues

Example

Contributions

Citation

License

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

R/`hal9001`

What’s `hal9001`?

Packages