谷歌浏览器插件
订阅小程序
在清言上使用

ranger: A Fast Implementation of Random Forests for High Dimensional Data in C plus plus and R

JOURNAL OF STATISTICAL SOFTWARE(2017)

引用 1416|浏览26
暂无评分
摘要
We introduce the C++ application and R package ranger. The software is a fast implementation of random forests for high dimensional data. Ensembles of classification, regression and survival trees are supported. We describe the implementation, provide examples, validate the package with a reference implementation, and compare runtime and memory usage with other implementations. The new software proves to scale best with the number of features, samples, trees, and features tried for splitting. Finally, we show that ranger is the fastest and most memory efficient implementation of random forests to analyze data on the scale of a genome-wide association study.
更多
查看译文
关键词
C plus,classification,machine learning,R,random forests,Rcpp,recursive partitioning,survival analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要