关键词:GPU;随机森林;BIDMachRF;算法优化
摘 要:We have designed BIDMachRF – an implementation of Random Forest with high CPU and GPU throughput and with full scalability. This paper describes the current state of our implementation as well as points for improvement, which we have identified through benchmarks on classical datasets. Our current in progress version has already shown to be 5x faster than implementations such as SciKit-Learn on large sized GBs of data and is estimated to be at least 20x faster than those implementations when complete.